CN107749302A - Audio processing method, device, storage medium and terminal

Publication number
CN107749302A
Authority
CN
China
Prior art keywords
spectrum
target
frame
phase
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711020805.2A
Other languages
Chinese (zh)
Inventor
肖纯智
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Kugou Computer Technology Co Ltd
Original Assignee
Guangzhou Kugou Computer Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Kugou Computer Technology Co Ltd
Priority to CN201711020805.2A
Publication of CN107749302A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022 Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4394 Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4398 Processing of audio elementary streams involving reformatting operations of audio signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Stereophonic System (AREA)

Abstract

The invention discloses an audio processing method, device, storage medium and terminal, and belongs to the field of multimedia technology. The method includes: converting audio to be processed into a first short-time spectrum signal; decomposing the first short-time spectrum signal into an original amplitude spectrum and an original phase spectrum; resampling the original amplitude spectrum according to a target speed-change multiple to generate a target amplitude spectrum; reconstructing the original phase spectrum to obtain a target phase spectrum; and generating, from the target amplitude spectrum and the target phase spectrum, target audio that plays at the target speed-change multiple. After the audio signal is converted into a short-time spectrum signal, the invention decomposes it into an amplitude spectrum and a phase spectrum, resamples the amplitude spectrum according to the target speed-change multiple, and reconstructs the phase spectrum to ensure phase continuity. The target audio synthesized after this processing achieves speed change without pitch change at large multiples while retaining high quality.

Description

Audio processing method, device, storage medium and terminal
Technical Field
The present invention relates to the field of multimedia technology, and in particular to an audio processing method, device, storage medium and terminal.
Background
Speed change without pitch change means that, for audio of a given duration, the playback speed is changed while the pitch and the semantic content stay the same. For example, the audio is sped up or slowed down so that the processed audio plays faster or slower than the original audio.
In the related art, speed change without pitch change is usually implemented with the WSOLA (Waveform Similarity and Overlap Add) algorithm. WSOLA is a time-domain speed-change algorithm; for example, the classic soundtouch speed-change mode is implemented based on WSOLA.
In the course of implementing the present invention, the inventor found that the prior art has at least the following problem:
The WSOLA algorithm is not suited to large speed-change multiples: when audio is processed at a speed change of several times, audio quality drops sharply. For example, at a 3x speed-up the waveform overlaps itself repeatedly, which blurs the sound; at a 3x slow-down the fundamental is badly broken up, which degrades the sound and makes it tremble.
Summary of the Invention
To solve the problems of the prior art, embodiments of the present invention provide an audio processing method, device, storage medium and terminal. The technical solution is as follows:
In a first aspect, an audio processing method is provided, the method including:
converting audio to be processed into a first short-time spectrum signal;
decomposing the first short-time spectrum signal into an original amplitude spectrum and an original phase spectrum;
resampling the original amplitude spectrum according to a target speed-change multiple to generate a target amplitude spectrum;
reconstructing the original phase spectrum to obtain a target phase spectrum;
generating, according to the target amplitude spectrum and the target phase spectrum, target audio that can be played at the target speed-change multiple.
In another embodiment, converting the audio to be processed into the first short-time spectrum signal includes:
framing the audio to be processed to obtain a framed audio signal;
windowing the framed audio signal and applying a short-time Fourier transform to the audio signal inside the window to obtain the first short-time spectrum signal.
In another embodiment, resampling the original amplitude spectrum according to the target speed-change multiple to generate the target amplitude spectrum includes:
resampling the original amplitude spectrum frame by frame according to the target speed-change multiple to generate the target amplitude spectrum.
In another embodiment, resampling the original amplitude spectrum frame by frame according to the target speed-change multiple includes:
determining the number of frames in the target amplitude spectrum according to the number of frames in the original amplitude spectrum and the target speed-change multiple;
frame by frame, for each frequency point of any frame, determining the amplitude of that frequency point according to the amplitudes of the corresponding frequency point in a first target frame and a second target frame of the original amplitude spectrum, to obtain the amplitude spectrum of that frame;
wherein the first target frame is the frame immediately before the resampling position in the original amplitude spectrum, and the second target frame is the frame immediately after the resampling position in the original amplitude spectrum.
In another embodiment, reconstructing the original phase spectrum to obtain the target phase spectrum includes:
determining the number of frames in the target phase spectrum according to the number of frames in the original phase spectrum and the target speed-change multiple;
for the first frame of the target phase spectrum, using the phase spectrum of the first frame of the original phase spectrum as the phase spectrum of the first frame of the target phase spectrum;
for the m-th frame of the target phase spectrum, using the sum of the phase spectrum of the (m-1)-th frame of the target phase spectrum and a target phase increment as the phase spectrum of the m-th frame;
wherein m is greater than 1, and the target phase increment is the phase difference between the frame before and the frame after the resampling position of the original amplitude spectrum.
In another embodiment, generating, according to the target amplitude spectrum and the target phase spectrum, the target audio that can be played at the target speed-change multiple includes:
combining the target amplitude spectrum and the target phase spectrum to obtain a second short-time spectrum signal;
applying an inverse short-time Fourier transform to the second short-time spectrum signal to obtain an intermediate audio signal;
windowing and overlap-adding the intermediate audio signal to obtain the target audio that can be played at the target speed-change multiple.
In a second aspect, an audio processing device is provided, the device including:
a conversion module, configured to convert audio to be processed into a first short-time spectrum signal;
a decomposition module, configured to decompose the first short-time spectrum signal into an original amplitude spectrum and an original phase spectrum;
a first processing module, configured to resample the original amplitude spectrum according to a target speed-change multiple to generate a target amplitude spectrum;
a second processing module, configured to reconstruct the original phase spectrum to obtain a target phase spectrum;
a generation module, configured to generate, according to the target amplitude spectrum and the target phase spectrum, target audio that can be played at the target speed-change multiple.
In another embodiment, the conversion module is configured to frame the audio to be processed to obtain a framed audio signal, window the framed audio signal, and apply a short-time Fourier transform to the audio signal inside the window to obtain the first short-time spectrum signal.
In another embodiment, the first processing module is configured to resample the original amplitude spectrum frame by frame according to the target speed-change multiple to generate the target amplitude spectrum.
In another embodiment, the first processing module is configured to determine the number of frames in the target amplitude spectrum according to the number of frames in the original amplitude spectrum and the target speed-change multiple; and, frame by frame, for each frequency point of any frame, to determine the amplitude of that frequency point according to the amplitudes of the corresponding frequency point in a first target frame and a second target frame of the original amplitude spectrum, to obtain the amplitude spectrum of that frame;
wherein the first target frame is the frame immediately before the resampling position in the original amplitude spectrum, and the second target frame is the frame immediately after the resampling position in the original amplitude spectrum.
In another embodiment, the second processing module is configured to determine the number of frames in the target phase spectrum according to the number of frames in the original phase spectrum and the target speed-change multiple; for the first frame of the target phase spectrum, to use the phase spectrum of the first frame of the original phase spectrum as the phase spectrum of the first frame of the target phase spectrum; and for the m-th frame of the target phase spectrum, to use the sum of the phase spectrum of the (m-1)-th frame of the target phase spectrum and a target phase increment as the phase spectrum of the m-th frame;
wherein m is greater than 1, and the target phase increment is the phase difference between the frame before and the frame after the resampling position of the original amplitude spectrum.
In another embodiment, the generation module is configured to combine the target amplitude spectrum and the target phase spectrum to obtain a second short-time spectrum signal; apply an inverse short-time Fourier transform to the second short-time spectrum signal to obtain an intermediate audio signal; and window and overlap-add the intermediate audio signal to obtain the target audio that can be played at the target speed-change multiple.
In a third aspect, a storage medium is provided. The storage medium stores at least one instruction, at least one program, a code set or an instruction set, which is loaded and executed by a processor to implement the audio processing method of the first aspect.
In a fourth aspect, a terminal for audio processing is provided. The terminal includes a processor and a memory; the memory stores at least one instruction, at least one program, a code set or an instruction set, which is loaded and executed by the processor to implement the audio processing method of the first aspect.
The technical solution provided by the embodiments of the present invention has the following beneficial effect:
After the audio signal is converted into a short-time spectrum signal, it is decomposed into an amplitude spectrum and a phase spectrum; the amplitude spectrum is resampled according to the target speed-change multiple, and the phase spectrum is reconstructed to ensure phase continuity. The target audio synthesized after this processing achieves speed change without pitch change at large multiples while retaining high quality.
Brief Description of the Drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of an audio processing method provided by an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of an audio processing device provided by an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a terminal for audio processing provided by an embodiment of the present invention.
Detailed Description
To make the objectives, technical solutions and advantages of the present invention clearer, the embodiments of the present invention are described in further detail below with reference to the drawings.
Before the embodiments of the present invention are explained in detail, some terms used in the embodiments are explained first.
Spectrum signal: applying a Fourier transform to a time-domain signal yields the spectrum signal of that time-domain signal. The spectrum signal consists of two parts, the amplitude spectrum and the phase spectrum.
Amplitude spectrum: in the frequency-domain description of a signal, the function that takes frequency as the independent variable and the amplitude of each frequency component of the signal as the dependent variable is called the amplitude spectrum. It characterizes how the amplitude of the signal is distributed over frequency.
Phase spectrum: the curve of phase as a function of frequency. It represents the phase that each frequency component has at the time origin.
Resampling: broadly, resampling is the process of interpolating the information of one kind of pixel from the information of another kind of pixel. In the embodiments of the present invention, resampling is applied to the amplitude spectrum.
The audio processing method provided by the embodiments of the present invention mainly implements speed change without pitch change for audio. The method can be applied during playback of audio or video, for example to play currently playing music at a different speed, or to change the speaking rate of a person in a currently playing video; the embodiments of the present invention do not specifically limit this. Put another way, the scheme can be applied in an audio player or a video player: once a user has installed on a terminal an audio or video player with the above speed-adjustment function, speed change without pitch change can be performed while the terminal plays audio or video.
It should also be noted that the speed change without pitch change provided by the embodiments of the present invention applies not only to speech audio but also to non-speech audio, for example accompaniment; the embodiments of the present invention do not specifically limit this.
The embodiments of the present invention resample the amplitude spectrum frame by frame within the short-time spectrum and ensure phase continuity by reconstructing the phase spectrum, thereby supporting speed change without pitch change at large multiples while preserving audio quality. See the following embodiments for a detailed explanation.
Fig. 1 is a flowchart of an audio processing method provided by an embodiment of the present invention. The method is executed by a terminal. Referring to Fig. 1, the method flow provided by the embodiment of the present invention includes:
101. Convert the audio to be processed into a first short-time spectrum signal, and decompose the first short-time spectrum signal into an original amplitude spectrum and an original phase spectrum.
In the embodiments of the present invention, the audio to be processed is first framed to obtain a framed audio signal.
The audio is framed for the following reason: a speech signal is essentially non-stationary, and its non-stationarity is caused by the physical motion of the vocal organs. Because that motion has inertia, the signal can be regarded as stationary only over a short time, so framing ensures that the time-domain signal is stationary when the Fourier transform is applied.
In the embodiments of the present invention, the duration of one frame in the time domain may be 5 ms, 10 ms, and so on; the embodiments of the present invention do not specifically limit this. Taking a frame length of 10 ms as an example, 0~10ms may be one frame and 5ms~10ms may be the next frame.
After framing, spectrum analysis is performed on the framed audio signal. Specifically, the framed audio signal is windowed and a short-time Fourier transform is applied to the audio signal inside the window, which yields the short-time spectrum signal.
In another embodiment, because the signal is audio, a Hamming window is typically used for windowing. In the embodiments of the present invention, one window contains one frame. As the window moves along and the short-time Fourier transform is applied to the audio signal inside it, spectrum analysis of the time-domain audio to be processed is completed step by step, so that the audio to be processed is converted into the first short-time spectrum signal. The short-time Fourier transform is a mathematical transform related to the Fourier transform; it is used to determine the frequency and phase of the sinusoidal components in a local section of a time-varying signal.
In another embodiment, after the first short-time spectrum signal is obtained, it is decomposed into an amplitude spectrum and a phase spectrum. To distinguish them from the amplitude spectrum and phase spectrum that appear later, they are referred to here as the original amplitude spectrum and the original phase spectrum.
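As an illustration of step 101, the following Python sketch frames a signal, applies a Hamming window, transforms each frame, and splits the result into amplitude and phase. It is a minimal sketch under stated assumptions, not the patented implementation: the function names, the naive DFT (a practical system would use an FFT library), the frame length and the hop size are all chosen for the example and do not come from the patent.

```python
import cmath
import math


def hamming(n):
    """Symmetric Hamming window of length n (assumes n > 1)."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def dft(frame):
    """Naive O(n^2) discrete Fourier transform, used only to stay dependency-free."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def stft_decompose(samples, frame_len, hop):
    """Frame, window and transform the samples.

    Returns (amplitudes, phases), each indexed as [frame][frequency point].
    frame_len and hop are illustrative parameters, not values from the patent.
    """
    window = hamming(frame_len)
    amplitudes, phases = [], []
    for start in range(0, len(samples) - frame_len + 1, hop):
        windowed = [samples[start + i] * window[i] for i in range(frame_len)]
        spectrum = dft(windowed)
        amplitudes.append([abs(c) for c in spectrum])         # original amplitude spectrum
        phases.append([cmath.phase(c) for c in spectrum])     # original phase spectrum
    return amplitudes, phases


# Hypothetical input: a sinusoid with exactly 2 cycles per 16-sample frame.
signal = [math.sin(2 * math.pi * 2 * t / 16) for t in range(64)]
amp, ph = stft_decompose(signal, frame_len=16, hop=8)
```

With these example parameters the signal yields 7 overlapping frames of 16 frequency points each, and the largest amplitude in the lower half of each frame's spectrum falls at frequency point 2, matching the sinusoid.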
102. Resample the original amplitude spectrum according to the target speed-change multiple to generate the target amplitude spectrum.
Step 102 resamples the original amplitude spectrum. In the embodiments of the present invention, the original amplitude spectrum is resampled frame by frame according to the target speed-change multiple to generate the target amplitude spectrum. The detailed steps are as follows.
First, the number of frames in the target amplitude spectrum is determined from the number of frames in the original amplitude spectrum and the target speed-change multiple.
The target speed-change multiple states how many times the original speed the target speed is. Denoting the target speed-change multiple by α, if the original amplitude spectrum has N frames, the target amplitude spectrum has N/α frames.
Next, the amplitude spectrum is resampled frame by frame. If α is greater than 1, down-sampling is performed; if α is less than 1, up-sampling is performed.
For one frame, the amplitude spectrum obtained after the short-time Fourier transform gives the amplitude distribution of the frequency components, that is, the amplitude of the signal in that frame at each frequency. Because the original amplitude spectrum has to be interpolated or decimated, the amplitude spectrum at a resampling position is obtained from the amplitude spectra of the frame before and the frame after that position. The previous frame and the next frame are the two frames adjacent to the resampling position.
Each resampling position corresponds to one frame of the target amplitude spectrum. The amplitudes of a frame at the different frequencies, that is, the amplitude spectrum of the frame, can be obtained as follows: for each frequency point, the amplitude is determined from the amplitude of the corresponding frequency point in the previous frame and the amplitude of the corresponding frequency point in the next frame, for example by weighting the two amplitudes; the embodiments of the present invention do not specifically limit this.
In summary, when the amplitude spectrum is resampled frame by frame, for each frequency point of any frame of the target amplitude spectrum, the amplitude of that frequency point is determined from the amplitudes of the corresponding frequency point in the first target frame and the second target frame of the original amplitude spectrum, which yields the amplitude spectrum of that frame. The first target frame is the frame immediately before the resampling position in the original amplitude spectrum, and the second target frame is the frame immediately after it.
Suppose α is 2.5 and the original amplitude spectrum has 100 frames; the target amplitude spectrum then has 100/2.5 = 40 frames. If the first frame of the original amplitude spectrum corresponds to the first frame of the target amplitude spectrum, the next resampling position is at frame 3.5. The original amplitude spectrum contains no frame 3.5, so the second frame of the target amplitude spectrum is generated from the 3rd and 4th frames of the original amplitude spectrum.
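The worked example above (α = 2.5, 100 frames) can be sketched in Python as follows. The patent leaves the weighting open, so this sketch assumes linear interpolation between the previous and next frames; it also assumes 0-based frame indices, resampling positions at m·α, and truncation of N/α to an integer. The name `resample_amplitude` is hypothetical.

```python
import math


def resample_amplitude(orig_amp, alpha):
    """Resample an amplitude spectrum [frame][frequency point] along the frame axis.

    Assumption: linear interpolation between the frame before and the frame
    after each resampling position (the patent only says "weighted").
    """
    n_orig = len(orig_amp)
    n_target = int(n_orig / alpha)               # N / alpha frames, truncated
    target = []
    for m in range(n_target):
        pos = m * alpha                          # 0-based resampling position
        prev_idx = min(int(math.floor(pos)), n_orig - 1)   # first target frame
        next_idx = min(prev_idx + 1, n_orig - 1)           # second target frame
        w = pos - math.floor(pos)                # weight given to the next frame
        target.append([
            (1 - w) * a + w * b
            for a, b in zip(orig_amp[prev_idx], orig_amp[next_idx])
        ])
    return target


# Each original frame holds one frequency point whose value is its 1-based frame number,
# so interpolated values can be read directly as "frame positions".
original = [[float(i + 1)] for i in range(100)]
resampled = resample_amplitude(original, alpha=2.5)
```

Here the 0-based position 2.5 corresponds to "frame 3.5" in the 1-based numbering used in the text, so the second target frame is the midpoint of the 3rd and 4th original frames.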
103. Reconstruct the original phase spectrum to obtain the target phase spectrum.
For this step, the number of frames in the target phase spectrum is first determined from the number of frames in the original phase spectrum and the target speed-change multiple. The number of frames in the target phase spectrum equals the number of frames in the target amplitude spectrum.
In the embodiments of the present invention, the original phase spectrum is also reconstructed to ensure phase continuity. The reconstruction can be done as follows:
a. For the first frame of the target phase spectrum, the phase spectrum of the first frame of the original phase spectrum is used as the phase spectrum of the first frame of the target phase spectrum.
b. For the m-th frame of the target phase spectrum, the sum of the phase spectrum of the (m-1)-th frame of the target phase spectrum and the target phase increment is used as the phase spectrum of the m-th frame.
That is, except for the first frame, the phase spectrum of each subsequent frame depends on the previous frame.
Here m is greater than 1, and the target phase increment is the phase difference between the frame before and the frame after the resampling position of the original amplitude spectrum. Continuing the example above, because the resampling position is at frame 3.5, the phase spectrum of the second frame of the target phase spectrum is obtained from the phase spectrum of the first frame plus the phase difference between the 3rd and 4th frames, the two frames adjacent to frame 3.5.
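Steps a and b can be sketched in Python as follows. This is an illustrative sketch only: it assumes 0-based frame indices, resampling positions at m·α, and a per-frequency-point phase increment taken as (next frame minus previous frame); the name `reconstruct_phase` is hypothetical. The accumulated phase is not wrapped to a principal value, on the assumption that only e^{jφ} is used later, so multiples of 2π make no difference.

```python
def reconstruct_phase(orig_phase, alpha):
    """Rebuild a phase spectrum [frame][frequency point] by accumulating increments.

    Frame 0 copies the original first frame; frame m adds to frame m-1 the phase
    difference between the original frames on either side of position m * alpha.
    """
    n_orig = len(orig_phase)
    n_target = int(n_orig / alpha)               # same frame count as the amplitude spectrum
    if n_target == 0:
        return []
    target = [list(orig_phase[0])]               # step a: first frame copied as-is
    for m in range(1, n_target):
        pos = m * alpha                          # 0-based resampling position
        prev_idx = min(int(pos), n_orig - 1)
        next_idx = min(prev_idx + 1, n_orig - 1)
        increment = [b - a for a, b in zip(orig_phase[prev_idx], orig_phase[next_idx])]
        # step b: previous target phase plus the target phase increment
        target.append([p + d for p, d in zip(target[m - 1], increment)])
    return target


# Hypothetical input: one frequency point whose phase advances 0.1 rad per original frame.
original_phase = [[0.1 * i] for i in range(100)]
rebuilt = reconstruct_phase(original_phase, alpha=2.5)
```

In this example the rebuilt phase still advances by about 0.1 rad per output frame even though only 40 of the 100 frame positions are visited. Keeping the per-frame phase advance unchanged is what lets the frequency content, and therefore the pitch, stay the same while the duration changes.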
104. Generate, according to the target amplitude spectrum and the target phase spectrum, the target audio that can be played at the target speed-change multiple.
After the target amplitude spectrum and the target phase spectrum are obtained in steps 102 and 103, the target audio that can be played at the target speed-change multiple is generated by reversing the process of step 101. The specific operations are as follows:
First, the target amplitude spectrum and the target phase spectrum are combined to obtain a second short-time spectrum signal. Then an inverse short-time Fourier transform is applied to the second short-time spectrum signal to obtain an intermediate audio signal. Finally, the intermediate audio signal is windowed again and overlap-added, which yields the target audio that can be played at the target speed-change multiple.
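The synthesis in step 104 can be sketched in Python as follows. This is a minimal sketch under stated assumptions rather than the patented implementation: it uses a naive inverse DFT in place of an FFT library, reuses a Hamming window for the second windowing, keeps only the real part of each inverse transform (assuming a conjugate-symmetric spectrum), requires at least one frame, and applies no window-sum normalization. The function names and the hop size are hypothetical.

```python
import cmath
import math


def hamming(n):
    """Symmetric Hamming window of length n (assumes n > 1)."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def inverse_dft(spectrum):
    """Naive inverse DFT; the real part is kept, assuming conjugate symmetry."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def synthesize(target_amp, target_phase, hop):
    """Combine amplitude and phase, inverse-transform, window and overlap-add."""
    frame_len = len(target_amp[0])               # assumes at least one frame
    window = hamming(frame_len)
    out = [0.0] * (hop * (len(target_amp) - 1) + frame_len)
    for m, (amps, phases) in enumerate(zip(target_amp, target_phase)):
        # Second short-time spectrum signal: amplitude * e^{j * phase}
        spectrum = [cmath.rect(a, p) for a, p in zip(amps, phases)]
        frame = inverse_dft(spectrum)            # intermediate audio signal
        for i in range(frame_len):
            out[m * hop + i] += frame[i] * window[i]   # window, then overlap-add
    return out


# Hypothetical input: three frames containing only a DC component of amplitude 8,
# so each inverse-transformed frame is a constant 1.0 before windowing.
dc_amp = [[8.0] + [0.0] * 7 for _ in range(3)]
dc_phase = [[0.0] * 8 for _ in range(3)]
audio = synthesize(dc_amp, dc_phase, hop=4)
```

Because every frame is constant before windowing, the output in this example is simply the sum of the shifted windows, which makes the overlap-add step easy to inspect.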
In the method provided by the embodiments of the present invention, after the audio signal is converted into a short-time spectrum signal, it is decomposed into an amplitude spectrum and a phase spectrum; the amplitude spectrum is resampled according to the target speed-change multiple, and the phase spectrum is reconstructed to ensure phase continuity. The target audio synthesized after this processing achieves speed change without pitch change at large multiples while retaining high quality.
Fig. 2 is provided in an embodiment of the present invention to provide a kind of structural representation of apparatus for processing audio., should referring to Fig. 2 Device includes:
Conversion module 201, configured to convert audio to be processed into a first short-term spectrum signal;
Decomposition module 202, configured to decompose the first short-term spectrum signal into an original amplitude spectrum and an original phase spectrum;
First processing module 203, configured to resample the original amplitude spectrum according to a target speed-change multiple to generate a target amplitude spectrum;
Second processing module 204, configured to reconstruct the original phase spectrum to obtain a target phase spectrum;
Generation module 205, configured to generate, according to the target amplitude spectrum and the target phase spectrum, target audio that can be played at the target speed-change multiple.
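As a small illustration of what the decomposition module does, a complex short-term spectrum can be split into its amplitude and phase with two array operations. The function name `decompose` and the array layout (frames by frequency points) are assumptions made for this sketch.

```python
import numpy as np

def decompose(spectrum):
    """Split a complex short-term spectrum into amplitude and phase spectra."""
    amplitude = np.abs(spectrum)   # magnitude of each frequency point
    phase = np.angle(spectrum)     # phase of each frequency point, in radians
    return amplitude, phase
```

Multiplying the amplitude by `exp(1j * phase)` recovers the original complex spectrum, which is why the two parts can be processed separately and recombined later.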
In another embodiment, the conversion module 201 is configured to divide the audio to be processed into frames to obtain a framed audio signal, and to window the framed audio signal and apply a short-time Fourier transform to the audio signal within the window to obtain the first short-term spectrum signal.
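The framing, windowing, and transform steps can be sketched as below. The frame length, hop size, Hann window, and the function name `stft_frames` are assumptions chosen for illustration; the text does not fix any of them. The input is assumed to contain at least one full frame.

```python
import numpy as np

def stft_frames(audio, frame_len=1024, hop=256):
    """Frame the audio, window each frame, and transform it to the frequency domain."""
    n_frames = 1 + (len(audio) - frame_len) // hop
    window = np.hanning(frame_len)  # assumed analysis window
    # Framing: overlapping slices of the input, one row per frame.
    frames = np.stack([audio[i * hop:i * hop + frame_len] for i in range(n_frames)])
    # Windowing followed by a real-input FFT of each frame.
    return np.fft.rfft(frames * window, axis=1)
```

Each row of the result is the short-term spectrum of one frame, with `frame_len // 2 + 1` complex frequency points.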
In another embodiment, the first processing module 203 is configured to resample the original amplitude spectrum frame by frame according to the target speed-change multiple to generate the target amplitude spectrum.
In another embodiment, the first processing module 203 is configured to determine the number of frames in the target amplitude spectrum according to the number of frames in the original amplitude spectrum and the target speed-change multiple; and, frame by frame, for each frequency point of any given frame, to determine the amplitude of that frequency point according to the amplitudes of the corresponding frequency point in a first target frame and a second target frame of the original amplitude spectrum, thereby obtaining the amplitude spectrum of the frame;
wherein the first target frame is the frame immediately before the resampling position in the original amplitude spectrum, and the second target frame is the frame immediately after that resampling position.
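One way to compute each target amplitude from the two neighbouring original frames is linear interpolation, sketched below. Linear weighting is an assumption of this example, as are the function name `resample_magnitude`, the 0-based positions, and the rule that the target frame count is the number of positions `0, rate, 2*rate, ...` that still have a following frame.

```python
import numpy as np

def resample_magnitude(orig_mag, rate):
    """Resample an amplitude spectrum along the frame axis by linear interpolation.

    orig_mag: array of shape (n_frames, n_bins).
    rate: target speed-change multiple (assumed > 0).
    """
    n_frames = orig_mag.shape[0]
    # Resampling positions; their count is the number of target frames.
    positions = np.arange(0, n_frames - 1, rate)
    prev_frame = np.floor(positions).astype(int)    # first target frame
    frac = (positions - prev_frame)[:, None]        # distance past the first target frame
    next_frame = prev_frame + 1                     # second target frame
    # Weight the two neighbouring frames per frequency point.
    return (1.0 - frac) * orig_mag[prev_frame] + frac * orig_mag[next_frame]
```

For five original frames and a multiple of 1.5, the positions are 0, 1.5, and 3.0, so the middle target frame is the average of the second and third original frames.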
In another embodiment, the second processing module 204 is configured to determine the number of frames in the target phase spectrum according to the number of frames in the original phase spectrum and the target speed-change multiple; for the first frame of the target phase spectrum, to use the phase spectrum of the first frame of the original phase spectrum as the phase spectrum of the first frame of the target phase spectrum; and, for the m-th frame of the target phase spectrum, to use the sum of the phase spectrum of the (m-1)-th frame of the target phase spectrum and a target phase increment as the phase spectrum of the m-th frame;
wherein m is greater than 1, and the target phase increment is the phase difference between the frame immediately before and the frame immediately after the resampling position in the original amplitude spectrum.
In another embodiment, the generation module 205 is configured to combine the target amplitude spectrum and the target phase spectrum to obtain a second short-term spectrum signal; to apply an inverse short-time Fourier transform to the second short-term spectrum signal to obtain an intermediate processed audio signal; and to window and overlap-add the intermediate processed audio signal to obtain the target audio that can be played at the target speed-change multiple.
In the apparatus provided by the embodiment of the present invention, after the audio signal is converted into a short-term spectrum signal, the short-term spectrum signal is further decomposed into an amplitude spectrum and a phase spectrum. The amplitude spectrum is resampled according to the target speed-change multiple, and the phase spectrum is reconstructed to ensure phase continuity. The target audio synthesized after this processing can therefore be played at multiple speeds without a change in pitch, and it also has high quality.
It should be noted that when the audio processing apparatus provided by the above embodiment processes audio, the division into the functional modules described above is only an example. In practical applications, the functions may be assigned to different functional modules as needed; that is, the internal structure of the apparatus may be divided into different functional modules to perform all or part of the functions described above. In addition, the audio processing apparatus provided by the above embodiment and the audio processing method embodiments belong to the same concept; for its specific implementation, refer to the method embodiments, which are not repeated here.
Fig. 3 is a structural schematic diagram of a terminal provided by an embodiment of the present invention. The terminal can be used to perform the audio processing method provided in the above embodiments. Referring to Fig. 3, the terminal 300 includes:
An RF (Radio Frequency) circuit 110, a memory 120 including one or more computer-readable storage media, an input unit 130, a display unit 140, a sensor 150, an audio circuit 160, a WiFi (Wireless Fidelity) module 170, a processor 180 including one or more processing cores, a power supply 190, and other components. Those skilled in the art will understand that the terminal structure shown in Fig. 3 does not limit the terminal, which may include more or fewer components than illustrated, combine certain components, or use a different arrangement of components. Specifically:
The RF circuit 110 can be used to receive and send signals while sending and receiving information or during a call. In particular, after receiving downlink information from a base station, it passes the information to one or more processors 180 for processing; in addition, it sends uplink data to the base station. Generally, the RF circuit 110 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a subscriber identity module (SIM) card, a transceiver, a coupler, an LNA (Low Noise Amplifier), a duplexer, and the like. The RF circuit 110 can also communicate with networks and other devices through wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System for Mobile Communications), GPRS (General Packet Radio Service), CDMA (Code Division Multiple Access), WCDMA (Wideband Code Division Multiple Access), LTE (Long Term Evolution), e-mail, and SMS (Short Messaging Service).
The memory 120 can be used to store software programs and modules. The processor 180 executes various functional applications and data processing by running the software programs and modules stored in the memory 120. The memory 120 may mainly include a program storage area and a data storage area. The program storage area can store an operating system, an application program required by at least one function (such as a sound playback function or an image playback function), and the like; the data storage area can store data created through use of the terminal 300 (such as audio data or a phone book). In addition, the memory 120 may include high-speed random access memory and may also include non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or another volatile solid-state storage device. Accordingly, the memory 120 may also include a memory controller to provide the processor 180 and the input unit 130 with access to the memory 120.
The input unit 130 can be used to receive input numeric or character information and to generate keyboard, mouse, joystick, optical, or trackball signal input related to user settings and function control. Specifically, the input unit 130 may include a touch-sensitive surface 131 and other input devices 132. The touch-sensitive surface 131, also called a touch display screen or touchpad, collects touch operations by the user on or near it (for example, operations performed by the user on or near the touch-sensitive surface 131 with a finger, a stylus, or any other suitable object or accessory) and drives the corresponding connection device according to a preset program. Optionally, the touch-sensitive surface 131 may include two parts: a touch detection device and a touch controller. The touch detection device detects the position of the user's touch, detects the signal produced by the touch operation, and passes the signal to the touch controller. The touch controller receives the touch information from the touch detection device, converts it into contact coordinates, sends them to the processor 180, and can receive and execute commands sent by the processor 180. The touch-sensitive surface 131 can be implemented using resistive, capacitive, infrared, surface acoustic wave, and other technologies. Besides the touch-sensitive surface 131, the input unit 130 may also include other input devices 132. Specifically, the other input devices 132 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys or switch keys), a trackball, a mouse, a joystick, and the like.
The display unit 140 can be used to display information entered by the user or provided to the user, as well as the various graphical user interfaces of the terminal 300, which may be composed of graphics, text, icons, video, and any combination thereof. The display unit 140 may include a display panel 141, which can optionally be configured as an LCD (Liquid Crystal Display), an OLED (Organic Light-Emitting Diode) display, or the like. Further, the touch-sensitive surface 131 may cover the display panel 141. When the touch-sensitive surface 131 detects a touch operation on or near it, it passes the operation to the processor 180 to determine the type of touch event, and the processor 180 then provides corresponding visual output on the display panel 141 according to the type of touch event. Although in Fig. 3 the touch-sensitive surface 131 and the display panel 141 are shown as two separate components implementing the input and output functions, in some embodiments the touch-sensitive surface 131 and the display panel 141 can be integrated to implement the input and output functions.
The terminal 300 may also include at least one sensor 150, such as a light sensor, a motion sensor, or another sensor. Specifically, the light sensor may include an ambient light sensor and a proximity sensor: the ambient light sensor can adjust the brightness of the display panel 141 according to the ambient light level, and the proximity sensor can turn off the display panel 141 and/or the backlight when the terminal 300 is moved to the ear. As one kind of motion sensor, a gravity acceleration sensor can detect the magnitude of acceleration in all directions (generally three axes) and, when stationary, the magnitude and direction of gravity. It can be used in applications that recognize the phone's attitude (such as switching between landscape and portrait, related games, and magnetometer attitude calibration) and in vibration-recognition functions (such as a pedometer or tap detection). Other sensors that can be fitted to the terminal 300, such as a gyroscope, barometer, hygrometer, thermometer, or infrared sensor, are not described further here.
The audio circuit 160, a loudspeaker 161, and a microphone 162 can provide an audio interface between the user and the terminal 300. The audio circuit 160 can convert received audio data into an electrical signal and transmit it to the loudspeaker 161, which converts it into a sound signal for output. In the other direction, the microphone 162 converts a collected sound signal into an electrical signal, which the audio circuit 160 receives and converts into audio data; after the audio data is output to the processor 180 for processing, it is sent through the RF circuit 110 to, for example, another terminal, or output to the memory 120 for further processing. The audio circuit 160 may also include an earphone jack to provide communication between a peripheral earphone and the terminal 300.
WiFi is a short-range wireless transmission technology. Through the WiFi module 170, the terminal 300 can help the user send and receive e-mail, browse web pages, access streaming video, and so on; it provides the user with wireless broadband Internet access.
The processor 180 is the control center of the terminal 300. It connects the parts of the whole phone through various interfaces and lines, and it performs the various functions of the terminal 300 and processes data by running or executing the software programs and/or modules stored in the memory 120 and calling the data stored in the memory 120, thereby monitoring the phone as a whole. Optionally, the processor 180 may include one or more processing cores. Preferably, the processor 180 may integrate an application processor, which mainly handles the operating system, user interface, application programs, and the like, and a modem processor, which mainly handles wireless communication. It will be understood that the modem processor need not be integrated into the processor 180.
The terminal 300 also includes a power supply 190 (such as a battery) that powers the components. Preferably, the power supply can be logically connected to the processor 180 through a power management system, so that charging, discharging, power-consumption management, and similar functions are handled by the power management system. The power supply 190 may also include any components such as one or more DC or AC power sources, a recharging system, a power failure detection circuit, a power converter or inverter, and a power status indicator.
Although not shown, the terminal 300 may also include a camera, a Bluetooth module, and the like, which are not described further here. Specifically, in this embodiment, the display unit of the terminal is a touch-screen display, and the terminal also includes a memory that stores at least one instruction, at least one program, a code set, or an instruction set. The at least one instruction, the at least one program, the code set, or the instruction set is configured to be executed by one or more processors and contains instructions for performing the audio processing method described above.
Those of ordinary skill in the art will understand that all or part of the steps of the above embodiments can be implemented by hardware, or by a program instructing the relevant hardware. The program can be stored in a computer-readable storage medium, which may be a read-only memory, a magnetic disk, an optical disc, or the like.
The above are only preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent substitution, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (14)

1. An audio processing method, characterized in that the method comprises:
converting audio to be processed into a first short-term spectrum signal;
decomposing the first short-term spectrum signal into an original amplitude spectrum and an original phase spectrum;
resampling the original amplitude spectrum according to a target speed-change multiple to generate a target amplitude spectrum;
reconstructing the original phase spectrum to obtain a target phase spectrum; and
generating, according to the target amplitude spectrum and the target phase spectrum, target audio that can be played at the target speed-change multiple.
2. The method according to claim 1, characterized in that converting the audio to be processed into the first short-term spectrum signal comprises:
dividing the audio to be processed into frames to obtain a framed audio signal; and
windowing the framed audio signal and applying a short-time Fourier transform to the audio signal within the window to obtain the first short-term spectrum signal.
3. The method according to claim 1, characterized in that resampling the original amplitude spectrum according to the target speed-change multiple to generate the target amplitude spectrum comprises:
resampling the original amplitude spectrum frame by frame according to the target speed-change multiple to generate the target amplitude spectrum.
4. The method according to claim 3, characterized in that resampling the original amplitude spectrum frame by frame according to the target speed-change multiple comprises:
determining the number of frames in the target amplitude spectrum according to the number of frames in the original amplitude spectrum and the target speed-change multiple; and
frame by frame, for each frequency point of any given frame, determining the amplitude of that frequency point according to the amplitudes of the corresponding frequency point in a first target frame and a second target frame of the original amplitude spectrum, thereby obtaining the amplitude spectrum of the frame;
wherein the first target frame is the frame immediately before the resampling position in the original amplitude spectrum, and the second target frame is the frame immediately after that resampling position.
5. The method according to claim 1, characterized in that reconstructing the original phase spectrum to obtain the target phase spectrum comprises:
determining the number of frames in the target phase spectrum according to the number of frames in the original phase spectrum and the target speed-change multiple;
for the first frame of the target phase spectrum, using the phase spectrum of the first frame of the original phase spectrum as the phase spectrum of the first frame of the target phase spectrum; and
for the m-th frame of the target phase spectrum, using the sum of the phase spectrum of the (m-1)-th frame of the target phase spectrum and a target phase increment as the phase spectrum of the m-th frame;
wherein m is greater than 1, and the target phase increment is the phase difference between the frame immediately before and the frame immediately after the resampling position in the original amplitude spectrum.
6. The method according to claim 1, characterized in that generating, according to the target amplitude spectrum and the target phase spectrum, the target audio that can be played at the target speed-change multiple comprises:
combining the target amplitude spectrum and the target phase spectrum to obtain a second short-term spectrum signal;
applying an inverse short-time Fourier transform to the second short-term spectrum signal to obtain an intermediate processed audio signal; and
windowing and overlap-adding the intermediate processed audio signal to obtain the target audio that can be played at the target speed-change multiple.
7. An audio processing apparatus, characterized in that the apparatus comprises:
a conversion module, configured to convert audio to be processed into a first short-term spectrum signal;
a decomposition module, configured to decompose the first short-term spectrum signal into an original amplitude spectrum and an original phase spectrum;
a first processing module, configured to resample the original amplitude spectrum according to a target speed-change multiple to generate a target amplitude spectrum;
a second processing module, configured to reconstruct the original phase spectrum to obtain a target phase spectrum; and
a generation module, configured to generate, according to the target amplitude spectrum and the target phase spectrum, target audio that can be played at the target speed-change multiple.
8. The apparatus according to claim 7, characterized in that the conversion module is configured to divide the audio to be processed into frames to obtain a framed audio signal, and to window the framed audio signal and apply a short-time Fourier transform to the audio signal within the window to obtain the first short-term spectrum signal.
9. The apparatus according to claim 7, characterized in that the first processing module is configured to resample the original amplitude spectrum frame by frame according to the target speed-change multiple to generate the target amplitude spectrum.
10. The apparatus according to claim 9, characterized in that the first processing module is configured to determine the number of frames in the target amplitude spectrum according to the number of frames in the original amplitude spectrum and the target speed-change multiple; and, frame by frame, for each frequency point of any given frame, to determine the amplitude of that frequency point according to the amplitudes of the corresponding frequency point in a first target frame and a second target frame of the original amplitude spectrum, thereby obtaining the amplitude spectrum of the frame;
wherein the first target frame is the frame immediately before the resampling position in the original amplitude spectrum, and the second target frame is the frame immediately after that resampling position.
11. The apparatus according to claim 7, characterized in that the second processing module is configured to determine the number of frames in the target phase spectrum according to the number of frames in the original phase spectrum and the target speed-change multiple; for the first frame of the target phase spectrum, to use the phase spectrum of the first frame of the original phase spectrum as the phase spectrum of the first frame of the target phase spectrum; and, for the m-th frame of the target phase spectrum, to use the sum of the phase spectrum of the (m-1)-th frame of the target phase spectrum and a target phase increment as the phase spectrum of the m-th frame;
wherein m is greater than 1, and the target phase increment is the phase difference between the frame immediately before and the frame immediately after the resampling position in the original amplitude spectrum.
12. The apparatus according to claim 7, characterized in that the generation module is configured to combine the target amplitude spectrum and the target phase spectrum to obtain a second short-term spectrum signal; to apply an inverse short-time Fourier transform to the second short-term spectrum signal to obtain an intermediate processed audio signal; and to window and overlap-add the intermediate processed audio signal to obtain the target audio that can be played at the target speed-change multiple.
13. A storage medium, characterized in that the storage medium stores at least one instruction, at least one program, a code set, or an instruction set, and the at least one instruction, the at least one program, the code set, or the instruction set is loaded and executed by a processor to implement the audio processing method according to any one of claims 1 to 6.
14. A terminal for audio processing, characterized in that the terminal comprises a processor and a memory, the memory stores at least one instruction, at least one program, a code set, or an instruction set, and the at least one instruction, the at least one program, the code set, or the instruction set is loaded and executed by the processor to implement the audio processing method according to any one of claims 1 to 6.
CN201711020805.2A 2017-10-27 2017-10-27 Audio-frequency processing method, device, storage medium and terminal Pending CN107749302A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711020805.2A CN107749302A (en) 2017-10-27 2017-10-27 Audio-frequency processing method, device, storage medium and terminal

Publications (1)

Publication Number Publication Date
CN107749302A true CN107749302A (en) 2018-03-02

Family

ID=61252553

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711020805.2A Pending CN107749302A (en) 2017-10-27 2017-10-27 Audio-frequency processing method, device, storage medium and terminal

Country Status (1)

Country Link
CN (1) CN107749302A (en)

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050010397A1 (en) * 2002-11-15 2005-01-13 Atsuhiro Sakurai Phase locking method for frequency domain time scale modification based on a bark-scale spectral partition
CN101290775A (en) * 2008-06-25 2008-10-22 北京中星微电子有限公司 Method for rapidly realizing speed shifting of audio signal
CN101719371A (en) * 2009-11-20 2010-06-02 安凯(广州)微电子技术有限公司 Voice speed changing method
CN102419981A (en) * 2011-11-02 2012-04-18 展讯通信(上海)有限公司 Zooming method and device for time scale and frequency scale of audio signal
CN103632672A (en) * 2012-08-28 2014-03-12 腾讯科技(深圳)有限公司 Voice-changing system, voice-changing method, man-machine interaction system and man-machine interaction method
KR20150009777A (en) * 2013-07-17 2015-01-27 주식회사 더바인코퍼레이션 Phase vocoder

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
ERIC MOULINES, JEAN LAROCHE: "Non-parametric techniques for pitch-scale and time-scale modification of speech", 《SPEECH COMMUNICATION》 *
MICHAEL R. PORTNOFF: "Time-Scale Modification of Speech Based on Short-Time Fourier Analysis", 《IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING》 *
刘耦耕等: "语音信号变速算法及其TMS320C5402实时实现 ", 《中南大学学报(自然科学版)》 *
李浈祯等: "相位调整法实现语音变速的实时处理", 《测控技术》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109102811B (en) * 2018-07-27 2021-03-30 广州酷狗计算机科技有限公司 Audio fingerprint generation method and device and storage medium
CN109102811A (en) * 2018-07-27 2018-12-28 广州酷狗计算机科技有限公司 Generation method, device and the storage medium of audio-frequency fingerprint
US20220020389A1 (en) * 2018-11-28 2022-01-20 Bigo Technology Pte. Ltd. Audio data processing method, apparatus and device, and storage medium
CN109448752A (en) * 2018-11-28 2019-03-08 广州市百果园信息技术有限公司 Processing method, device, equipment and the storage medium of audio data
US11875814B2 (en) * 2018-11-28 2024-01-16 Bigo Technology Pte. Ltd. Audio data processing method, apparatus and device, and storage medium
CN109448752B (en) * 2018-11-28 2021-01-01 广州市百果园信息技术有限公司 Audio data processing method, device, equipment and storage medium
WO2020108555A1 (en) * 2018-11-28 2020-06-04 广州市百果园信息技术有限公司 Audio data processing method, apparatus and device, and storage medium
RU2775660C1 (en) * 2018-11-28 2022-07-06 Биго Текнолоджи Пте. Лтд. Method and device for processing audio data, as well as a data carrier
CN109887515A (en) * 2019-01-29 2019-06-14 北京市商汤科技开发有限公司 Audio-frequency processing method and device, electronic equipment and storage medium
CN109887515B (en) * 2019-01-29 2021-07-09 北京市商汤科技开发有限公司 Audio processing method and device, electronic equipment and storage medium
CN111667805A (en) * 2019-03-05 2020-09-15 腾讯科技(深圳)有限公司 Extraction method, device, equipment and medium of accompaniment music
CN111667805B (en) * 2019-03-05 2023-10-13 腾讯科技(深圳)有限公司 Accompaniment music extraction method, accompaniment music extraction device, accompaniment music extraction equipment and accompaniment music extraction medium
CN112738634A (en) * 2019-10-14 2021-04-30 北京字节跳动网络技术有限公司 Video file generation method, device, terminal and storage medium
CN111653288A (en) * 2020-06-18 2020-09-11 南京大学 Target person voice enhancement method based on conditional variation self-encoder
CN111782865A (en) * 2020-06-23 2020-10-16 腾讯音乐娱乐科技(深圳)有限公司 Audio information processing method and device and storage medium
CN111782865B (en) * 2020-06-23 2024-07-05 腾讯音乐娱乐科技(深圳)有限公司 Audio information processing method, device and storage medium
CN113057613A (en) * 2021-03-12 2021-07-02 歌尔科技有限公司 Heart rate monitoring circuit and method and wearable device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 2018-03-02)