CN110211610A - Method, apparatus, and storage medium for assessing audio signal loss

Info

Publication number: CN110211610A
Application number: CN201910537059.7A
Authority: CN
Inventors: 房树明; 王健宗
Original assignee: Ping An Technology Shenzhen Co Ltd
Current assignee: Ping An Technology Shenzhen Co Ltd
Priority date: 2019-06-20
Filing date: 2019-06-20
Publication date: 2019-09-06
Also published as: WO2020253054A1

Abstract

The present invention relates to the field of speech signal processing and proposes a method, an apparatus, and a storage medium for assessing audio signal loss. The method is applied to an electronic device and comprises: compressing an original audio signal to obtain an audio compression signal corresponding to the original audio signal; decompressing the audio compression signal to obtain an audio decompression signal corresponding to the audio compression signal; obtaining the cosine similarity between the audio decompression signal and the original audio signal; and obtaining, according to the cosine similarity, the grade of the original audio signal loss. By establishing a general evaluation rule for speech signals, the present invention assesses the degree of loss of an audio signal during processing, which facilitates analysis and judgment of the audio signal.

Description

Method, apparatus, and storage medium for assessing audio signal loss

Technical field

The present invention relates to the technical field of speech signal processing, and in particular to a method, an apparatus, and a computer-readable storage medium for assessing audio signal loss.

Background art

Compression and decompression of audio are widely used in signal processing fields such as audio transmission and speech synthesis. Audio compression techniques based on μ-law or a-law are common audio compression algorithms. For example, μ-law (or mu-law) is a standard multimedia digital signal codec (compression/decompression) algorithm for pulse-code modulation. As a companding method, μ-law can improve the signal-to-noise ratio without adding more data.

However, although audio that has been compressed and then decompressed with the μ-law algorithm is highly similar to the original audio, that is, μ-law compression offers sufficiently high signal fidelity, some signal loss is still unavoidable. If the degree of signal loss cannot be clearly determined during signal processing, the analysis and judgment of the signal processing results are affected.

Therefore, how to assess the degree of loss of an audio signal during processing is a problem that urgently needs to be solved.

Summary of the invention

The present invention provides a method, an electronic device, and a computer-readable storage medium for assessing audio signal loss. Its main purpose is to clearly determine, through a grade of the original audio signal loss, the degree of loss of an audio signal during processing, using a scoring rule that is simple, practical, and highly general.

To achieve the above object, the present invention provides a method for assessing audio signal loss, applied to an electronic device, the method comprising:

compressing an original audio signal to obtain an audio compression signal corresponding to the original audio signal;

decompressing the audio compression signal to obtain an audio decompression signal corresponding to the audio compression signal;

obtaining the cosine similarity between the audio decompression signal and the original audio signal;

obtaining, according to the cosine similarity, the grade of the original audio signal loss.

Preferably, the step of compressing the original audio signal to obtain the audio compression signal corresponding to the original audio signal comprises:

compressing the original audio signal according to the μ-law algorithm, the compression formula being:

$$F(x) = \operatorname{sgn}(x)\,\frac{\ln(1 + \mu |x|)}{\ln(1 + \mu)}, \quad -1 \le x \le 1$$

where μ is the companding parameter indicating the degree of compression, x denotes the original audio signal, F(x) denotes the audio compression signal, and sgn is the sign function.
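The compression step can be sketched in Python as follows. This is only an illustration under stated assumptions: it uses the standard μ-law companding curve F(x) = sgn(x)·ln(1 + μ|x|)/ln(1 + μ) on samples already normalized to [-1, 1], and the function name `mu_law_compress` and the sample values are hypothetical.

```python
import math


def mu_law_compress(x: float, mu: float = 255.0) -> float:
    """Return F(x) = sgn(x) * ln(1 + mu*|x|) / ln(1 + mu) for -1 <= x <= 1."""
    if not -1.0 <= x <= 1.0:
        raise ValueError("x must lie in [-1, 1]")
    # copysign supplies sgn(x); for x == 0 the logarithm term is 0 anyway.
    return math.copysign(1.0, x) * math.log1p(mu * abs(x)) / math.log1p(mu)


# Hypothetical normalized samples, chosen only for illustration.
samples = [-1.0, -0.1, 0.0, 0.01, 0.5, 1.0]
compressed = [mu_law_compress(s) for s in samples]
```

The curve keeps the endpoints and the ordering of the samples while stretching small amplitudes, which is why quiet passages survive a later coarse quantization better.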

Preferably, the step of decompressing the audio compression signal to obtain the audio decompression signal corresponding to the audio compression signal comprises:

decompressing the audio compression signal according to the μ-law algorithm, the decompression formula being:

$$F^{-1}(y) = \operatorname{sgn}(y)\,\frac{1}{\mu}\left((1 + \mu)^{|y|} - 1\right), \quad -1 \le y \le 1$$

where μ is the companding parameter indicating the degree of compression, y denotes the audio compression signal, F^{-1}(y) denotes the audio decompression signal, and sgn is the sign function.
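A minimal Python sketch of the decompression formula follows, paired with the standard μ-law compression curve so that the round trip can be checked. The function names and test values are hypothetical, and no quantization is modeled, so the round trip here is exact up to floating-point error.

```python
import math


def mu_law_compress(x: float, mu: float = 255.0) -> float:
    """Standard mu-law curve, assumed as the forward transform."""
    return math.copysign(1.0, x) * math.log1p(mu * abs(x)) / math.log1p(mu)


def mu_law_expand(y: float, mu: float = 255.0) -> float:
    """Return sgn(y) * (1/mu) * ((1 + mu)**|y| - 1) for -1 <= y <= 1."""
    if not -1.0 <= y <= 1.0:
        raise ValueError("y must lie in [-1, 1]")
    # (1 + mu)**|y| - 1 is evaluated as expm1(|y| * ln(1 + mu)) for accuracy near 0.
    return math.copysign(1.0, y) * math.expm1(abs(y) * math.log1p(mu)) / mu


# Hypothetical normalized samples, chosen only for illustration.
original = [-0.75, -0.02, 0.0, 0.3, 1.0]
restored = [mu_law_expand(mu_law_compress(x)) for x in original]
```

In a real codec the loss comes from storing the compressed value with a limited number of bits between the two calls.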

Preferably, the calculation formula for obtaining the cosine similarity between the audio decompression signal and the original audio signal is:

$$\cos\theta = \frac{\sum_{i=1}^{n} A_i B_i}{\sqrt{\sum_{i=1}^{n} A_i^{2}}\,\sqrt{\sum_{i=1}^{n} B_i^{2}}}$$

where A is the original audio signal, B is the audio decompression signal, and n is the number of corresponding signal samples.
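The cosine similarity between two equally long sample sequences can be computed as in the sketch below. It uses the usual definition (dot product divided by the product of the Euclidean norms); the function name and the guard against all-zero signals are additions made for illustration.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Dot product of a and b divided by the product of their Euclidean norms."""
    if len(a) != len(b):
        raise ValueError("signals must have the same number of samples")
    dot = sum(p * q for p, q in zip(a, b))
    norm_a = math.sqrt(sum(p * p for p in a))
    norm_b = math.sqrt(sum(q * q for q in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for an all-zero signal")
    return dot / (norm_a * norm_b)


parallel = cosine_similarity([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
orthogonal = cosine_similarity([1.0, 0.0], [0.0, 1.0])
```

Because the measure depends only on direction, a decompressed signal that is merely scaled relative to the original still yields a similarity of 1.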

Preferably, the calculation formula for obtaining the grade of the original audio signal loss is:

$$S = M\,(1 - \cos\theta)$$

where S is the grade, M is the full-marks grade, and cos θ is the cosine similarity between the audio decompression signal and the original audio signal; the larger the value of S, the smaller the loss of the audio signal.
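The grade formula is a one-line computation; the sketch below evaluates S = M·(1 - cos θ) exactly as written. The function name `loss_grade` and the default M = 5 are illustrative assumptions. Evaluated literally, the expression gives 0 when cos θ = 1 and M when cos θ = 0.

```python
def loss_grade(cos_theta: float, full_marks: float = 5.0) -> float:
    """Evaluate S = M * (1 - cos(theta)) as the formula is written."""
    if not -1.0 <= cos_theta <= 1.0:
        raise ValueError("cos_theta must lie in [-1, 1]")
    return full_marks * (1.0 - cos_theta)


identical = loss_grade(1.0)   # signals point in the same direction
orthogonal = loss_grade(0.0)  # signals share no common component
```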

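To achieve the above object, the present invention also provides an electronic device comprising a memory and a processor, the memory containing a program for assessing audio signal loss which, when executed by the processor, implements the following steps:

compressing an original audio signal to obtain an audio compression signal corresponding to the original audio signal;

decompressing the audio compression signal to obtain an audio decompression signal corresponding to the audio compression signal;

obtaining the cosine similarity between the audio decompression signal and the original audio signal;

obtaining, according to the cosine similarity, the grade of the original audio signal loss.

Preferably, the original audio signal is compressed according to the μ-law algorithm, the compression formula being:

$$F(x) = \operatorname{sgn}(x)\,\frac{\ln(1 + \mu |x|)}{\ln(1 + \mu)}, \quad -1 \le x \le 1$$

where μ is the companding parameter indicating the degree of compression, x denotes the original audio signal, F(x) denotes the audio compression signal, and sgn is the sign function.

Preferably, the audio compression signal is decompressed according to the μ-law algorithm, the decompression formula being:

$$F^{-1}(y) = \operatorname{sgn}(y)\,\frac{1}{\mu}\left((1 + \mu)^{|y|} - 1\right), \quad -1 \le y \le 1$$

where μ is the companding parameter indicating the degree of compression, y denotes the audio compression signal, F^{-1}(y) denotes the audio decompression signal, and sgn is the sign function.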

To achieve the above object, the present invention also provides a computer-readable storage medium containing a program for assessing audio signal loss which, when executed by a processor, implements the steps of the method for assessing audio signal loss described above.

The method, electronic device, and computer-readable storage medium for assessing audio signal loss proposed by the present invention obtain the cosine similarity between the original audio signal and the audio decompression signal produced by processing it, and from that similarity obtain the grade of the original audio signal loss. The grade quantifies the loss of the audio signal during processing, so that the degree of signal loss is made explicit and the signal processing results can be analyzed and judged correctly.

Brief description of the drawings

Fig. 1 is a schematic diagram of the application environment of a specific embodiment of the method for assessing audio signal loss of the present invention;

Fig. 2 is a module diagram of a specific embodiment of the program for assessing audio signal loss in Fig. 1;

Fig. 3 is a flowchart of a specific embodiment of the method for assessing audio signal loss of the present invention.

The realization of the object, the functional features, and the advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.

Detailed description of the embodiments

It should be understood that the specific embodiments described herein merely illustrate the present invention and are not intended to limit it.

The present invention provides a method for assessing audio signal loss, applied to an electronic device 1. Fig. 1 is a schematic diagram of the application environment of a preferred embodiment of the method for assessing audio signal loss of the present invention.

In this embodiment, the electronic device 1 may be a terminal device with computing capability, such as a server, a smartphone, a tablet computer, a portable computer, or a desktop computer.

The electronic device 1 includes: processor 12, memory 11, network interface 14 and communication bus 15.

The memory 11 includes at least one type of readable storage medium. The at least one type of readable storage medium may be a non-volatile storage medium such as a flash memory, a hard disk, a multimedia card, or a card-type memory. In some embodiments, the readable storage medium may be an internal storage unit of the electronic device 1, such as a hard disk of the electronic device 1. In other embodiments, the readable storage medium may also be an external memory of the electronic device 1, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, or a flash card provided on the electronic device 1.

In this embodiment, the readable storage medium of the memory 11 is generally used to store the program 10 for assessing audio signal loss installed on the electronic device 1, among other things. The memory 11 may also be used to temporarily store data that has been output or is to be output.

In some embodiments, the processor 12 may be a central processing unit (CPU), a microprocessor, or another data processing chip, used to run program code or process data stored in the memory 11, for example to execute the program 10 for assessing audio signal loss.

The network interface 14 may optionally include a standard wired interface and a wireless interface (such as a Wi-Fi interface), and is generally used to establish a communication connection between the electronic device 1 and other electronic devices.

The communication bus 15 is used to realize connection and communication between these components.

Fig. 1 shows only the electronic device 1 with components 11-15. It should be understood that not all of the illustrated components are required, and that more or fewer components may be implemented instead.

Optionally, the electronic device 1 may further include a user interface. The user interface may include an input unit such as a keyboard, a voice input device with speech recognition capability such as a microphone, and a voice output device such as a speaker or earphones. Optionally, the user interface may also include a standard wired interface and a wireless interface.

Optionally, the electronic device 1 may further include a display, which may also be called a display screen or display unit. In some embodiments it may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an organic light-emitting diode (OLED) touch device, or the like. The display is used to show information processed in the electronic device 1 and to show a visual user interface.

Optionally, the electronic device 1 further includes a touch sensor. The area that the touch sensor provides for the user's touch operations is called the touch area. The touch sensor described here may be a resistive touch sensor, a capacitive touch sensor, or the like. It includes not only contact-type touch sensors but may also include proximity-type touch sensors. The touch sensor may be a single sensor or multiple sensors arranged, for example, in an array.

In addition, the area of the display of the electronic device 1 may be the same as or different from the area of the touch sensor. Optionally, the display and the touch sensor are stacked to form a touch display screen, and the device detects touch operations triggered by the user based on the touch display screen.

Optionally, the electronic device 1 may further include a radio frequency (RF) circuit, sensors, an audio circuit, and the like, which are not described in detail here.

In the device embodiment shown in Fig. 1, the memory 11, as a computer storage medium, may include an operating system and the program 10 for assessing audio signal loss. When the processor 12 executes the program 10 for assessing audio signal loss stored in the memory 11, the following steps are implemented:

First, the original audio signal is compressed to obtain the audio compression signal corresponding to the original audio signal.

In this step, audio compression mainly refers to applying appropriate digital signal processing techniques to the original digital audio signal stream (PCM encoding) in order to reduce (compress) its bit rate without losing useful information, or with negligible introduced loss; this is also called compression coding. It must have a corresponding inverse transform, called decompression or decoding. An audio signal may pick up considerable noise and some distortion after passing through a codec system. With the assessment method of the present invention, the degree of loss of the audio signal during processing can be assessed quantitatively.

Then, the audio compression signal is decompressed to obtain the audio decompression signal corresponding to the audio compression signal.

In this step, with the development of computer multimedia technology, the amount of audio signal data to be processed keeps growing. To increase the audio signal transmission rate, reduce the signal bandwidth, and reduce the storage space required for audio signals, it is very important to compress the original audio signal at a high ratio so that it is easier to use in practice. Decompression is the inverse process of compression: it restores documents, files, and other items compressed by software to their state before compression.

Then, the cosine similarity between the audio decompression signal and the original audio signal is obtained.

The calculation formula for obtaining the cosine similarity between the audio decompression signal and the original audio signal is:

$$\cos\theta = \frac{\sum_{i=1}^{n} A_i B_i}{\sqrt{\sum_{i=1}^{n} A_i^{2}}\,\sqrt{\sum_{i=1}^{n} B_i^{2}}}$$

where A is the original audio signal, B is the audio decompression signal, and n is the number of corresponding signal samples.

Finally, the grade of the original audio signal loss is obtained according to the cosine similarity.

The calculation formula of the grade is:

$$S = M\,(1 - \cos\theta)$$

The method for assessing audio signal loss of the present invention is described in detail below, taking the μ-law algorithm as a specific example. It comprises the following steps:

Step 1: the original audio signal is compressed according to the μ-law algorithm to obtain the audio compression signal corresponding to the original audio signal.

In this step, the formula for compressing the original audio signal is:

$$F(x) = \operatorname{sgn}(x)\,\frac{\ln(1 + \mu |x|)}{\ln(1 + \mu)}, \quad -1 \le x \le 1$$

where μ is the companding parameter indicating the degree of compression (in the international standard, μ = 255), x denotes the original audio signal, F(x) denotes the audio compression signal, and sgn is the sign function.

Step 2: the audio compression signal is decompressed according to the μ-law algorithm to obtain the audio decompression signal corresponding to the audio compression signal.

In this step, the formula for decompressing the audio compression signal is:

$$F^{-1}(y) = \operatorname{sgn}(y)\,\frac{1}{\mu}\left((1 + \mu)^{|y|} - 1\right), \quad -1 \le y \le 1$$

where μ is the companding parameter indicating the degree of compression (in the international standard, μ = 255), y denotes the audio compression signal, F^{-1}(y) denotes the audio decompression signal, and sgn is the sign function.

Step 3: the cosine similarity between the audio decompression signal and the original audio signal is obtained.

The cosine similarity is calculated as follows:

$$\cos\theta = \frac{\sum_{i=1}^{n} A_i B_i}{\sqrt{\sum_{i=1}^{n} A_i^{2}}\,\sqrt{\sum_{i=1}^{n} B_i^{2}}}$$

Step 4: the grade of the original audio signal loss is obtained according to the cosine similarity between the audio decompression signal and the original audio signal.

The calculation formula for the grade of the original audio signal loss is:

$$S = M\,(1 - \cos\theta)$$
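The four steps can be chained in a short Python sketch. Several details are assumptions made for illustration and are not stated in the text: the standard μ-law compression curve, storage of the compressed value as an 8-bit code (256 levels), a synthetic 440 Hz test tone sampled at 8 kHz, and M = 5.

```python
import math

MU = 255.0  # companding parameter given in the text


def compress(x: float) -> float:
    """Standard mu-law curve (assumed form of the compression formula)."""
    return math.copysign(1.0, x) * math.log1p(MU * abs(x)) / math.log1p(MU)


def expand(y: float) -> float:
    """Decompression formula: sgn(y) * (1/mu) * ((1 + mu)**|y| - 1)."""
    return math.copysign(1.0, y) * math.expm1(abs(y) * math.log1p(MU)) / MU


def quantize(y: float, levels: int = 256) -> int:
    """Map y in [-1, 1] to an integer code in 0..levels-1 (assumed 8-bit storage)."""
    return math.floor((y + 1.0) / 2.0 * (levels - 1) + 0.5)


def dequantize(code: int, levels: int = 256) -> float:
    """Map an integer code back to a value in [-1, 1]."""
    return 2.0 * code / (levels - 1) - 1.0


def cosine_similarity(a, b) -> float:
    dot = sum(p * q for p, q in zip(a, b))
    return dot / (math.sqrt(sum(p * p for p in a)) * math.sqrt(sum(q * q for q in b)))


# Synthetic test tone: 80 samples of a 440 Hz sine at 8 kHz, amplitude 0.8.
signal = [0.8 * math.sin(2.0 * math.pi * 440.0 * n / 8000.0) for n in range(80)]

codes = [quantize(compress(x)) for x in signal]      # Step 1: compress
decoded = [expand(dequantize(c)) for c in codes]     # Step 2: decompress
cos_theta = cosine_similarity(signal, decoded)       # Step 3: cosine similarity
grade = 5.0 * (1.0 - cos_theta)                      # Step 4: S = M * (1 - cos), M = 5 assumed
```

The quantization between compression and decompression is what introduces the loss; without it, the two μ-law formulas invert each other exactly.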

The above method for assessing signal loss under the μ-law algorithm is equally applicable to other algorithms that process audio signals, such as a-law. In general, audio compression techniques fall into two broad categories: lossless compression and lossy compression. According to the compression scheme, they can further be divided into time-domain compression, transform compression, sub-band compression, and hybrid compression that combines several techniques. These techniques differ greatly in algorithmic complexity (both time complexity and space complexity), audio quality, algorithmic efficiency (that is, compression ratio), and codec delay, and their fields of application differ accordingly.
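To illustrate how the same assessment carries over to a-law, the sketch below swaps in the standard A-law companding curve. The text does not give the a-law formulas; the piecewise definition and the parameter A = 87.6 used here are the commonly cited ones and are assumptions, as are the 8-bit quantization, the test tone, and M = 5.

```python
import math

A = 87.6  # commonly used A-law parameter (assumption; not given in the text)
K = 1.0 + math.log(A)


def a_law_compress(x: float) -> float:
    """Standard A-law curve: linear below 1/A, logarithmic above."""
    ax = abs(x)
    y = A * ax / K if ax < 1.0 / A else (1.0 + math.log(A * ax)) / K
    return math.copysign(y, x)


def a_law_expand(y: float) -> float:
    """Inverse of a_law_compress."""
    ay = abs(y)
    x = ay * K / A if ay < 1.0 / K else math.exp(ay * K - 1.0) / A
    return math.copysign(x, y)


def cosine_similarity(a, b) -> float:
    dot = sum(p * q for p, q in zip(a, b))
    return dot / (math.sqrt(sum(p * p for p in a)) * math.sqrt(sum(q * q for q in b)))


# Synthetic test tone, then an assumed 8-bit quantization of the compressed value.
signal = [0.8 * math.sin(2.0 * math.pi * 440.0 * n / 8000.0) for n in range(80)]
codes = [math.floor((a_law_compress(x) + 1.0) / 2.0 * 255.0 + 0.5) for x in signal]
decoded = [a_law_expand(2.0 * c / 255.0 - 1.0) for c in codes]

cos_theta = cosine_similarity(signal, decoded)
grade = 5.0 * (1.0 - cos_theta)  # same grade formula, M = 5 assumed
```

Only the compress and expand functions change; the similarity and grade steps are identical to the μ-law case.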

Time-domain compression (also called waveform coding) operates directly on the sample values of the audio PCM stream and compresses the bitstream by means such as silence detection, nonlinear quantization, and differencing. The common characteristics of this class of techniques are low algorithmic complexity, average sound quality, a small compression ratio (CD quality > 400 kbps), and the shortest codec delay relative to other techniques. Such techniques are mostly used for speech compression and low-bit-rate applications where the source signal bandwidth is small. Time-domain compression techniques mainly include G.711 (an audio coding method formulated by the International Telecommunication Union, also known as ITU-T G.711, divided into A-law and μ-law), ADPCM, LPC, and CELP, as well as techniques developed from these, such as block companding (NICAM) and sub-band ADPCM (SB-ADPCM).

The signal loss assessment process is illustrated below, taking as an example a 1 ms segment of audio with a sampling rate of 8 kHz and a bit depth of 16 bits.

Let the original audio signal be X=[2203,0,233,24011,1,7,400,2903].

Step 1: the 16-bit original audio signal is converted to 8 bits according to the following compression formula:

$$F(x) = \operatorname{sgn}(x)\,\frac{\ln(1 + \mu |x|)}{\ln(1 + \mu)}, \quad -1 \le x \le 1$$

where μ is the companding parameter indicating the degree of compression (in the international standard, μ = 255), x denotes the 16-bit original audio signal X, F(x) denotes the audio compression signal, and sgn is the sign function.

This gives X₁=F(x)=[194,128,151,248,128,129,160,200].
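The conversion from 16-bit samples to 8-bit codes can be sketched as follows. The text does not state how samples are normalized or how the compressed value is mapped to an integer; this sketch assumes division by 32768, the standard μ-law curve, and rounding of (F + 1)/2 · 255 to the nearest integer. These conventions are assumptions chosen for illustration, and the function name is hypothetical.

```python
import math

MU = 255.0
FULL_SCALE = 32768.0  # assumed normalization for signed 16-bit samples


def encode_sample(sample: int) -> int:
    """Map a signed 16-bit sample to an 8-bit code in 0..255."""
    x = sample / FULL_SCALE
    # Standard mu-law curve applied to the normalized sample.
    y = math.copysign(1.0, x) * math.log1p(MU * abs(x)) / math.log1p(MU)
    # Shift [-1, 1] to [0, 255] and round to the nearest integer.
    return math.floor((y + 1.0) / 2.0 * 255.0 + 0.5)


X = [2203, 0, 233, 24011, 1, 7, 400, 2903]
X1 = [encode_sample(s) for s in X]
print(X1)
```

Under these assumptions, a silent sample lands on the mid-scale code 128, which is consistent with the codes listed for the zero-valued input.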

Step 2: the 8-bit audio compression signal is converted back to 16 bits according to the following decompression formula:

$$F^{-1}(y) = \operatorname{sgn}(y)\,\frac{1}{\mu}\left((1 + \mu)^{|y|} - 1\right), \quad -1 \le y \le 1$$

where μ is the companding parameter indicating the degree of compression (in the international standard, μ = 255), y=X₁, and sgn is the sign function.

This gives the audio decompression signal X₂=F^{-1}(y)=[2188,0,228,24133,2,8,399,2879].
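A corresponding decoding sketch is shown below. It assumes the 8-bit code is first mapped back to [-1, 1] as 2·code/255 - 1, expanded with the decompression formula, scaled by 32768, and truncated to an integer; none of these conventions is stated in the text. Under these assumptions the results fall within a few units of the listed X₂ values, but both occurrences of code 128 decode to the same value here, so an exact match with every listed entry should not be expected.

```python
import math

MU = 255.0
FULL_SCALE = 32768.0  # assumed scaling back to signed 16-bit range


def decode_code(code: int) -> int:
    """Map an 8-bit code in 0..255 back to a 16-bit sample value."""
    y = 2.0 * code / 255.0 - 1.0
    # Decompression formula: sgn(y) * (1/mu) * ((1 + mu)**|y| - 1).
    x = math.copysign(1.0, y) * math.expm1(abs(y) * math.log1p(MU)) / MU
    return int(x * FULL_SCALE)  # truncation toward zero is an assumption


X1 = [194, 128, 151, 248, 128, 129, 160, 200]
X2_sketch = [decode_code(c) for c in X1]
print(X2_sketch)
```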

Step 3: the cosine similarity between X and X₂ is obtained using the cosine similarity formula,

where A=X, B=X₂, n=1.

Step 4: the grade of the original audio signal is obtained according to the cosine similarity between X and X₂:

$$S = 5\,(1 - \cos\theta)$$

where S is the grade and 5 is the full-marks grade; the larger the value of S, the smaller the loss of the audio signal.
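The last two steps of the example can be reproduced numerically with the listed X and X₂. The sketch uses the usual cosine similarity (dot product over the product of Euclidean norms, taken over all eight samples) and evaluates S = 5·(1 - cos θ) as written; variable names are illustrative.

```python
import math

X = [2203, 0, 233, 24011, 1, 7, 400, 2903]    # original signal from the example
X2 = [2188, 0, 228, 24133, 2, 8, 399, 2879]   # decompressed signal from the example

dot = sum(a * b for a, b in zip(X, X2))
norm_x = math.sqrt(sum(a * a for a in X))
norm_x2 = math.sqrt(sum(b * b for b in X2))

cos_theta = dot / (norm_x * norm_x2)
M = 5                       # full-marks grade used in the example
S = M * (1.0 - cos_theta)   # grade formula evaluated literally

print(f"cos(theta) = {cos_theta:.9f}, S = {S:.3e}")
```

For these two vectors the similarity is extremely close to 1, so evaluating the formula literally yields a value of S very close to 0.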

In other embodiments, the program 10 for assessing audio signal loss may also be divided into one or more modules, which are stored in the memory 11 and executed by the processor 12 to carry out the present invention. A module in the sense of the present invention is a series of computer program instruction segments capable of performing a specific function. Fig. 2 is a program module diagram of a preferred embodiment of the program 10 for assessing audio signal loss in Fig. 1.

As shown in Fig. 2, the program 10 for assessing audio signal loss may be divided into:

a compression processing unit 11, for compressing the original audio signal to obtain the audio compression signal corresponding to the original audio signal;

a decompression processing unit 12, for decompressing the audio compression signal to obtain the audio decompression signal corresponding to the audio compression signal;

a similarity acquiring unit 13, for obtaining the cosine similarity between the audio decompression signal and the original audio signal;

a scoring unit 14, for obtaining the grade of the original audio signal loss according to the cosine similarity.
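One way to picture the four units is as methods of a single class, as in the sketch below. The class name, method names, and default parameters are hypothetical; the μ-law curve is the standard one, and no quantization is modeled, so the round trip here is nearly lossless.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class LossAssessmentProgram:
    """Hypothetical layout mirroring units 11-14 of the module diagram."""
    mu: float = 255.0
    full_marks: float = 5.0

    def compression_unit(self, signal: Sequence[float]) -> List[float]:
        k = math.log1p(self.mu)
        return [math.copysign(1.0, x) * math.log1p(self.mu * abs(x)) / k for x in signal]

    def decompression_unit(self, compressed: Sequence[float]) -> List[float]:
        k = math.log1p(self.mu)
        return [math.copysign(1.0, y) * math.expm1(abs(y) * k) / self.mu for y in compressed]

    def similarity_unit(self, original: Sequence[float], restored: Sequence[float]) -> float:
        dot = sum(a * b for a, b in zip(original, restored))
        norms = math.sqrt(sum(a * a for a in original)) * math.sqrt(sum(b * b for b in restored))
        return dot / norms

    def scoring_unit(self, cos_theta: float) -> float:
        return self.full_marks * (1.0 - cos_theta)

    def run(self, signal: Sequence[float]) -> float:
        """Chain the four units in the order given in the text."""
        compressed = self.compression_unit(signal)
        restored = self.decompression_unit(compressed)
        return self.scoring_unit(self.similarity_unit(signal, restored))


grade = LossAssessmentProgram().run([0.1, -0.4, 0.25, 0.9, -0.05])
```

Keeping each unit separate makes it straightforward to replace the compression and decompression units with another codec while reusing the similarity and scoring units.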

In addition, the present invention also provides a method for assessing audio signal loss. Fig. 3 is a flowchart of a preferred embodiment of the method for assessing audio signal loss of the present invention. The method may be executed by a device, and the device may be implemented in software and/or hardware.

In this embodiment, the method for assessing audio signal loss comprises:

Step S110: the original audio signal is compressed to obtain the audio compression signal corresponding to the original audio signal.

In this step, audio compression mainly refers to applying appropriate digital signal processing techniques to the original digital audio signal stream (PCM encoding) in order to reduce (compress) its bit rate without losing useful information, or with negligible introduced loss; this is also called compression coding. It must have a corresponding inverse transform, called decompression or decoding. An audio signal may pick up considerable noise and some distortion after passing through a codec system. With the assessment method of the present invention, the degree of loss of the audio signal during processing can be assessed.

Step S120: the audio compression signal is decompressed to obtain the audio decompression signal corresponding to the audio compression signal.

Step S130: the cosine similarity between the audio decompression signal and the original audio signal is obtained.

Step S140: the grade of the original audio signal loss is obtained according to the cosine similarity.
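Steps S110 to S140 do not depend on a particular codec, which the sketch below makes explicit by passing the compression and decompression functions as parameters. The function `assess_loss`, the crude uniform quantizer used as a stand-in codec, the sample values, and M = 5 are all hypothetical choices for illustration.

```python
import math
from typing import Callable, List, Sequence, Tuple


def assess_loss(
    signal: Sequence[float],
    compress: Callable[[Sequence[float]], list],
    decompress: Callable[[list], List[float]],
    full_marks: float = 5.0,
) -> Tuple[float, float]:
    """Run S110-S140 and return (cosine similarity, grade)."""
    compressed = compress(signal)        # S110: compression
    restored = decompress(compressed)    # S120: decompression
    dot = sum(a * b for a, b in zip(signal, restored))
    norms = math.sqrt(sum(a * a for a in signal)) * math.sqrt(sum(b * b for b in restored))
    if norms == 0.0:
        raise ValueError("cosine similarity is undefined for an all-zero signal")
    cos_theta = dot / norms              # S130: cosine similarity
    return cos_theta, full_marks * (1.0 - cos_theta)  # S140: grade


# Stand-in codec: a coarse uniform quantizer with 15 levels (illustration only).
def coarse_compress(xs: Sequence[float]) -> list:
    return [round(x * 7) for x in xs]


def coarse_decompress(codes: list) -> List[float]:
    return [c / 7 for c in codes]


signal = [0.0, 0.1, 0.5, -0.3, 0.9, -0.75]
cos_theta, grade = assess_loss(signal, coarse_compress, coarse_decompress)
```

Any pair of functions with the same shape, such as μ-law or a-law encode and decode routines, can be substituted for the stand-in codec.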

The calculation formula for obtaining the grade of the original audio signal loss is:

$$S = M\,(1 - \cos\theta)$$

As in the device embodiment, when the μ-law algorithm is used, the audio compression signal is decompressed according to:

$$F^{-1}(y) = \operatorname{sgn}(y)\,\frac{1}{\mu}\left((1 + \mu)^{|y|} - 1\right), \quad -1 \le y \le 1$$

The cosine similarity between the audio decompression signal and the original audio signal is then calculated, and the grade is obtained from the grade formula above.

For example, let the original audio signal be X=[2203,0,233,24011,1,7,400,2903].

Compression gives X₁=F(x)=[194,128,151,248,128,129,160,200].

Decompression gives the audio decompression signal X₂=F^{-1}(y)=[2188,0,228,24133,2,8,399,2879].

The cosine similarity between X and X₂ is then obtained, where A=X, B=X₂, n=1, and the grade is:

$$S = 5\,(1 - \cos\theta)$$

where S is the grade and 5 is the full-marks grade; the larger the value of S, the smaller the loss of the audio signal during processing.

The method for assessing audio signal loss proposed by the present invention obtains the cosine similarity between the original audio signal and the audio decompression signal produced by processing it, and from that similarity obtains the grade of the original audio signal loss. The grade quantifies the loss of the audio signal during processing, so that the degree of loss is made explicit and the signal processing results can be analyzed and judged correctly.

In addition, an embodiment of the present invention also proposes a computer-readable storage medium containing a program for assessing audio signal loss which, when executed by a processor, implements the operations of the method described above, namely compressing the original audio signal, decompressing the audio compression signal, obtaining the cosine similarity, and obtaining the grade.

Preferably, the audio compression signal is decompressed according to the μ-law algorithm, the decompression formula being:

$$F^{-1}(y) = \operatorname{sgn}(y)\,\frac{1}{\mu}\left((1 + \mu)^{|y|} - 1\right), \quad -1 \le y \le 1$$

Preferably, the calculation formula of the grade is:

$$S = M\,(1 - \cos\theta)$$

The specific embodiments of the computer-readable storage medium of the present invention are substantially the same as those of the method and the electronic device for assessing audio signal loss described above, and are not described again here.

It should be noted that, in this document, the terms "include", "comprise", and any variants thereof are intended to cover non-exclusive inclusion, so that a process, device, article, or method that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, device, article, or method. Without further limitation, an element introduced by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, device, article, or method that includes that element.

The serial numbers of the above embodiments of the present invention are for description only and do not indicate the relative merits of the embodiments. From the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by software plus a necessary general-purpose hardware platform, or of course by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes to the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium as described above (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal device (which may be a mobile phone, a computer, a server, a network device, or the like) to execute the methods described in the embodiments of the present invention.

The above are only preferred embodiments of the present invention and do not limit its patent scope. Any equivalent structure or equivalent process transformation made using the contents of the description and drawings of the present invention, whether applied directly or indirectly in other related technical fields, is likewise included within the scope of patent protection of the present invention.

Claims

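1. A method for assessing audio signal loss, applied to an electronic device, characterized in that the method comprises:

compressing an original audio signal to obtain an audio compression signal corresponding to the original audio signal;

decompressing the audio compression signal to obtain an audio decompression signal corresponding to the audio compression signal;

obtaining the cosine similarity between the audio decompression signal and the original audio signal;

obtaining, according to the cosine similarity, the grade of the original audio signal loss.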

2. The method for assessing audio signal loss according to claim 1, characterized in that the step of compressing the original audio signal to obtain the audio compression signal corresponding to the original audio signal comprises:

compressing the original audio signal according to the μ-law algorithm, the compression formula being:

$$F(x) = \operatorname{sgn}(x)\,\frac{\ln(1 + \mu |x|)}{\ln(1 + \mu)}, \quad -1 \le x \le 1$$

where μ is the companding parameter indicating the degree of compression, x denotes the original audio signal, F(x) denotes the audio compression signal, and sgn is the sign function.

3. The method for assessing audio signal loss according to claim 1, characterized in that the step of decompressing the audio compression signal to obtain the audio decompression signal corresponding to the audio compression signal comprises:

decompressing the audio compression signal according to the μ-law algorithm, the decompression formula being:

$$F^{-1}(y) = \operatorname{sgn}(y)\,\frac{1}{\mu}\left((1 + \mu)^{|y|} - 1\right), \quad -1 \le y \le 1$$

where μ is the companding parameter indicating the degree of compression, y denotes the audio compression signal, F^{-1}(y) denotes the audio decompression signal, and sgn is the sign function.

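4. The method for assessing audio signal loss according to claim 1, characterized in that the calculation formula for obtaining the cosine similarity between the audio decompression signal and the original audio signal is:

$$\cos\theta = \frac{\sum_{i=1}^{n} A_i B_i}{\sqrt{\sum_{i=1}^{n} A_i^{2}}\,\sqrt{\sum_{i=1}^{n} B_i^{2}}}$$

where A is the original audio signal, B is the audio decompression signal, and n is the number of corresponding signal samples.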

5. The method for assessing audio signal loss according to claim 1, characterized in that the calculation formula for obtaining the grade of the original audio signal loss is:

$$S = M\,(1 - \cos\theta)$$

where S is the grade, M is the full-marks grade, and cos θ is the cosine similarity between the audio decompression signal and the original audio signal; the larger the value of S, the smaller the loss of the audio signal.

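6. An electronic device, characterized in that the electronic device comprises a memory and a processor, the memory containing a program for assessing audio signal loss which, when executed by the processor, implements the following steps:

compressing an original audio signal to obtain an audio compression signal corresponding to the original audio signal;

decompressing the audio compression signal to obtain an audio decompression signal corresponding to the audio compression signal;

obtaining the cosine similarity between the audio decompression signal and the original audio signal;

obtaining, according to the cosine similarity, the grade of the original audio signal loss.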

7. The electronic device according to claim 6, characterized in that the step of compressing the original audio signal to obtain the audio compression signal corresponding to the original audio signal comprises:

compressing the original audio signal according to the μ-law algorithm, the compression formula being:

$$F(x) = \operatorname{sgn}(x)\,\frac{\ln(1 + \mu |x|)}{\ln(1 + \mu)}, \quad -1 \le x \le 1$$

where μ is the companding parameter indicating the degree of compression, x denotes the original audio signal, F(x) denotes the audio compression signal, and sgn is the sign function.

8. The electronic device according to claim 6, characterized in that the step of decompressing the audio compression signal to obtain the audio decompression signal corresponding to the audio compression signal comprises:

decompressing the audio compression signal according to the μ-law algorithm, the decompression formula being:

$$F^{-1}(y) = \operatorname{sgn}(y)\,\frac{1}{\mu}\left((1 + \mu)^{|y|} - 1\right), \quad -1 \le y \le 1$$

where μ is the companding parameter indicating the degree of compression, y denotes the audio compression signal, F^{-1}(y) denotes the audio decompression signal, and sgn is the sign function.

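9. The electronic device according to claim 6, characterized in that the calculation formula for obtaining the cosine similarity between the audio decompression signal and the original audio signal is:

$$\cos\theta = \frac{\sum_{i=1}^{n} A_i B_i}{\sqrt{\sum_{i=1}^{n} A_i^{2}}\,\sqrt{\sum_{i=1}^{n} B_i^{2}}}$$

where A is the original audio signal, B is the audio decompression signal, and n is the number of corresponding signal samples.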

10. A computer-readable storage medium, characterized in that the computer-readable storage medium contains a program for assessing audio signal loss which, when executed by a processor, implements the steps of the method for assessing audio signal loss according to any one of claims 1 to 5.