CN107871509A

CN107871509A - Method for processing digital audio signal

Info

Publication number: CN107871509A
Application number: CN201610848381.8A
Authority: CN
Inventors: 李庆成; 鹿毅忠; 陶玮
Original assignee: Individual
Current assignee: Individual
Priority date: 2016-09-23
Filing date: 2016-09-23
Publication date: 2018-04-03

Abstract

The invention provides a method for processing a digital audio signal. Based on the MP3 frame structure, a "0 frame" is selected as the audio frame for embedding. In a "0 frame" whose scale-factor scale value in the frame side information is "0", the predetermined embedding bit of a scale factor corresponding to a predetermined frequency band in a non-short window of the audio frame is set to the data value to be embedded. The invention also provides a method for processing a digital audio signal that, for a "0 frame" structured as above, reads the value of the predetermined embedding bit of the scale factor. The invention thus allows data to be embedded into, and read from, compressed and encoded audio files or data.

Description

Method for processing digital audio signal

Technical field

The present invention relates to a method for processing a digital audio signal, and in particular to a method for embedding data into, and reading data from, encoded digital audio. It belongs to the field of audio and video processing technology.

Background art

Using digital audio and video signals to carry information is a technology that has attracted wide attention in the industry, and considerable manpower and funding have been invested in its research and development. With such technology, people can listen to music or watch television as usual while using a device capable of audio signal processing, such as a mobile communication terminal, to obtain the data carried in that music or television programme. The applicant has previously carried out extensive analysis and research on embedding data in digital audio and video signals and filed Chinese patent application No. 201510447092.2.

That Chinese patent application claims a technical solution for embedding data in audio data that has not been compressed and encoded. As is well known, most audio in wide use today takes the form of compressed, encoded audio files or data, for example audio data using the MPEG1-Layer3 (hereinafter "MP3") frame structure. A compressed, encoded audio file or data could first be converted so that it meets the conditions for embedding data with the earlier invention, and the data embedded afterwards. Doing so, however, adds a large number of unnecessary conversion operations and forces the format of the audio data to change, which is inconvenient for use and storage and wastes the resources of the devices involved. A technique is therefore needed that can embed data directly into compressed, encoded audio files or data.

Summary of the invention

A first aspect of the invention provides a method for processing a digital audio signal, comprising: based on the MP3 frame structure, selecting an audio frame whose granule number is "0" (hereinafter "0 frame") as the audio frame for embedding; selecting, in a frame whose scale-factor scale value in the frame side information is "0", one or more scale factors corresponding to a predetermined frequency band in a non-short window of the audio frame; and setting the value of the predetermined embedding bit of each such scale factor to the value corresponding to the data value to be embedded. There are one or more predetermined embedding bits.

Another aspect of the invention provides a method for processing a digital audio signal, comprising: based on the MP3 frame structure, selecting a "0 frame"; selecting those audio frames whose scale-factor scale value in the frame side information is "0"; selecting one or more scale factors corresponding to a predetermined frequency band in a non-short window of the audio frame; and reading the value of the predetermined embedding bit of each such scale factor. There are one or more predetermined embedding bits.

The method provided by the first aspect of the invention embeds data into digital audio data that uses the MP3 frame structure, so that data can be embedded into such compressed, encoded audio files or data without complex processing.

The method provided by the other aspect of the invention decodes digital audio data into which data has been embedded using the above solution, and effectively recovers the embedded data from the audio file or data.

Detailed description of the embodiments

The technical solution of the invention is illustrated below through various embodiments. A person of ordinary skill in the art can, without creative effort, generalize from the following embodiments, and the result necessarily falls within the technical solution of the invention.

In one embodiment of the invention, the audio data is encoded using the MP3 frame structure. Not every audio frame is suitable for the technical solution of the invention; in this embodiment, data is embedded only in "0 frames" whose scale-factor scale value in the frame side information is "0". Within such a "0 frame", one or more scale factors corresponding to a predetermined frequency band in a non-short window are selected as the objects of the embedding operation. The value of the predetermined embedding bit in each of these scale factors is then set to the value corresponding to the data value to be embedded.
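The three selection conditions in this embodiment can be sketched as a simple filter. This is a minimal illustration, assuming the MP3 side information has already been parsed by some decoder; the `Granule` class, its field names and `is_embeddable` are hypothetical and are not defined by the invention or by any particular library.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Granule:
    """Hypothetical, already-parsed view of one granule of an MP3 frame."""
    index: int                  # granule number within the frame (0 or 1)
    scalefac_scale: int         # scale-factor scale value from the side information
    is_short_window: bool       # True when the granule uses short windows
    scalefactors: List[int] = field(default_factory=list)


def is_embeddable(g: Granule) -> bool:
    """True when the granule is a "0 frame", has scale value 0 and is non-short."""
    return g.index == 0 and g.scalefac_scale == 0 and not g.is_short_window


granules = [
    Granule(0, 0, False, [3, 5, 2]),  # meets all three conditions
    Granule(1, 0, False, [3, 5, 2]),  # granule number 1: skipped
    Granule(0, 1, False, [1, 1, 1]),  # scale value 1: skipped
    Granule(0, 0, True, [4, 4, 4]),   # short window: skipped
]
eligible = [is_embeddable(g) for g in granules]
```

Only the first granule in the sample list passes the filter; the others each fail exactly one condition.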

"0 frames" are selected so that the scale factors chosen afterwards are stable. An audio frame whose granule number is "1" (hereinafter "1 frame") sometimes shares the scale factors of the "0 frame", so only the scale factors in a "0 frame" are stable. For this reason, all embodiments of the invention use "0 frames" as the embedding object.

An MP3 audio frame generally uses several window types: long window, start window, end window and short window. The long, start and end windows are less sensitive in the time domain than the short window. Therefore, in some embodiments of the invention, scale factors in short windows are not selected as embedding objects. Scale factors of long, start and end windows can all serve as objects for data embedding; which of them are actually used can be decided by the designer of a specific implementation.
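The window classification can be sketched as a lookup. The numeric codes below (0 long, 1 start, 2 short, 3 end) follow the usual MPEG-1 Layer III `block_type` convention and are an assumption here, since the description names the window types but gives no codes; `is_non_short` is a hypothetical helper.

```python
# Assumed block_type codes from the MP3 side information; the description
# names the window types but does not state their numeric codes.
WINDOW_NAMES = {0: "long", 1: "start", 2: "short", 3: "end"}


def is_non_short(block_type: int) -> bool:
    """Return True for long, start and end windows, False for short windows."""
    if block_type not in WINDOW_NAMES:
        raise ValueError(f"unknown block_type: {block_type}")
    return WINDOW_NAMES[block_type] != "short"


# Window types whose scale factors may be used for embedding.
usable = [name for code, name in sorted(WINDOW_NAMES.items()) if is_non_short(code)]
```

A designer who wants to restrict embedding further, for example to long windows only, would narrow this test rather than widen it.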

As the object of embedding, a scale factor generally has more than one data bit. When embedding, either a single data bit or several data bits may be chosen as predetermined embedding bits. Embedding itself consists of setting the predetermined embedding bit to the data to be embedded. For example, when the data value to be embedded is "0", the predetermined embedding bit of the selected scale factor is set to "0"; likewise, when the data value to be embedded is "1", the predetermined embedding bit of the selected scale factor is set to "1".
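Setting one or several predetermined embedding bits can be sketched with ordinary bit masks. The function names and the convention that position 0 is the lowest bit are illustrative assumptions, not part of the described method.

```python
from typing import Sequence


def set_bit(value: int, position: int, bit: int) -> int:
    """Return `value` with the bit at `position` (0 = lowest bit) forced to `bit`."""
    if bit not in (0, 1):
        raise ValueError("bit must be 0 or 1")
    mask = 1 << position
    return (value | mask) if bit else (value & ~mask)


def embed_bits(scalefactor: int, positions: Sequence[int], bits: Sequence[int]) -> int:
    """Write each data bit into its predetermined embedding bit of one scale factor."""
    for position, bit in zip(positions, bits):
        scalefactor = set_bit(scalefactor, position, bit)
    return scalefactor


# Scale factor 0b0101 with two embedding bits: bit 0 <- 0, bit 1 <- 1.
example = embed_bits(0b0101, positions=[0, 1], bits=[0, 1])
```

In this example the scale factor changes from 0b0101 to 0b0110; bits outside the chosen positions are left untouched.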

Sometimes, for specific needs, the embedded data value may follow inverse logic. That is, when the data value to be embedded is "0", the predetermined embedding bit is actually set to "1"; conversely, when the data value to be embedded is "1", the predetermined embedding bit is set to "0".
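The positive and inverse logic conventions differ only by a bit flip, which can be sketched as follows. The helper names and the `inverse_logic` flag are hypothetical.

```python
def to_stored_bit(data_bit: int, inverse_logic: bool = False) -> int:
    """Map a data bit to the value written into the embedding bit."""
    if data_bit not in (0, 1):
        raise ValueError("data_bit must be 0 or 1")
    return data_bit ^ 1 if inverse_logic else data_bit


def to_data_bit(stored_bit: int, inverse_logic: bool = False) -> int:
    """Map a value read from the embedding bit back to the data bit."""
    # The flip is its own inverse, so decoding reuses the same mapping.
    return to_stored_bit(stored_bit, inverse_logic)


stored = [to_stored_bit(b, inverse_logic=True) for b in (0, 1)]
```

Because the mapping is symmetric, the embedder and the reader only need to agree on a single flag.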

More than one predetermined embedding bit can be set in a scale factor, and each can carry embedded data. However, using all of the predetermined embedding bits may sometimes affect audio quality. A preferred scheme is therefore to use only one predetermined embedding bit in any given scale factor. A more preferred scheme is to use only the lowest bit of the scale factor as the predetermined embedding bit. In that case, the human ear cannot distinguish the audio after embedding from the original audio.
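The lowest-bit scheme can be sketched as below, assuming one data bit per selected scale factor; `embed_lsb` is a hypothetical name. The sketch also shows why the change is small: each scale factor moves by at most 1.

```python
from typing import List, Sequence


def embed_lsb(scalefactors: Sequence[int], bits: Sequence[int]) -> List[int]:
    """Write one data bit into the lowest bit of each leading scale factor."""
    if len(bits) > len(scalefactors):
        raise ValueError("more data bits than available scale factors")
    out = list(scalefactors)
    for i, bit in enumerate(bits):
        out[i] = (out[i] & ~1) | (bit & 1)  # clear the lowest bit, then set it
    return out


original = [3, 5, 2, 7]
stego = embed_lsb(original, [0, 1, 1, 0])
max_change = max(abs(a - b) for a, b in zip(original, stego))
```

For the sample values the result is `[2, 5, 3, 6]`, with no scale factor changed by more than 1.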

As is well known, the range of audio frequencies that humans can perceive is 20-20000 Hz. The technical solution of the invention is applicable across this entire range.

However, within the range of human-audible frequencies, data embedded in the relatively low and relatively high frequency bands is not very robust, and additional technical measures are needed to compensate. Although feasible, this increases device resource overhead and cost when processing the data. Therefore, on the basis of the feasible, wider frequency range above, the invention provides a more preferred range: the predetermined frequency band is set to 80-5120 Hz. This ensures both the robustness and the inaudibility of the embedded data.
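One way to map the 80-5120 Hz range onto scale-factor bands is sketched below. Several things here are assumptions and not stated in the description: the band-edge table (the values commonly tabulated for long blocks at 44.1 kHz, to be checked against the MP3 standard), the 576 frequency lines per granule, and the rule that a band counts as inside the range when its centre frequency does.

```python
from typing import List, Sequence

# Assumed scale-factor band edges, in frequency lines, for long blocks at a
# 44.1 kHz sampling rate. Verify against the MP3 standard before relying on them.
LONG_BAND_EDGES_44100 = [0, 4, 8, 12, 16, 20, 24, 30, 36, 44, 52, 62, 74,
                         90, 110, 134, 162, 196, 238, 288, 342, 418, 576]


def bands_in_range(edges: Sequence[int], sample_rate_hz: float,
                   low_hz: float = 80.0, high_hz: float = 5120.0,
                   lines_per_granule: int = 576) -> List[int]:
    """Return indices of bands whose centre frequency lies in [low_hz, high_hz]."""
    line_width_hz = (sample_rate_hz / 2) / lines_per_granule
    selected = []
    for band in range(len(edges) - 1):
        centre_hz = (edges[band] + edges[band + 1]) / 2 * line_width_hz
        if low_hz <= centre_hz <= high_hz:
            selected.append(band)
    return selected


selected_bands = bands_in_range(LONG_BAND_EDGES_44100, 44100)
```

Under these assumptions bands 1 through 14 are selected. A different inclusion rule, such as requiring the whole band to lie inside the range, would shift the boundary bands.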

Corresponding to the embodiments for embedding data described above, reading the embedded data from an audio signal into which data has been embedded using this technical solution uses the following embodiments:

As described above, data is embedded only in "0 frames" whose scale-factor scale value in the frame side information is "0". When reading data, therefore, the same "0 frames" are selected and all other audio frames are ignored. On this basis, one or more scale factors corresponding to a predetermined frequency band in a non-short window are likewise selected as the objects from which embedded data is read, and the values of the predetermined embedding bits of these scale factors are then read. For the sensitivity reason given above, a non-short window here means a long window, a start window or an end window.

Corresponding to the schemes above for selecting the predetermined embedding bit in a scale factor, reading must use the same scheme that was used for embedding. A preferred scheme is to take only the lowest bit of a scale factor as the predetermined embedding bit and read its data value. After the value of the predetermined embedding bit has been read, the data can be further processed according to whether the overall embedding scheme uses positive logic or inverse logic.
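The read-out step can be sketched as follows, assuming the lowest-bit scheme and one data bit per selected scale factor. `read_lsb` and its parameters are hypothetical names.

```python
from typing import List, Sequence


def read_lsb(scalefactors: Sequence[int], count: int,
             inverse_logic: bool = False) -> List[int]:
    """Read the lowest bit of the first `count` scale factors as data bits."""
    if count > len(scalefactors):
        raise ValueError("not enough scale factors to read from")
    stored = [sf & 1 for sf in scalefactors[:count]]
    # Under inverse logic the stored bit is the complement of the data bit.
    return [b ^ 1 for b in stored] if inverse_logic else stored


carrier = [2, 5, 3, 6]
positive = read_lsb(carrier, 4)
inverse = read_lsb(carrier, 4, inverse_logic=True)
```

The same scale factors yield `[0, 1, 1, 0]` under positive logic and `[1, 0, 0, 1]` under inverse logic, which is why the reader must know which convention the embedder used.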

For the same reasons as in the embedding scheme, although the technical solution is applicable to the whole human-audible range of 20-20000 Hz, a more preferred frequency range of the invention, in view of robustness and the other factors mentioned above, is 80-5120 Hz. That is, the predetermined frequency band is set to 80-5120 Hz, and only data embedded in scale factors within this range is read.

Finally, it should be noted that the above embodiments merely illustrate the technical solution of the invention and do not limit it. Although the invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art will understand that the technical solutions described in those embodiments can still be modified, or some or all of their technical features replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solution to depart from the scope of the technical solutions of the embodiments of the invention.

Claims

1. A method for processing a digital audio signal, comprising:

based on the MP3 frame structure, selecting an audio frame whose granule number is "0" as the audio frame for embedding; selecting, in a frame whose scale-factor scale value in the frame side information is "0", one or more scale factors corresponding to a predetermined frequency band in a non-short window of the audio frame;

setting the value of the predetermined embedding bit of the scale factor to the value corresponding to the data value to be embedded; wherein there are one or more predetermined embedding bits.

2. The method according to claim 1, characterized in that:

the non-short window comprises a long window, a start window and/or an end window; and/or

setting the value of the predetermined embedding bit of the scale factor to the value corresponding to the data value to be embedded specifically comprises:

for one predetermined embedding bit, setting the predetermined embedding bit to 0 when the data value to be embedded is 0, and setting the predetermined embedding bit to 1 when the data value to be embedded is 1; or

for one predetermined embedding bit, setting the predetermined embedding bit to 1 when the data value to be embedded is 0, and setting the predetermined embedding bit to 0 when the data value to be embedded is 1.

3. The method according to claim 1 or 2, characterized in that the predetermined embedding bit is the lowest bit of the scale factor.

4. The method according to claim 1 or 2, characterized in that the predetermined frequency band comprises the whole audio frequency range, namely 20-20000 Hz.

5. The method according to claim 4, characterized in that the predetermined frequency band comprises 80-5120 Hz.

6. A method for processing a digital audio signal, comprising:

based on the MP3 frame structure, selecting an audio frame whose granule number is "0"; selecting those audio frames whose scale-factor scale value in the frame side information is "0"; and selecting one or more scale factors corresponding to a predetermined frequency band in a non-short window of the audio frame;

reading the value of the predetermined embedding bit of the scale factor; wherein there are one or more predetermined embedding bits.

7. The method according to claim 6, characterized in that the non-short window comprises a long window, a start window and/or an end window.

8. The method according to claim 6 or 7, characterized in that the predetermined embedding bit is the lowest bit of the scale factor.

9. The method according to claim 6 or 7, characterized in that the predetermined frequency band comprises the whole audio frequency range, namely 20-20000 Hz.

10. The method according to claim 9, characterized in that the predetermined frequency band comprises 80-5120 Hz.