CN107871509A - Method for processing digital audio signal - Google Patents

Method for processing digital audio signal Download PDF

Info

Publication number
CN107871509A
CN107871509A CN201610848381.8A CN201610848381A CN107871509A CN 107871509 A CN107871509 A CN 107871509A CN 201610848381 A CN201610848381 A CN 201610848381A CN 107871509 A CN107871509 A CN 107871509A
Authority
CN
China
Prior art keywords
predetermined
embedded
frame
data
embedded position
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610848381.8A
Other languages
Chinese (zh)
Inventor
李庆成
鹿毅忠
陶玮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201610848381.8A priority Critical patent/CN107871509A/en
Publication of CN107871509A publication Critical patent/CN107871509A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)

Abstract

The invention provides a kind of method for processing digital audio signal, is specifically based on MP3 frame structure, selects " 0 frame " as embedded audio frame;Select the zoom factor scale-value of frame side information to make a reservation for embedded position for " 0 ", in the scale factor of predetermined band in the non-short window of audio frame, be set to the data value to be embedded in.Present invention also offers a kind of method for processing digital audio signal, the structure such as foregoing " 0 frame " reads the numerical value of the predetermined embedded position of wherein scale factor.The present invention can be to the foregoing this kind of operation realized data insertion through overcompression, the audio file of coding or data and read.

Description

Method for processing digital audio signal
Technical field
The present invention relates to a kind of method for processing digital audio signal, and especially one kind is embedded in digital audio encoding and reads The method for evidence of fetching, belongs to audio frequency and video treatment technology.
Background technology
Digital audio and video signals are come to carry information be that industry is widely paid close attention to and puts into suitable manpower and financial resources and studied and opened The technology of hair.Using such technology, people can normally listen to music, TV reception on one side, while using having The equipment of Audio Signal Processing ability, such as:Mobile communication terminal, it is entrained in foregoing music or TV programme to obtain Data message.Applicant of the present invention was once largely analyzed and studied excessively with regard to embedding data in digital audio and video signals, And propose No. 201510447092.2 Chinese patent applications.
Claimed a kind of embedded number in uncompressed, coding voice data in above-mentioned Chinese patent application According to technical scheme.It is well known that:It is widely used at present be largely through overcompression, the audio file of coding or data, Such as:Using MPEG1-Layer3 (hereinafter referred to as:MP3) the voice data of frame structure.Although it will can compress, the audio of coding File or data are first changed, and are allowed to possess the place for carrying out data insertion after the condition using aforementioned invention embedding data again Reason.But on the one hand so doing, can increase a large amount of unnecessary conversion operations, on the other hand, the form of voice data can be caused Inevitably it is forced to change, is unfavorable for using, stores, the unnecessary waste that can more give related equipment belt to carry out resource.Therefore, Need a kind of technology that can be directed to and data embedding operation is realized through overcompression, the audio file of coding or data.
The content of the invention
The first aspect of the invention there is provided a kind of method for processing digital audio signal, specifically include:Based on MP3's Frame structure, the particle group # of selection for " 0 " audio frame (hereinafter referred to as:" 0 frame ") as embedded audio frame;Select frame side The zoom factor scale-value of information is " 0 ", positioned at more than one scale factor corresponding to predetermined band in the non-short window of audio frame; The numerical value of the predetermined embedded position of the aforementioned ratio factor is arranged to and the numerical value corresponding to the data value to be embedded in;Foregoing is predetermined Embedded position is more than one.
Another aspect of the present invention there is provided a kind of method for processing digital audio signal, specifically include:Based on MP3's Frame structure, select " 0 frame ";Also, the zoom factor scale-value of frame side information is selected as the audio frame of " 0 ";And selection is foregoing More than one scale factor in the non-short window of audio frame corresponding to predetermined band;Read the number of the predetermined embedded position of the scale factor Value;Foregoing predetermined embedded position is more than one.
The method for processing digital audio signal that the first aspect of the invention is provided, it can be directed to using based on MP3's The digital audio-frequency data of frame structure performs the processing of data insertion so that need not do the processing of complexity, it becomes possible to foregoing this kind of The operation of data insertion is realized through overcompression, the audio file of coding or data.
The method for processing digital audio signal that another aspect of the present invention is provided, can be to using preceding solution The digital audio-frequency data perform decoding processing of embedding data, and effectively recovery obtains embedded number from audio file or data It is believed that breath.
Embodiment
Technical scheme is illustrated below with reference to various embodiments, for those of ordinary skill in the art For, without having to pay creative labor, it can be concluded and be summarized according to following various embodiments, its As a result necessarily and the technical scheme is that identical.
In a kind of embodiment of the present invention, voice data is encoded using the frame structure based on MP3. But not all audio frame is suitable for using technical scheme;In the specific embodiment of the present invention In, only select the zoom factor scale-value of frame side information to perform the operation of embedding data for " 0 frame " of " 0 ".Foregoing " 0 One or more scale factor in frame " in selection non-short window therein corresponding to some predetermined frequency range is as embedding data Operation object.Then, the numerical value for making a reservation for embedded position in the aforementioned ratio factor is arranged to right with the data value to be embedded in institute The numerical value answered.
In foregoing scheme, " 0 frame " is selected to be in order to ensure follow-up selected scale factor has stability.It is logical Often, for particle group # for " 1 " audio frame (hereinafter referred to as:" 1 frame "), because its scale factor shares " 0 sometimes Scale factor in frame ".Therefore, the scale factor only in " 0 frame " has stable feature.So in all of the present invention Embodiment in, using " 0 frame " as embedded object.
In MP3 audio frame, a variety of windows are generally comprised, i.e.,:Long window, start window, terminate window and short window.Wherein, long window, The susceptibility of start window and end window in the time domain will be less than short window.Therefore, in some specific embodiments of the present invention, Object of the scale factor as insertion in short window is not selected.The scale factor of long window, start window and end window can then be made The object being embedded in for data, but how on earth use, can voluntarily it be set in specific embodiments by the designer of correlation It is fixed.
As the object for being embedded into data, generally there is more than one data bit in a scale factor.Specific embedding It is fashionable, it can both select one of data bit to select multiple data bit as predetermined embedding as predetermined embedded position Enter position.For specific insertion, the data of predetermined embedded position are exactly arranged to the operation for the data to be embedded in.Such as: When it is " 0 " to need embedded data value, then the data value of the predetermined embedded position of selected scale factor is arranged to " 0 "; Equally, when it is " 1 " to need embedded data value, then the data value of the predetermined embedded position of selected scale factor is arranged to “1”。
Sometimes, based on some specific needs, embedded data value may be antilogical, i.e., to be embedded in originally Data value be " 0 ", and when being actually embedded in, according to the requirement of antilogical, predetermined embedded position is arranged to data value “1”;Then it is to set predetermined embedded position in the case of the data value to be embedded in is " 1 " conversely, on the premise of antilogical It is set to data value " 0 ".
Although more than one predetermined embedded position can be set, and each predetermined embedded position can in a scale factor To be embedded into data.But fully it is possible to audio quality can be caused to be affected using all predetermined embedded positions sometimes. Therefore, the scheme of an optimization is:For any one scale factor, only selection uses one of those predetermined embedded position.More It is for the scheme of optimization:It is predetermined embedded position that only selection, which uses the lowest order in the scale factor, in a scale factor. Under this situation, the difference between audio quality and former audio quality after embedding data, human ear can not identify.
It is well-known:The audio frequency range that the mankind can perceive is 20-20000Hz.For technical scheme For, it can be adapted in above-mentioned whole audiorange.
But in the audio frequency range that the whole mankind can perceive, the relatively low frequency range of frequency and frequency are relative Embedding data in higher frequency range, its robustness are not ideal, it is necessary to take technical measures to compensate in addition.Although this can OK, the device resource expense and cost during processing data but can undoubtedly be increased.Therefore, the present invention is above-mentioned feasible, larger On the basis of frequency range, there is provided the frequency range more optimized, i.e.,:Predetermined frequency range is determined into the scope in 80-5120Hz. The not audibility for so allowing for robustness and embedding data is obtained for guarantee.
Relative to all kinds of specific embodiments of foregoing embedding data in technical solution of the present invention, from using the technology When reading is embedded into data in the audio signal of scheme embedding data, it is obviously desirable to using following some embodiments:
As previously described, because in embedding data, the zoom factor scale-value of frame side information is only selected as " 0 frame " of " 0 " To perform the operation of embedding data.Therefore, when reading data, foregoing " 0 frame " is also selected, and ignores other audio frames. Herein on basis, equally to select one or more ratio in non-short window therein corresponding to some predetermined frequency range because Reading object of the son as embedding data.Then, the numerical value of the predetermined embedded position of these scale factors is read.Based on foregoing susceptibility The reason for, non-short window actually refers to:Long window, start window or this terminate window.
With above corresponding about making a reservation for the selection scheme of embedded position in a scale factor, when reading data, it is necessary to During with embedding data corresponding to used scheme.One scheme relatively optimized is:Only selected in a scale factor The lowest order selected in the scale factor is predetermined embedded position, reads the data value of the predetermined embedded position.Reading predetermined embedded position Data value after, can further handle the data of reading according to overall embedded scheme is positive logic or antilogical.
Based on the reason for identical with embedding data scheme, although for technical scheme, the mankind can be adapted to The whole audio frequency range that can be perceived:20-20000Hz.But, it is contemplated that the factor of foregoing robustness etc., this One frequency range more optimized of invention is 80-5120Hz.I.e.:Predetermined frequency range is determined as 80-5120Hz In scope, the embedding data in scale factor in this frequency range is only read.
Finally it should be noted that:Various embodiments above is merely illustrative of the technical solution of the present invention, rather than its limitations;To the greatest extent The present invention is described in detail with reference to foregoing embodiments for pipe, it will be understood by those within the art that:Its according to The technical scheme described in foregoing embodiments can so be modified, either which part or all technical characteristic are entered Row equivalent substitution;And these modifications or replacement, the essence of appropriate technical solution is departed from various embodiments of the present invention technology The scope of scheme.

Claims (10)

1. a kind of method for processing digital audio signal, including:
Frame structure based on MP3, the particle group # of selection are used as embedded audio frame for the audio frame of " 0 ";Select frame side information Zoom factor scale-value for " 0 ", positioned at more than one scale factor corresponding to predetermined band in the non-short window of audio frame;
The numerical value of the predetermined embedded position of the scale factor is arranged to and the numerical value corresponding to the data value to be embedded in;Described Predetermined embedded position is more than one.
2. according to the method for claim 1, it is characterised in that:
Described non-short window includes:Long window, start window and/or end window;And/or
The numerical value by the predetermined embedded position of the scale factor is arranged to have with the numerical value corresponding to the data value to be embedded in Body includes:
Embedded position predetermined to one, when the data value to be embedded in is 0, is then arranged to 0 by the predetermined embedded position;When wanting When embedded data value is 1, then the predetermined embedded position is arranged to 1;Or
Embedded position predetermined to one, when the data value to be embedded in is 0, is then arranged to 1 by the predetermined embedded position;When wanting When embedded data value is 1, then the predetermined embedded position is arranged to 0.
3. method according to claim 1 or 2, it is characterised in that:Described predetermined embedded position is the scale factor Lowest order.
4. method according to claim 1 or 2, it is characterised in that:The predetermined band includes whole audio frequency, i.e.,: 20-20000Hz。
5. according to the method for claim 4, it is characterised in that:The predetermined band includes:80-5120Hz.
6. a kind of method for processing digital audio signal, including:
Frame structure based on MP3, the particle group # of selection for " 0 " audio frame;Also, select the zoom factor of frame side information Scale-value is the audio frame of " 0 ";Also, more than one ratio in the non-short window of the selection audio frame corresponding to predetermined band The factor;
Read the numerical value of the predetermined embedded position of the scale factor;Described predetermined embedded position is more than one.
7. according to the method for claim 6, it is characterised in that:Described non-short window includes:Long window, start window and/or knot Beam window.
8. the method according to claim 6 or 7, it is characterised in that:Described predetermined embedded position is the scale factor Lowest order.
9. the method according to claim 6 or 7, it is characterised in that:The predetermined band includes whole audio frequency, i.e.,: 20-20000Hz。
10. according to the method for claim 9, it is characterised in that:The predetermined band includes:80-5120Hz.
CN201610848381.8A 2016-09-23 2016-09-23 Method for processing digital audio signal Pending CN107871509A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610848381.8A CN107871509A (en) 2016-09-23 2016-09-23 Method for processing digital audio signal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610848381.8A CN107871509A (en) 2016-09-23 2016-09-23 Method for processing digital audio signal

Publications (1)

Publication Number Publication Date
CN107871509A true CN107871509A (en) 2018-04-03

Family

ID=61751631

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610848381.8A Pending CN107871509A (en) 2016-09-23 2016-09-23 Method for processing digital audio signal

Country Status (1)

Country Link
CN (1) CN107871509A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1638479A (en) * 2003-12-27 2005-07-13 Lg电子有限公司 Digital audio watermark inserting/detecting apparatus and method
CN101290773A (en) * 2008-06-13 2008-10-22 清华大学 Adaptive MP3 digital watermark embedding method
US8799002B1 (en) * 2008-12-01 2014-08-05 Marvell International Ltd. Efficient scalefactor estimation in advanced audio coding and MP3 encoder

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1638479A (en) * 2003-12-27 2005-07-13 Lg电子有限公司 Digital audio watermark inserting/detecting apparatus and method
CN101290773A (en) * 2008-06-13 2008-10-22 清华大学 Adaptive MP3 digital watermark embedding method
US8799002B1 (en) * 2008-12-01 2014-08-05 Marvell International Ltd. Efficient scalefactor estimation in advanced audio coding and MP3 encoder

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
文仁轶,潘峰,申军伟: "针对MP3压缩域比例因子的音频水印算法", 《计算机工程与应用》 *
晁婷婷,王新房,蒋存云: "基于压缩域的MP3音频数字水印算法", 《计算机工程》 *
曹军梅: "一种用于MP3文件版权保护的鲁棒水印算法", 《现代电子技术》 *
李伟,等。: "数字音频水印技术综述", 《通信学报》 *
董亚坤: "《硕士学位论文》", 30 August 2015, 北京邮电大学 *
金渊智: "一种奇偶量化的音频水印算法", 《濮阳职业技术学院学报》 *

Similar Documents

Publication Publication Date Title
CN104917671B (en) Audio-frequency processing method and device based on mobile terminal
CN102652337A (en) Device and method for acoustic communication
CN101964189B (en) Audio signal switching method and device
CN106297811A (en) Audio treatment unit and audio-frequency decoding method
CN103295578A (en) Method and device for processing voice frequency signal
CN104299622A (en) Audio processing method
US9584944B2 (en) Stereo decoding method and apparatus using group delay and group phase parameters
CN104835520A (en) Mobile equipment recording method and device
CN105551512A (en) Audio format conversion method and apparatus
CN112530444B (en) Audio coding method and device
JP6608380B2 (en) Communication system, method and apparatus with improved noise resistance
US20210264926A1 (en) Inter-channel phase difference parameter encoding method and apparatus
CN104284018A (en) Terminal
US7428440B2 (en) Method and apparatus for preserving matrix surround information in encoded audio/video
CN111681664A (en) Method, system, storage medium and equipment for reducing audio coding rate
CN109587489A (en) A kind of method of video compression
CN103413553A (en) Audio coding method, audio decoding method, coding terminal, decoding terminal and system
WO2021244418A1 (en) Audio encoding method and audio encoding apparatus
US9312893B2 (en) Systems, methods and devices for electronic communications having decreased information loss
CN113259827A (en) Hearing-aid method, system, earphone, medium and equipment based on audio encoding and decoding
CN107871509A (en) Method for processing digital audio signal
US20230145725A1 (en) Multi-channel audio signal encoding and decoding method and apparatus
EP4270388A1 (en) Bit allocation method and apparatus for audio object
CN101989425A (en) Method, device and system for multiple description voice frequency coding and decoding
CN105261373B (en) Adaptive grid configuration method and apparatus for bandwidth extension encoding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180403

RJ01 Rejection of invention patent application after publication