WO2022194277A1 - Audio fingerprint processing method and apparatus, and computer device and storage medium - Google Patents

Audio fingerprint processing method and apparatus, and computer device and storage medium

Info

Publication number
WO2022194277A1
WO2022194277A1 PCT/CN2022/081680 CN2022081680W WO2022194277A1 WO 2022194277 A1 WO2022194277 A1 WO 2022194277A1 CN 2022081680 W CN2022081680 W CN 2022081680W WO 2022194277 A1 WO2022194277 A1 WO 2022194277A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
data
target
fingerprint
fingerprint data
Prior art date
Application number
PCT/CN2022/081680
Other languages
English (en)
Chinese (zh)
Inventor
李敬
何莹男
Original Assignee
百果园技术(新加坡)有限公司
李敬
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-03-18
Filing date
2022-03-18
Publication date
2022-09-22
Application filed by 百果园技术(新加坡)有限公司, 李敬
Publication of WO2022194277A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/61 Indexing; Data structures therefor; Storage structures
    • G06F16/65 Clustering; Classification
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/686 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings

Definitions

  • the audio signal contains a large number of frequency components, and multiple frequency components are independent of each other and change continuously along the time axis.
  • the frequency components differ between different audio signals.
  • the characteristics of the audio signal are obtained by analyzing its frequency characteristics.
  • the audio signal in the time domain is usually converted to the frequency domain to obtain a spectrogram, where the horizontal axis (X coordinate) of the spectrogram is time and the vertical axis (Y coordinate) is frequency.
  • a first distance in time between each peak point and each of the other peak points may be measured, and the first distance may be used as the characteristic information of each peak point.
  • a second distance in frequency between each peak point and each of the other peak points may be measured, and the second distance may be used as characteristic information of each peak point.
  • Step 102: Match the target fingerprint data against the reference fingerprint data in the first audio fingerprint database and the reference fingerprint data in the second audio fingerprint database.
  • Embodiment 2: a flowchart of an audio fingerprint processing method is provided in Embodiment 2 of the present application. Building on the foregoing embodiments, this embodiment adds clustering of the target audio data, use of a time-to-live to manage reference fingerprint data, and transfer of reference fingerprint data between databases. The method includes the following steps:
  • if the indicator satisfies the preset database transfer condition, the reference fingerprint data belongs to relatively popular audio data, possibly a newly released song.
  • the reference fingerprint data can be transferred from the second audio fingerprint database to the first audio fingerprint database, and prompt information can be generated to prompt the operator to add copyright information to the audio data to which the reference fingerprint data belongs.
  • the lifetime of the reference fingerprint data in the first audio fingerprint database can also be set equal to or less than the lifetime of the reference fingerprint data in the second audio fingerprint database, that is, the first value is equal to or smaller than the second value, which is not limited in this embodiment.
  • if the time-to-live of reference fingerprint data in the first audio fingerprint database has fully decayed, that is, its current value is 0, the audio data to which the reference fingerprint data belongs is used relatively infrequently.
  • the reference fingerprint data can be deleted from the first audio fingerprint database.
  • this reduces the volume of reference fingerprint data stored in the first audio fingerprint database and frees space in that database, thereby effectively meeting the storage requirements of continuously processing fingerprint data under limited storage capacity.
  • the indicator statistics module is configured to, if reference fingerprint data in the second audio fingerprint database is successfully matched with the target fingerprint data, count an indicator of successful matches for that reference fingerprint data; the fingerprint data transfer module is configured to, if the indicator satisfies the preset database transfer condition, transfer the reference fingerprint data from the second audio fingerprint database to the first audio fingerprint database.
  • computer device 12 takes the form of a general-purpose computing device.
  • Components of computer device 12 may include, but are not limited to, one or more processors or processing units 16, system memory 28, and a bus 18 connecting various system components including system memory 28 and processing unit 16.
  • a program/utility 40 having a set (at least one) of program modules 42, which may be stored, for example, in memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, each or some combination of these examples may include an implementation of a network environment.
  • Program modules 42 generally perform the functions and/or methods of the embodiments described herein.
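
The excerpts above on peak points describe measuring, for each peak of the spectrogram, a first distance in time and a second distance in frequency to each of the other peaks. A minimal Python sketch of that idea follows. It assumes the peaks have already been extracted as (time index, frequency bin) pairs and that "distance" means an absolute difference; the name `peak_features` and the data layout are hypothetical and do not come from the patent.

```python
from typing import List, Tuple

# Assumed layout: (time index, frequency bin) of one spectrogram peak.
Peak = Tuple[int, int]


def peak_features(peaks: List[Peak]) -> List[Tuple[Peak, List[Tuple[int, int]]]]:
    """For each peak, collect (time distance, frequency distance) to every other peak.

    Distances are taken as absolute differences, which is an assumption;
    the excerpts only speak of a "distance in time" and a "distance in frequency".
    """
    ordered = sorted(peaks)  # sort by time, then frequency, for stable output
    features = []
    for i, (t, f) in enumerate(ordered):
        distances = [
            (abs(t2 - t), abs(f2 - f))
            for j, (t2, f2) in enumerate(ordered)
            if j != i  # skip the peak itself
        ]
        features.append(((t, f), distances))
    return features


if __name__ == "__main__":
    for peak, dists in peak_features([(0, 10), (2, 15), (5, 10)]):
        print(peak, dists)
```

Comparing every peak with every other peak grows quadratically with the number of peaks, so a practical system would likely limit the comparison to a neighbourhood of each peak; that limit is left out here to stay close to the wording of the excerpts.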

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Disclosed are an audio fingerprint processing method and apparatus, a computer device, and a storage medium. The audio fingerprint processing method comprises: generating target fingerprint data for target audio data (101); matching the target fingerprint data against reference fingerprint data in a first audio fingerprint database and reference fingerprint data in a second audio fingerprint database, respectively (102); if the matching fails, calling a music query service interface to query copyright information of the target audio data; if the copyright information is found, storing the target fingerprint data in the first audio fingerprint database as new reference fingerprint data of the first audio fingerprint database, and recording the copyright information of the target audio data; and if no copyright information is found, storing the target fingerprint data in the second audio fingerprint database as new reference fingerprint data of the second audio fingerprint database.
PCT/CN2022/081680 2021-03-18 2022-03-18 Audio fingerprint processing method and apparatus, and computer device and storage medium WO2022194277A1 (fr)
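
The workflow in the abstract, together with the match indicator, database transfer, and time-to-live handling mentioned in the excerpts under Definitions, can be sketched in Python as below. This is an illustration under stated assumptions, not the patented implementation: the names `Entry`, `TwoTierStore`, `process`, and `tick` are hypothetical, fingerprints are compared by plain string equality as a stand-in for real fingerprint matching, the transfer condition is assumed to be a simple hit-count threshold, and the `lookup` callable stands in for the music query service interface.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class Entry:
    ttl: int                               # remaining time-to-live, in abstract ticks
    hits: int = 0                          # successful matches (the "indicator")
    copyright_info: Optional[str] = None


@dataclass
class TwoTierStore:
    first_ttl: int = 3                     # "first value" (assumed unit: ticks)
    second_ttl: int = 3                    # "second value"
    promote_after: int = 2                 # hypothetical transfer condition
    first: Dict[str, Entry] = field(default_factory=dict)
    second: Dict[str, Entry] = field(default_factory=dict)

    def process(self, fp: str, lookup: Callable[[str], Optional[str]]) -> str:
        """Match a target fingerprint, or store it in one of the two databases."""
        if fp in self.first:
            self.first[fp].ttl = self.first_ttl  # refresh on use: an assumption
            return "matched-first"
        if fp in self.second:
            entry = self.second[fp]
            entry.hits += 1
            if entry.hits >= self.promote_after:
                # Transfer to the first database; an operator would then be
                # prompted to add copyright information.
                del self.second[fp]
                entry.ttl = self.first_ttl
                self.first[fp] = entry
                return "promoted"
            return "matched-second"
        info = lookup(fp)                  # stand-in for the music query service
        if info is not None:
            self.first[fp] = Entry(ttl=self.first_ttl, copyright_info=info)
            return "stored-first"
        self.second[fp] = Entry(ttl=self.second_ttl)
        return "stored-second"

    def tick(self) -> None:
        """Decay every time-to-live by one and delete entries that reach 0."""
        for db in (self.first, self.second):
            for key in list(db):
                db[key].ttl -= 1
                if db[key].ttl <= 0:
                    del db[key]
```

A short usage example, with a dictionary playing the role of the query service: `store = TwoTierStore(); store.process("abc", {"abc": "Label A"}.get)` stores the fingerprint in the first database along with its copyright information, while an unknown fingerprint lands in the second database and is transferred once it has been matched `promote_after` times.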

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110292844.8 2021-03-18
CN202110292844.8A CN112784100A (zh) 2021-03-18 2021-03-18 Audio fingerprint processing method and apparatus, computer device and storage medium

Publications (1)

Publication Number Publication Date
WO2022194277A1 (fr) 2022-09-22

Family

ID=75762743

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/081680 WO2022194277A1 (fr) 2021-03-18 2022-03-18 Audio fingerprint processing method and apparatus, and computer device and storage medium

Country Status (2)

Country Link
CN (1) CN112784100A (fr)
WO (1) WO2022194277A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112784100A (zh) * 2021-03-18 2021-05-11 百果园技术(新加坡)有限公司 Audio fingerprint processing method and apparatus, computer device and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101453333A (zh) * 2008-10-16 2009-06-10 北京光线传媒有限公司 Copyright identification method, apparatus and system for media files
US20120191231A1 (en) * 2010-05-04 2012-07-26 Shazam Entertainment Ltd. Methods and Systems for Identifying Content in Data Stream by a Client Device
US20140012572A1 (en) * 2011-12-30 2014-01-09 Tilman Herberger System and method for content recognition in portable devices
US20160247512A1 (en) * 2014-11-21 2016-08-25 Thomson Licensing Method and apparatus for generating fingerprint of an audio signal
CN107967922A (zh) * 2017-12-19 2018-04-27 成都嗨翻屋文化传播有限公司 Feature-based music copyright identification method
CN110047515A (zh) * 2019-04-04 2019-07-23 腾讯音乐娱乐科技(深圳)有限公司 Audio recognition method, apparatus, device and storage medium
CN112784100A (zh) * 2021-03-18 2021-05-11 百果园技术(新加坡)有限公司 Audio fingerprint processing method and apparatus, computer device and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140172429A1 (en) * 2012-12-14 2014-06-19 Microsoft Corporation Local recognition of content
CN109657093A (zh) * 2018-11-27 2019-04-19 腾讯音乐娱乐科技(深圳)有限公司 Audio retrieval method, apparatus and storage medium
CN111599378A (zh) * 2020-04-30 2020-08-28 讯飞智元信息科技有限公司 Audio matching method, electronic device and storage medium

Also Published As

Publication number Publication date
CN112784100A (zh) 2021-05-11

Similar Documents

Publication Publication Date Title
US11314805B2 (en) Method and apparatus for retrieving audio file, server, and computer-readable storage medium
Haitsma et al. A highly robust audio fingerprinting system with an efficient search strategy
Haitsma et al. A highly robust audio fingerprinting system.
Cano et al. Robust sound modeling for song detection in broadcast audio
US9542488B2 (en) Associating audio tracks with video content
EP3508986B1 (fr) Music cover identification for search, compliance and licensing
US8706276B2 (en) Systems, methods, and media for identifying matching audio
US7031921B2 (en) System for monitoring audio content available over a network
JP5907511B2 (ja) System and method for audio media recognition
US20140280304A1 (en) Matching versions of a known song to an unknown song
JP2004536348A (ja) Automatic identification of sound recordings
CN108447501A (zh) Pirated video detection method and system based on audio words in a cloud storage environment
EP3945435A1 (fr) Dynamic identification of unknown media
WO2022194277A1 (fr) Audio fingerprint processing method and apparatus, and computer device and storage medium
US20220238087A1 (en) Methods and systems for determining compact semantic representations of digital audio signals
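JP4267463B2 (ja) Method for identifying audio content, method and system for forming a feature identifying a portion of a recording of an audio signal, method for determining whether an audio stream contains at least a portion of a known recording of an audio signal, computer program, and system for identifying a recording of an audio signal
Zhang et al. An encrypted speech retrieval algorithm based on Chirp-Z transform and perceptual hashing second feature extraction
WO2022161291A1 (fr) Audio search method and apparatus, computer device and storage medium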
Kekre et al. A review of audio fingerprinting and comparison of algorithms
Li et al. Low-order auditory Zernike moment: a novel approach for robust music identification in the compressed domain
KR101002732B1 (ko) Online digital content management system
You et al. Music Identification System Using MPEG‐7 Audio Signature Descriptors
Hellmuth et al. Advanced audio identification using MPEG-7 content description
Chickanbanjar Comparative analysis between audio fingerprinting algorithms
CN117807564A (zh) Infringement identification method, apparatus, device and medium for audio data

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application. Ref document number: 22770629; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase. Ref country code: DE
122 EP: PCT application non-entry in European phase. Ref document number: 22770629; Country of ref document: EP; Kind code of ref document: A1