CN111444376A - Audio fingerprint identification method and device and equipment - Google Patents

Audio fingerprint identification method and device and equipment Download PDF

Info

Publication number
CN111444376A
CN111444376A CN202010293633.1A CN202010293633A CN111444376A CN 111444376 A CN111444376 A CN 111444376A CN 202010293633 A CN202010293633 A CN 202010293633A CN 111444376 A CN111444376 A CN 111444376A
Authority
CN
China
Prior art keywords
audio fingerprint
audio
identified
user
personal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010293633.1A
Other languages
Chinese (zh)
Inventor
肖龙源
李稀敏
刘晓葳
谭玉坤
叶志坚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Kuaishangtong Technology Co Ltd
Original Assignee
Xiamen Kuaishangtong Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Kuaishangtong Technology Co Ltd filed Critical Xiamen Kuaishangtong Technology Co Ltd
Priority to CN202010293633.1A priority Critical patent/CN111444376A/en
Publication of CN111444376A publication Critical patent/CN111444376A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/61Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Game Theory and Decision Science (AREA)
  • Collating Specific Patterns (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an audio fingerprint identification method, an audio fingerprint identification device and audio fingerprint identification equipment. Wherein the method comprises the following steps: the method comprises the steps of collecting audio data and personal features of at least one user, carrying out audio fingerprint extraction on the audio data of each user according to the personal features of each user, constructing an audio fingerprint database related to at least one common feature of the personal features according to the personal features and the audio fingerprints, and identifying a user corresponding to an audio fingerprint with the highest similarity with the audio fingerprint to be identified from the audio fingerprint database related to at least one common feature of the personal features corresponding to the audio fingerprint to be identified according to the audio fingerprint to be identified and the personal features corresponding to the audio fingerprint to be identified, wherein the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database. Through the mode, the efficiency of identifying the audio fingerprints can be improved.

Description

Audio fingerprint identification method and device and equipment
Technical Field
The present invention relates to the field of audio fingerprint technologies, and in particular, to an audio fingerprint identification method, apparatus, and device.
Background
The audio fingerprint refers to that unique digital features in a piece of audio are extracted in the form of identifiers through a specific algorithm and are used for identifying massive sound samples or tracking and positioning the positions of the samples in a database. The audio fingerprint is used as a core algorithm of a content automatic identification technology, and is widely applied to the fields of music identification, copyright content monitoring and broadcasting, content library duplicate removal, television second screen interaction and the like.
In the existing audio fingerprint identification scheme, similarity comparison is generally performed between an audio fingerprint to be identified and all audio fingerprints in an audio fingerprint database, and a user corresponding to the audio fingerprint with the highest similarity is compared and is a user corresponding to the audio fingerprint to be identified.
However, the inventors found that at least the following problems exist in the prior art:
according to the existing audio fingerprint identification scheme, in the identification process of the audio fingerprint, the similarity comparison is required to be carried out on the audio fingerprint to be identified and all the audio fingerprints in the audio fingerprint database, the time consumption of the similarity comparison process is long, and the identification efficiency of the audio fingerprint is general.
Disclosure of Invention
In view of this, the present invention provides an audio fingerprint identification method, an audio fingerprint identification device, and an audio fingerprint identification apparatus, which can improve the audio fingerprint identification efficiency.
According to an aspect of the present invention, there is provided an audio fingerprint identification method, including:
collecting audio data and personal characteristics of at least one user; according to the personal characteristics of each user, audio fingerprint extraction is carried out on the audio data of each user; constructing an audio fingerprint database associating at least one common characteristic of the personal characteristics according to the personal characteristics and the audio fingerprint; according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, identifying a user corresponding to the audio fingerprint with the highest similarity with the audio fingerprint to be identified from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristics of the audio fingerprint to be identified; and the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database.
Wherein, the audio fingerprint extraction of the audio data of each user according to the personal characteristics of each user comprises: according to the personal characteristics of each user, defining a rule set of user grouping of at least one common characteristic related to the personal characteristics, grouping each user according to the defined rule set, and simultaneously or respectively carrying out audio fingerprint extraction on the audio data of each group of grouped users.
Wherein said constructing an audio fingerprint database associating at least one common characteristic of said personal characteristics from said personal characteristics and said audio fingerprints comprises: and screening out a set of audio fingerprint data associated with at least one common characteristic of the personal characteristics according to the personal characteristics and the audio fingerprints, and constructing an audio fingerprint database associated with at least one common characteristic of the personal characteristics according to the screened set of audio fingerprint data.
Wherein, according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, identifying the user corresponding to the audio fingerprint with the highest similarity to the audio fingerprint to be identified from the audio fingerprint database associated with at least one common characteristic of the personal characteristics corresponding to the audio fingerprint to be identified comprises: the method comprises the steps of dividing an audio fingerprint to be identified into at least two audio fingerprint segments, and identifying a user corresponding to the audio fingerprint with the highest similarity with the at least two audio fingerprint segments from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristic of the audio fingerprint to be identified according to the personal characteristic corresponding to the audio fingerprint to be identified.
Wherein, after the identifying, according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, a user corresponding to an audio fingerprint with the highest similarity to the audio fingerprint to be identified from an audio fingerprint database associated with at least one common characteristic of the personal characteristics corresponding to the audio fingerprint to be identified, the method further comprises: and correcting the corresponding audio fingerprint associated with the user according to the audio fingerprint to be identified.
According to another aspect of the present invention, there is provided an audio fingerprint recognition apparatus including: the device comprises an acquisition module, an extraction module, a construction module and an identification module; the acquisition module is used for acquiring audio data and personal characteristics of at least one user; the extraction module is used for extracting the audio fingerprint of the audio data of each user according to the personal characteristics of each user; the construction module is used for constructing an audio fingerprint database which is related to at least one common characteristic of the personal characteristics according to the personal characteristics and the audio fingerprint; the identification module is used for identifying a user corresponding to the audio fingerprint with the highest similarity to the audio fingerprint to be identified from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristic of the audio fingerprint to be identified according to the audio fingerprint to be identified and the personal characteristic corresponding to the audio fingerprint to be identified; and the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database.
Wherein, the extraction module is specifically configured to: according to the personal characteristics of each user, defining a rule set of user grouping of at least one common characteristic related to the personal characteristics, grouping each user according to the defined rule set, and simultaneously or respectively carrying out audio fingerprint extraction on the audio data of each group of grouped users.
Wherein the building block is specifically configured to: and screening out a set of audio fingerprint data associated with at least one common characteristic of the personal characteristics according to the personal characteristics and the audio fingerprints, and constructing an audio fingerprint database associated with at least one common characteristic of the personal characteristics according to the screened set of audio fingerprint data.
The identification module is specifically configured to: the method comprises the steps of dividing an audio fingerprint to be identified into at least two audio fingerprint segments, and identifying a user corresponding to the audio fingerprint with the highest similarity with the at least two audio fingerprint segments from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristic of the audio fingerprint to be identified according to the personal characteristic corresponding to the audio fingerprint to be identified.
Wherein, the device for identifying the audio fingerprint further comprises: a correction module; and the correction module is used for correcting the corresponding audio fingerprint associated with the user according to the audio fingerprint to be identified.
According to still another aspect of the present invention, there is provided an audio fingerprint recognition apparatus including: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform any of the methods of audio fingerprint identification described above.
According to a further aspect of the present invention, there is provided a computer-readable storage medium storing a computer program which, when executed by a processor, implements the method of audio fingerprint identification of any of the above.
It can be found that, by the above scheme, the audio data and the personal characteristics of at least one user can be collected, the audio data of each user can be subjected to audio fingerprint extraction according to the personal characteristics of each user, an audio fingerprint database associated with at least one common characteristic of the personal characteristics can be constructed according to the personal characteristics and the audio fingerprint, and a user corresponding to an audio fingerprint with the highest similarity to the audio fingerprint to be identified can be identified from the audio fingerprint database associated with at least one common characteristic of the personal characteristics corresponding to the audio fingerprint to be identified according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, wherein the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database, so that the time consumption of the similarity comparison process of the audio fingerprint to be identified and the audio fingerprint in the audio fingerprint database can be shortened, and then can realize improving the discernment efficiency of audio frequency fingerprint.
Further, according to the above scheme, a rule set for grouping users associated with at least one common characteristic of the personal characteristics may be defined according to the personal characteristics of each user, each user may be grouped according to the defined rule set, and audio fingerprint extraction may be performed on the audio data of each group of users grouped simultaneously or respectively.
Furthermore, according to the scheme, the set of the audio fingerprint data associated with the personal characteristic and the audio fingerprint can be screened out according to the personal characteristic and the audio fingerprint, and the audio fingerprint database associated with at least one common characteristic of the personal characteristic can be constructed according to the screened set of the audio fingerprint data.
Furthermore, the above scheme can divide the audio fingerprint to be identified into at least two audio fingerprint segments, and according to the personal characteristics corresponding to the audio fingerprint to be identified, identify the user corresponding to the audio fingerprint with the highest similarity with the at least two audio fingerprint segments from the audio fingerprint database associated with the at least one common characteristic corresponding to the personal characteristics of the audio fingerprint to be identified.
Furthermore, according to the above scheme, the audio fingerprint associated with the corresponding user can be corrected according to the audio fingerprint to be identified, which has the advantage of being able to improve the accuracy of the audio voiceprint of the corresponding user in the constructed audio fingerprint database.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a flowchart illustrating an audio fingerprint recognition method according to an embodiment of the present invention;
FIG. 2 is a flowchart illustrating an audio fingerprint recognition method according to another embodiment of the present invention;
FIG. 3 is a schematic structural diagram of an apparatus for identifying an audio fingerprint according to an embodiment of the present invention;
FIG. 4 is a schematic structural diagram of an apparatus for identifying an audio fingerprint according to another embodiment of the present invention;
fig. 5 is a schematic structural diagram of an embodiment of an audio fingerprint identification device according to the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and examples. It is to be noted that the following examples are only illustrative of the present invention, and do not limit the scope of the present invention. Similarly, the following examples are only some but not all examples of the present invention, and all other examples obtained by those skilled in the art without any inventive work are within the scope of the present invention.
The invention provides an audio fingerprint identification method which can improve the identification efficiency of audio fingerprints.
Referring to fig. 1, fig. 1 is a flowchart illustrating an audio fingerprint identification method according to an embodiment of the present invention. It should be noted that the method of the present invention is not limited to the flow sequence shown in fig. 1 if the results are substantially the same. As shown in fig. 1, the method comprises the steps of:
s101: audio data and personal characteristics of at least one user are collected.
In this embodiment, the audio data of multiple users may be collected at one time, may be collected for multiple times, may be collected one by one, and the like, and the present invention is not limited thereto.
In this embodiment, multiple audio data of the same user may be collected, a single audio data of the same user may be collected, multiple audio data of multiple users may be collected, and the like, which is not limited in the present invention.
In this embodiment, the audio data may be a live voice audio, a recorded voice audio, a voice audio after calculation, or the like, and the present invention is not limited thereto.
In this embodiment, the personal characteristic may be gender, age, nationality, or the like, and the present invention is not limited thereto.
S102: and performing audio fingerprint extraction on the audio data of each user according to the personal characteristics of each user.
Wherein, the audio fingerprint extraction of the audio data of each user according to the personal characteristics of each user may include:
the method has the advantages that the audio fingerprint extraction can be carried out on the audio data of each grouped user simultaneously or respectively, the time consumption for carrying out the audio fingerprint extraction on the audio data of each user can be shortened, the extraction efficiency of the audio fingerprint can be improved, and meanwhile, the audio fingerprints of each grouped user can be conveniently managed in a grouping mode.
In this embodiment, the at least one common characteristic of the personal characteristics may be a characteristic comprising the same gender and/or the same age and/or the same ethnicity and/or the same nationality, etc.
S103: and constructing an audio fingerprint database which is associated with at least one common characteristic of the personal characteristic according to the personal characteristic and the audio fingerprint.
Wherein the constructing an audio fingerprint database of at least one common feature associated with the personal feature based on the personal feature and the audio fingerprint may comprise:
according to the personal characteristic and the audio fingerprint, a set of audio fingerprint data which is associated with the personal characteristic and has one common characteristic is screened out, and an audio fingerprint database which is associated with at least one common characteristic of the personal characteristic is constructed according to the screened set of audio fingerprint data.
S104: according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, identifying a user corresponding to the audio fingerprint with the highest similarity with the audio fingerprint to be identified from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristics of the audio fingerprint to be identified; and the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database.
Wherein, the identifying, according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, a user corresponding to an audio fingerprint with a highest similarity to the audio fingerprint to be identified from the audio fingerprint database associated with at least one common characteristic of the personal characteristics corresponding to the audio fingerprint to be identified may include:
the method has the advantages that the audio fingerprint to be recognized can be divided into at least two audio fingerprint sections, and the user corresponding to the audio fingerprint with the highest similarity with the at least two audio fingerprint sections is recognized from the audio fingerprint database which is associated with at least one common characteristic corresponding to the personal characteristic of the audio fingerprint to be recognized according to the personal characteristic corresponding to the audio fingerprint to be recognized.
After identifying the user corresponding to the audio fingerprint with the highest similarity to the audio fingerprint to be identified from the audio fingerprint database associated with at least one common feature of the personal features corresponding to the audio fingerprint to be identified according to the audio fingerprint to be identified and the personal features corresponding to the audio fingerprint to be identified, the method may further include:
and correcting the audio fingerprint associated with the corresponding user according to the audio fingerprint to be identified, so that the advantage of improving the accuracy of the audio voiceprint of the corresponding user in the constructed audio fingerprint database can be realized.
It can be found that, in this embodiment, the audio data and the personal characteristics of at least one user may be collected, the audio data of each user may be subjected to audio fingerprint extraction according to the personal characteristics of each user, an audio fingerprint database associating at least one common characteristic of the personal characteristics may be constructed according to the personal characteristics and the audio fingerprints, and a user corresponding to an audio fingerprint with the highest similarity to the audio fingerprint to be recognized may be identified from the audio fingerprint database associating at least one common characteristic of the personal characteristics corresponding to the audio fingerprint to be recognized according to the audio fingerprint to be recognized and the personal characteristics corresponding to the audio fingerprint to be recognized, where the user corresponding to the audio fingerprint to be recognized is the user in the constructed audio fingerprint database, so as to shorten the time consumption of the similarity comparison process between the audio fingerprint to be recognized and the audio fingerprint in the audio fingerprint database, and then can realize improving the discernment efficiency of audio frequency fingerprint.
Further, in this embodiment, a rule set for grouping users associated with at least one common feature of the personal features may be defined according to the personal features of each user, each user may be grouped according to the defined rule set, and audio fingerprint extraction may be performed on the audio data of each group of users that are grouped simultaneously or respectively.
Further, in this embodiment, a set of audio fingerprint data associated with the personal characteristic and corresponding to one common characteristic may be screened out according to the personal characteristic and the audio fingerprint, and an audio fingerprint database associated with at least one common characteristic of the personal characteristic may be constructed according to the screened set of audio fingerprint data.
Further, in this embodiment, the audio fingerprint to be recognized may be divided into at least two audio fingerprint segments, and according to the personal characteristics corresponding to the audio fingerprint to be recognized, the user corresponding to the audio fingerprint with the highest similarity to the at least two audio fingerprint segments is identified from the audio fingerprint database associated with the at least one common characteristic corresponding to the personal characteristics of the audio fingerprint to be recognized.
Referring to fig. 2, fig. 2 is a flowchart illustrating an audio fingerprint identification method according to another embodiment of the present invention. In this embodiment, the method includes the steps of:
s201: audio data and personal characteristics of at least one user are collected.
As described above in S101, further description is omitted here.
S202: and performing audio fingerprint extraction on the audio data of each user according to the personal characteristics of each user.
As described above in S102, further description is omitted here.
S203: and constructing an audio fingerprint database which is associated with at least one common characteristic of the personal characteristic according to the personal characteristic and the audio fingerprint.
As described above in S103, which is not described herein.
S204: according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, identifying a user corresponding to the audio fingerprint with the highest similarity with the audio fingerprint to be identified from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristics of the audio fingerprint to be identified; and the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database.
As described above in S104, and will not be described herein.
S205: and correcting the audio fingerprint associated with the corresponding user according to the audio fingerprint to be identified.
It can be found that, in the present embodiment, the audio fingerprint associated with the corresponding user can be corrected according to the audio fingerprint to be identified, which has the advantage of being able to achieve an improvement in the accuracy of the audio voiceprint of the corresponding user in the constructed audio fingerprint database.
The invention also provides an audio fingerprint identification device, which can improve the identification efficiency of the audio fingerprint.
Referring to fig. 3, fig. 3 is a schematic structural diagram of an audio fingerprint identification device according to an embodiment of the present invention. In this embodiment, the device 30 for identifying an audio fingerprint includes an acquisition module 31, an extraction module 32, a construction module 33, and an identification module 34.
The acquisition module 31 is configured to acquire audio data and personal characteristics of at least one user.
The extracting module 32 is configured to perform audio fingerprint extraction on the audio data of each user according to the personal characteristics of each user.
The building module 33 is configured to build an audio fingerprint database associating at least one common feature of the personal feature with the audio fingerprint according to the personal feature and the audio fingerprint.
The identification module 34 is configured to identify, according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, a user corresponding to an audio fingerprint with a highest similarity to the audio fingerprint to be identified from an audio fingerprint database associated with at least one common characteristic of the personal characteristics corresponding to the audio fingerprint to be identified; and the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database.
Optionally, the extracting module 32 may be specifically configured to:
according to the personal characteristics of each user, defining a rule set of user grouping of at least one common characteristic related to the personal characteristics, grouping each user according to the defined rule set, and simultaneously or respectively carrying out audio fingerprint extraction on the audio data of each group of grouped users.
Optionally, the building block 33 may be specifically configured to:
and screening out a set of audio fingerprint data which is associated with the personal characteristic and has one common characteristic according to the personal characteristic and the audio fingerprint, and constructing an audio fingerprint database which is associated with at least one common characteristic of the personal characteristic according to the screened set of audio fingerprint data.
Optionally, the identification module 34 may be specifically configured to:
the audio fingerprint to be identified is divided into at least two audio fingerprint segments, and according to the personal characteristics corresponding to the audio fingerprint to be identified, the user corresponding to the audio fingerprint with the highest similarity of the at least two audio fingerprint segments is identified from the audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristics of the audio fingerprint to be identified.
Referring to fig. 4, fig. 4 is a schematic structural diagram of an audio fingerprint identification device according to another embodiment of the present invention. Different from the previous embodiment, the audio fingerprint identification apparatus 40 of the present embodiment further includes a correction module 41.
The correcting module 41 is configured to correct the audio fingerprint associated with the corresponding user according to the audio fingerprint to be identified.
Each unit module of the audio fingerprint identification device 30/40 may perform the corresponding steps in the above method embodiments, so that the detailed description of each unit module is omitted here, and please refer to the description of the corresponding steps above.
The present invention further provides an audio fingerprint recognition apparatus, as shown in fig. 5, including: at least one processor 51; and a memory 52 communicatively coupled to the at least one processor 51; the memory 52 stores instructions executable by the at least one processor 51, and the instructions are executed by the at least one processor 51 to enable the at least one processor 51 to execute the above-mentioned audio fingerprint identification method.
Wherein the memory 52 and the processor 51 are coupled in a bus, which may comprise any number of interconnected buses and bridges, which couple one or more of the various circuits of the processor 51 and the memory 52 together. The bus may also connect various other circuits such as peripherals, voltage regulators, power management circuits, and the like, which are well known in the art, and therefore, will not be described any further herein. A bus interface provides an interface between the bus and the transceiver. The transceiver may be one element or a plurality of elements, such as a plurality of receivers and transmitters, providing a means for communicating with various other apparatus over a transmission medium. The data processed by the processor 51 is transmitted over a wireless medium via an antenna, which further receives the data and transmits the data to the processor 51.
The processor 51 is responsible for managing the bus and general processing and may also provide various functions including timing, peripheral interfaces, voltage regulation, power management, and other control functions. And the memory 52 may be used to store data used by the processor 51 in performing operations.
The present invention further provides a computer-readable storage medium storing a computer program. The computer program realizes the above-described method embodiments when executed by a processor.
It can be found that, by the above scheme, the audio data and the personal characteristics of at least one user can be collected, the audio data of each user can be subjected to audio fingerprint extraction according to the personal characteristics of each user, an audio fingerprint database associated with at least one common characteristic of the personal characteristics can be constructed according to the personal characteristics and the audio fingerprint, and a user corresponding to an audio fingerprint with the highest similarity to the audio fingerprint to be identified can be identified from the audio fingerprint database associated with at least one common characteristic of the personal characteristics corresponding to the audio fingerprint to be identified according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, wherein the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database, so that the time consumption of the similarity comparison process of the audio fingerprint to be identified and the audio fingerprint in the audio fingerprint database can be shortened, and then can realize improving the discernment efficiency of audio frequency fingerprint.
Further, according to the above scheme, a rule set for grouping users associated with at least one common characteristic of the personal characteristics may be defined according to the personal characteristics of each user, each user may be grouped according to the defined rule set, and audio fingerprint extraction may be performed on the audio data of each group of users grouped simultaneously or respectively.
Furthermore, according to the scheme, the set of the audio fingerprint data associated with the personal characteristic and the audio fingerprint can be screened out according to the personal characteristic and the audio fingerprint, and the audio fingerprint database associated with at least one common characteristic of the personal characteristic can be constructed according to the screened set of the audio fingerprint data.
Furthermore, the above scheme can divide the audio fingerprint to be identified into at least two audio fingerprint segments, and according to the personal characteristics corresponding to the audio fingerprint to be identified, identify the user corresponding to the audio fingerprint with the highest similarity with the at least two audio fingerprint segments from the audio fingerprint database associated with the at least one common characteristic corresponding to the personal characteristics of the audio fingerprint to be identified.
Furthermore, according to the above scheme, the audio fingerprint associated with the corresponding user can be corrected according to the audio fingerprint to be identified, which has the advantage of being able to improve the accuracy of the audio voiceprint of the corresponding user in the constructed audio fingerprint database.
In the several embodiments provided in the present invention, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, a division of a module or a unit is merely a logical division, and an actual implementation may have another division, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
Units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be substantially or partially implemented in the form of a software product stored in a storage medium and including instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) or a processor (processor) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
The above description is only a part of the embodiments of the present invention, and not intended to limit the scope of the present invention, and all equivalent devices or equivalent processes performed by the present invention through the contents of the specification and the drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (10)

1. A method for identifying an audio fingerprint, comprising:
collecting audio data and personal characteristics of at least one user;
according to the personal characteristics of each user, audio fingerprint extraction is carried out on the audio data of each user;
constructing an audio fingerprint database associating at least one common characteristic of the personal characteristics according to the personal characteristics and the audio fingerprint;
according to the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, identifying a user corresponding to the audio fingerprint with the highest similarity with the audio fingerprint to be identified from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristics of the audio fingerprint to be identified; and the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database.
2. The method for identifying audio fingerprints as claimed in claim 1, wherein the audio fingerprint extraction of the audio data of each user according to the personal characteristics of each user comprises:
according to the personal characteristics of each user, defining a rule set of user grouping of at least one common characteristic related to the personal characteristics, grouping each user according to the defined rule set, and simultaneously or respectively carrying out audio fingerprint extraction on the audio data of each group of grouped users.
3. The method for identifying audio fingerprints as claimed in claim 1, wherein the constructing an audio fingerprint database associating at least one common feature of the personal features based on the personal features and the audio fingerprints comprises:
and screening out a set of audio fingerprint data associated with at least one common characteristic of the personal characteristics according to the personal characteristics and the audio fingerprints, and constructing an audio fingerprint database associated with at least one common characteristic of the personal characteristics according to the screened set of audio fingerprint data.
4. The method for identifying audio fingerprints according to claim 1, wherein the identifying, from the audio fingerprint to be identified and the personal characteristics corresponding to the audio fingerprint to be identified, the user corresponding to the audio fingerprint with the highest similarity to the audio fingerprint to be identified from the audio fingerprint database associated with at least one common characteristic of the personal characteristics corresponding to the audio fingerprint to be identified comprises:
the method comprises the steps of dividing an audio fingerprint to be identified into at least two audio fingerprint segments, and identifying a user corresponding to the audio fingerprint with the highest similarity with the at least two audio fingerprint segments from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristic of the audio fingerprint to be identified according to the personal characteristic corresponding to the audio fingerprint to be identified.
5. The method for identifying audio fingerprints as claimed in claim 1, wherein after identifying the user corresponding to the audio fingerprint with the highest similarity to the audio fingerprint to be identified from the audio fingerprint database associated with at least one common feature corresponding to the personal feature of the audio fingerprint to be identified according to the audio fingerprint to be identified and the personal feature corresponding to the audio fingerprint to be identified, the method further comprises:
and correcting the corresponding audio fingerprint associated with the user according to the audio fingerprint to be identified.
6. An apparatus for identifying an audio fingerprint, comprising:
the device comprises an acquisition module, an extraction module, a construction module and an identification module;
the acquisition module is used for acquiring audio data and personal characteristics of at least one user;
the extraction module is used for extracting the audio fingerprint of the audio data of each user according to the personal characteristics of each user;
the construction module is used for constructing an audio fingerprint database which is related to at least one common characteristic of the personal characteristics according to the personal characteristics and the audio fingerprint;
the identification module is used for identifying a user corresponding to the audio fingerprint with the highest similarity to the audio fingerprint to be identified from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristic of the audio fingerprint to be identified according to the audio fingerprint to be identified and the personal characteristic corresponding to the audio fingerprint to be identified; and the user corresponding to the audio fingerprint to be identified is the user in the constructed audio fingerprint database.
7. The apparatus for identifying an audio fingerprint according to claim 6, wherein the extracting module is specifically configured to:
according to the personal characteristics of each user, defining a rule set of user grouping of at least one common characteristic related to the personal characteristics, grouping each user according to the defined rule set, and simultaneously or respectively carrying out audio fingerprint extraction on the audio data of each group of grouped users.
8. The apparatus for identifying an audio fingerprint according to claim 6, wherein the construction module is specifically configured to:
and screening out a set of audio fingerprint data associated with at least one common characteristic of the personal characteristics according to the personal characteristics and the audio fingerprints, and constructing an audio fingerprint database associated with at least one common characteristic of the personal characteristics according to the screened set of audio fingerprint data.
9. The apparatus for identifying an audio fingerprint according to claim 6, wherein the identification module is specifically configured to:
the method comprises the steps of dividing an audio fingerprint to be identified into at least two audio fingerprint segments, and identifying a user corresponding to the audio fingerprint with the highest similarity with the at least two audio fingerprint segments from an audio fingerprint database associated with at least one common characteristic corresponding to the personal characteristic of the audio fingerprint to be identified according to the personal characteristic corresponding to the audio fingerprint to be identified.
10. The apparatus for audio fingerprint recognition according to claim 6, further comprising:
a correction module;
and the correction module is used for correcting the corresponding audio fingerprint associated with the user according to the audio fingerprint to be identified.
CN202010293633.1A 2020-04-15 2020-04-15 Audio fingerprint identification method and device and equipment Pending CN111444376A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010293633.1A CN111444376A (en) 2020-04-15 2020-04-15 Audio fingerprint identification method and device and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010293633.1A CN111444376A (en) 2020-04-15 2020-04-15 Audio fingerprint identification method and device and equipment

Publications (1)

Publication Number Publication Date
CN111444376A true CN111444376A (en) 2020-07-24

Family

ID=71653127

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010293633.1A Pending CN111444376A (en) 2020-04-15 2020-04-15 Audio fingerprint identification method and device and equipment

Country Status (1)

Country Link
CN (1) CN111444376A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130259211A1 (en) * 2012-03-28 2013-10-03 Kevin Vlack System and method for fingerprinting datasets
CN105430494A (en) * 2015-12-02 2016-03-23 百度在线网络技术(北京)有限公司 Method and device for identifying audio from video in video playback equipment
CN108280074A (en) * 2017-01-05 2018-07-13 北京酷我科技有限公司 The recognition methods of audio and system
CN109036436A (en) * 2018-09-18 2018-12-18 广州势必可赢网络科技有限公司 A kind of voice print database method for building up, method for recognizing sound-groove, apparatus and system
CN109271501A (en) * 2018-09-19 2019-01-25 北京容联易通信息技术有限公司 A kind of management method and system of audio database
CN109657093A (en) * 2018-11-27 2019-04-19 腾讯音乐娱乐科技(深圳)有限公司 Audio search method, device and storage medium
CN110956966A (en) * 2019-11-01 2020-04-03 平安科技(深圳)有限公司 Voiceprint authentication method, voiceprint authentication device, voiceprint authentication medium and electronic equipment

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130259211A1 (en) * 2012-03-28 2013-10-03 Kevin Vlack System and method for fingerprinting datasets
CN105430494A (en) * 2015-12-02 2016-03-23 百度在线网络技术(北京)有限公司 Method and device for identifying audio from video in video playback equipment
CN108280074A (en) * 2017-01-05 2018-07-13 北京酷我科技有限公司 The recognition methods of audio and system
CN109036436A (en) * 2018-09-18 2018-12-18 广州势必可赢网络科技有限公司 A kind of voice print database method for building up, method for recognizing sound-groove, apparatus and system
CN109271501A (en) * 2018-09-19 2019-01-25 北京容联易通信息技术有限公司 A kind of management method and system of audio database
CN109657093A (en) * 2018-11-27 2019-04-19 腾讯音乐娱乐科技(深圳)有限公司 Audio search method, device and storage medium
CN110956966A (en) * 2019-11-01 2020-04-03 平安科技(深圳)有限公司 Voiceprint authentication method, voiceprint authentication device, voiceprint authentication medium and electronic equipment

Similar Documents

Publication Publication Date Title
US10366275B2 (en) Method and device for improving fingerprint template, and terminal device
CN103548076A (en) Device and method for recognizing content using audio signals
WO2019052162A1 (en) Method, apparatus and device for improving data cleaning efficiency, and readable storage medium
CN111210842A (en) Voice quality inspection method, device, terminal and computer readable storage medium
CN110991170A (en) Chinese disease name intelligent standardization method and system based on electronic medical record information
CN111104540A (en) Image searching method, device, equipment and computer readable storage medium
CN110889009B (en) Voiceprint clustering method, voiceprint clustering device, voiceprint processing equipment and computer storage medium
CN112199935A (en) Data comparison method and device, electronic equipment and computer readable storage medium
CN114722199A (en) Risk identification method and device based on call recording, computer equipment and medium
CN111027316A (en) Text processing method and device, electronic equipment and computer readable storage medium
CN112307318A (en) Content publishing method, system and device
CN112232290B (en) Data clustering method, server, system and computer readable storage medium
CN111767419B (en) Picture searching method, device, equipment and computer readable storage medium
CN111444376A (en) Audio fingerprint identification method and device and equipment
CN115545809B (en) Method for constructing standard library of electronic commerce commodity, data alignment method, device and equipment
CN113925517B (en) Cognitive disorder recognition method, device and medium based on electroencephalogram signals
CN111326163B (en) Voiceprint recognition method, device and equipment
CN114817645A (en) Time sequence data storage and reading method, device, equipment and storage medium
CN111782684B (en) Distribution network electronic handover information matching method and device
CN111460209A (en) Audio fingerprint retrieval method and device and equipment
CN113470630A (en) Voice recognition method, system, device and storage medium based on big data
CN112016466A (en) Face recognition method, face recognition system, electronic device and computer storage medium
CN111581430B (en) Audio fingerprint generation method and device and equipment
CN111522991B (en) Audio fingerprint extraction method, device and equipment
CN108241708B (en) Media name processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200724