CN102402983A

CN102402983A - Cloud data center speech recognition method

Info

Publication number: CN102402983A
Application number: CN2011103801667A
Authority: CN
Inventors: 吕广杰; 朱锦雷; 朱波
Original assignee: Inspur Electronic Information Industry Co Ltd
Current assignee: Inspur Electronic Information Industry Co Ltd
Priority date: 2011-11-25
Filing date: 2011-11-25
Publication date: 2012-04-04

Abstract

The invention provides a cloud data center speech recognition method. An HBR110 chip processes and analyzes speech using the dynamic time warping algorithm and identifies the permissions of the speaker, thereby realizing speech recognition. The system design comprises overall architecture design, hardware design and software design, of which the overall architecture design is the primary design task. The overall architecture is designed by analyzing the system requirements and surveying the mainstream speech recognition products on the market. The HBR110 human speech recognition processor chip is selected and combined with an 8031 single-chip microcomputer, an audio amplification circuit, an SPI FLASH memory and the necessary peripheral circuits; speech is processed and analyzed using the dynamic time warping algorithm to realize the security functions of speech recognition and permission assignment.

Description

A cloud data center speech recognition method

Technical field

The present invention relates to the field of computer applications, and specifically to a cloud data center speech recognition method.

Background art

With the development of information technology, cloud computing has gradually become a focus of the industry, and the cloud computing service platforms of major enterprises at home and abroad have been put into use in many fields, such as science, education, culture, health, government, high-performance computing, e-commerce and the Internet of Things.

To ensure the security of the cloud data center, a password identification system is installed in the machine room of most cloud data centers. However, traditional text passwords are easy to copy, easy to steal and easy to forget, which leaves password identification systems with many security vulnerabilities.

Voice, as a biometric feature, is inherent to the human body and has a uniqueness that cannot be copied. When a speech recognition system is connected to the cloud data center, the distinct voice information of each user can serve as a key to identify the user and determine the user's usage permissions. Compared with a traditional text password, this is harder to crack and offers higher security.

In addition, speech recognition systems on the market are generally limited to either single-speaker speaker-dependent recognition or speaker-independent recognition. To address this limitation, the present system proposes a multi-user speaker-dependent speech recognition scheme, which solves the problem of assigning usage permissions among the multiple users of a cloud data center.

Summary of the invention

The purpose of the present invention is to provide a cloud data center speech recognition method.

The purpose of the invention is achieved in the following manner: an HBR110 chip is used to process and analyze speech through the dynamic time warping algorithm and to identify the permissions of the speaker, thereby realizing speech recognition. The system comprises: 1) overall architecture design; 2) hardware design; and 3) software design, wherein

1) Overall architecture design: this is the primary design task of the system. The overall system architecture is designed by analyzing the system requirements and surveying the mainstream speech recognition products on the market. The HBR110 human speech recognition processor chip is selected and combined with an 8031 single-chip microcomputer, an audio amplification circuit, an SPI FLASH memory and the necessary peripheral circuits; speech is processed and analyzed using the dynamic time warping algorithm to realize the security functions of speech recognition and permission assignment;

2) Hardware design: the hardware design work comprises system schematic design and PCB design;

3) Software design: the software design work programs the 8031 single-chip microcomputer in assembly language to control the hardware system, and controls the HBR110 chip to complete the following operations:

S1 Preprocessing: comprises speech signal sampling, anti-aliasing band-pass filtering, and removal of the effects of individual pronunciation differences and of noise caused by equipment and environment, and involves the selection of speech recognition primitives and endpoint detection;

S2 Feature extraction: extracts the acoustic parameters of the speech that reflect its essential characteristics, including average energy, average zero-crossing rate and formants;

S3 Training: before recognition, the speaker repeats the utterance several times; redundant information is removed from the raw speech samples, the key data are retained, and the data are then clustered according to certain rules to form a template library;

S4 Pattern matching: the core of the whole speech recognition system; it computes the similarity between the input features and the stored templates according to certain rules and expert knowledge, and determines the semantic information of the input speech.
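As an illustration of the pattern matching step, the following is a minimal Python sketch of template matching with dynamic time warping, the algorithm named in this document. It is not the HBR110 chip's implementation, which the document does not describe. The function names, the Euclidean local distance, the length normalization and the rejection threshold are all assumptions made for the example.

```python
import math

def dtw_distance(a, b):
    """Dynamic time warping distance between two sequences of feature vectors."""
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # assumed local distance: Euclidean
            cost[i][j] = d + min(cost[i - 1][j],      # a advances, b repeats
                                 cost[i][j - 1],      # b advances, a repeats
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]

def best_match(features, templates, threshold):
    """Return the label of the closest stored template, or None if none is close enough.

    `templates` is a hypothetical template library: a dict mapping a label
    (for example a user name) to a stored feature sequence.
    """
    best_label, best_dist = None, math.inf
    for label, template in templates.items():
        # Normalize by combined length so longer utterances are not penalized.
        dist = dtw_distance(features, template) / (len(features) + len(template))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label if best_dist <= threshold else None
```

Because the warping path may repeat frames, an utterance spoken more slowly than its template can still align with zero cost, which is why dynamic time warping suits speech whose tempo varies between repetitions.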

The beneficial effects of the invention are:

A) Multi-user speaker-dependent speech recognition: overcomes the limitation that speech recognition systems currently on the market generally support only single-speaker speaker-dependent recognition or speaker-independent recognition;

B) Permission assignment improves security: the system can assign different usage permissions to different users, thereby improving the security of the system;

C) Distinctive handling of accents: users need not use standard pronunciation to use the system smoothly;

D) Diversified speech model samples: the speech models entered during training can be chosen according to the user's needs; the specific samples provided by the system need not be used.

Experimental verification shows that the system has high accuracy and practicality, with a speech matching accuracy of more than 90%.
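The permission assignment described in benefit B) can be pictured as a lookup from the recognized speaker to a set of allowed actions. The Python sketch below is only an illustration: the document does not specify how permissions are stored, and the user names, permission names and the `is_allowed` helper are hypothetical.

```python
# Hypothetical permission table; user names and permission names are
# illustrative only and do not come from the patent.
PERMISSIONS = {
    "admin": {"enter_machine_room", "operate_servers", "manage_users"},
    "operator": {"enter_machine_room", "operate_servers"},
    "visitor": {"enter_machine_room"},
}

def is_allowed(speaker, action, permissions=PERMISSIONS):
    """Return True only if a recognized speaker holds the requested permission."""
    if speaker is None:  # the speech did not match any enrolled user
        return False
    return action in permissions.get(speaker, set())
```

An unrecognized voice (`speaker is None`) and an enrolled user without the requested permission are both refused, so recognition failure never grants access.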

Description of drawings

Fig. 1 is a flowchart of the speech recognition process;

Fig. 2 is a diagram of the speech recognition hardware architecture.

Embodiment

The method of the present invention is described in detail below with reference to the accompanying drawings.

The implementation flow of the cloud data center speech recognition method of the present invention is shown in Fig. 1. As described in the summary of the invention, the system design mainly comprises overall architecture design, hardware design and software design.

The overall architecture design is the primary design task of the system. After an extensive survey, the hardware architecture shown in Fig. 2 was adopted. The coprocessor HBR110 is responsible for sound input, recognition, processing and output, while the main control chip 8031 is responsible for the corresponding control operations; the latter controls the operation of the whole system through its program and is the core of the system.

Hardware design is the second step of the system design. Based on a comprehensive analysis of each electronic component's characteristics, heat dissipation requirements, working environment and so on, the peripheral circuits of the main processor (the 8031 single-chip microcomputer) and of the coprocessor (the HBR110 chip), the audio amplification circuit and the SPI FLASH memory circuit are designed respectively, and the system schematic and PCB layout are completed.

Software design is the final step of the system design. The 8031 single-chip microcomputer is programmed in assembly language to control the HBR110 chip to complete the following operations:

S1 Preprocessing: comprises speech signal sampling, anti-aliasing band-pass filtering, removal of the effects of individual pronunciation differences and of noise caused by equipment and environment, etc., and involves the selection of speech recognition primitives and endpoint detection;

S2 Feature extraction: extracts the acoustic parameters of the speech that reflect its essential characteristics, such as average energy, average zero-crossing rate and formants;
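The two operations above can be sketched in Python as follows. This is an illustrative sketch rather than the assembly code run on the 8031 or the processing done inside the HBR110, neither of which is given in this document. The frame length, hop size, energy threshold and all function names are assumptions, and formant estimation is omitted for brevity.

```python
def frames(samples, frame_len, hop):
    """Split a list of samples into overlapping frames of equal length."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def average_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(s * s for s in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (frame_len >= 2)."""
    crossings = sum(1 for x, y in zip(frame, frame[1:]) if (x >= 0) != (y >= 0))
    return crossings / (len(frame) - 1)

def detect_endpoints(samples, frame_len=4, hop=2, energy_threshold=0.01):
    """Energy-based endpoint detection.

    Returns (first, last) indices of the frames whose average energy exceeds
    the threshold, or None if no frame does.
    """
    active = [i for i, f in enumerate(frames(samples, frame_len, hop))
              if average_energy(f) > energy_threshold]
    return (active[0], active[-1]) if active else None

def extract_features(samples, frame_len=4, hop=2):
    """Per-frame feature vectors: (average energy, zero-crossing rate)."""
    return [(average_energy(f), zero_crossing_rate(f))
            for f in frames(samples, frame_len, hop)]
```

The tiny default frame length is chosen only to keep the example easy to trace by hand; real speech front ends typically use frames of a few hundred samples.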

Except for the technical features described in the specification, everything else is technology known to those skilled in the art.

Claims

1. A cloud data center speech recognition method, characterized in that an HBR110 chip is used to process and analyze speech through the dynamic time warping algorithm and to identify the permissions of the speaker, thereby realizing speech recognition, the system comprising: 1) overall architecture design, 2) hardware design and 3) software design; wherein