CN107017007A - A kind of substation field operation remote command method based on voice transfer - Google Patents

A kind of substation field operation remote command method based on voice transfer Download PDF

Info

Publication number
CN107017007A
CN107017007A CN201710334346.9A CN201710334346A CN107017007A CN 107017007 A CN107017007 A CN 107017007A CN 201710334346 A CN201710334346 A CN 201710334346A CN 107017007 A CN107017007 A CN 107017007A
Authority
CN
China
Prior art keywords
voice
sub
data
sound
parameter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710334346.9A
Other languages
Chinese (zh)
Inventor
朱毅
朱文亮
赵鹏飞
靳海军
赵娜
朱子剑
亓超
于千千
王安山
赵书楠
丁晶
丁振华
张学凯
刘小宁
张辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Economic and Technological Research Institute of State Grid Shandong Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
Economic and Technological Research Institute of State Grid Shandong Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Economic and Technological Research Institute of State Grid Shandong Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN201710334346.9A priority Critical patent/CN107017007A/en
Publication of CN107017007A publication Critical patent/CN107017007A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q5/00Selecting arrangements wherein two or more subscriber stations are connected by the same line to the exchange
    • H04Q5/24Selecting arrangements wherein two or more subscriber stations are connected by the same line to the exchange for two-party-line systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a kind of substation field operation remote command method based on voice transfer, it decodes three processes by encoded speech data, data transfer and speech data and carries out remote speech transmission, with the interactive voice of this commander's talk back equipment for completing live talk back equipment and remote command center, so that to instruct substation field operation, code efficiency is effectively raised, and obtains more preferable sense of hearing effect;And the pure and impure tone parameter of subband has carried out quantification treatment in sound transmission course, improves the quantitative efficiency of sub-band surd and sonant degree parameter, the naturalness of synthesis voice is improved.

Description

A kind of substation field operation remote command method based on voice transfer
Technical field
The present invention relates to a kind of substation field operation remote command method based on voice transfer, belong to substation safety Operating technology field.
Background technology
It is directly used in production and uses electric energy, the electrical equipment higher than control loop (secondary device) voltage is referred to as once set It is standby.Main composition has:Generator (motor), transformer, breaker, disconnecting switch, automatic switch, contactor, knife-like switch, mother Line, transmission line of electricity, power cable, reactor etc..Operation for primary equipment needs high degree of safety,《Electric power safety work Make code》Middle regulation, the operation of operations staff is divided into monitoring operation and one man operation.Wherein, monitoring operation is held by least two people OK, wherein a people guards to the equipment person of being familiar with, one man operation is by people's complete independently.At present in operation, one As in the case of, administrative staff can not know the concrete condition of operation, and when operation goes wrong, administrative staff at most can only Gone and found out what's going on by phone, it is impossible to obtain live visual information.The drawbacks of this operational method has certain, particularly operator Member is to some primary equipment when will operate, administrative staff control centre miscellaneous equipment is turned off or change its course so that pair The primary equipment can be with safety operation, but administrative staff extremely easily select primary equipment to malfunction, will it is other once Equipment safety is isolated, and the primary equipment that will be operated is not isolated from, and causes very big potential safety hazard, or even harm execute-in-place people The life of member.This occurred many in the actual course of work, but prevented this feelings there is presently no preferable method The generation of condition, sees clearly equipment to isolate in can only working again, dependence be staff careful degree.
The utility model of Application No. 201120347568.2 discloses a kind of transforming plant protecting, observing and controlling screen cabinet voice message Device, including voice module, power supply, horn module, electromagnetism normally closed contact and electromagnet, wherein electromagnetism normally closed contact are arranged on guarantor In shield, the doorframe of observing and controlling screen cabinet, electromagnet is arranged in cabinet door, one end of electromagnetism normally closed contact and the triggering of voice cue module Signal input part is connected, and other end ground connection, the voice signal output end of voice module and the signal input part of loudspeaker are connected, power supply Powered to modules.Operator reports the device name to be operated with voice, kept away when opening protection, observing and controlling screen cabinet door Exempt from operator and go to the wrong way interval to cause maloperation, prevent the generation of artificial misoperation fault from source, improve electric network reliability. The utility model is once to be thought verification before operation, but people can have fault phenomenon, it is possible to causing meaning Outer accident.
During remote command, speech quality is to determine a key factor of field operation success or failure, speech quality Quality influence both sides' communication effectiveness and operation quality.Accordingly, it would be desirable to a kind of high-quality voice transmission method.
The content of the invention
It is existing the invention provides a kind of transformer station based on voice transfer in order to overcome the shortcomings of that above-mentioned prior art is present Field operation remote command method, it is checked the primary equipment at scene by voice dialogue, makes administrative staff's remote command Site operation personnel carries out safety operation.
The technical scheme adopted by the invention to solve the technical problem is that:A kind of substation field based on voice transfer is made Industry remote command method, carries out interactive voice to instruct by commander's talk back equipment of live talk back equipment and remote command center Substation field operation, it is characterized in that, including procedure below:
First, encoded speech data
Step 11, obtain live talk back equipment or command the speech data of the telephone transmitter input of talk back equipment;
Step 12, the speech data of acquisition is subjected to SILK coded treatments and generates voice sub-band parameter;
Step 13, voice sub-band surd and sonant parameter is quantified;The step 13 is comprised the following steps that:
Step 131, sub-band surd and sonant in voice sub-band parameter is judged;Subband is clear in the sub-band parameter to voice The process that voiced sound is judged is:Pure and impure sound is set to judge data set first;Then voice training data are utilized and described pure and impure Sound judges data set, and the pure and impure sound decision model for judging tree construction to y-bend is trained, and wherein the y-bend is judged in tree construction Non-leaf nodes judge the problem of data are concentrated for the pure and impure sound, leaf node is pure and impure sound result of determination;Finally receive Tone testing data, and tone testing data are judged as voiceless sound or voiced sound using the pure and impure sound decision model after training, when Tone testing data are judged as after voiced sound, and the pumping signal of the tone testing data is assumed to be during phonetic synthesis Shock response sequence;After tone testing data are judged as voiceless sound, the tone testing data during phonetic synthesis Pumping signal is assumed to be white noise;
Step 132, timer m1=0 is exited in coding side setting, and interval T0 is exited in setting;
Step 133, to the speech data sampling point framing in chronological order of acquisition, and sub-band surd and sonant degree ginseng is extracted by frame Number;
Step 134, multiple speech frame composition superframes, sub-band surd and sonant degree parameter composition superframe of multiple speech frames are combined With pure and impure tone parameter;
Step 135, judge m1 whether the integral multiple for being T0, if it is using initial codebook to superframe sub-band surd and sonant degree Parameter carries out vector quantization;Otherwise the sub-band surd and sonant degree parameter after a upper superframe is quantified is quantified using initial codebook, Assuming that quantized value is n, the corresponding code book Cn of the index is chosen from condition codebook set according to quantization index value n, code book is utilized Cn is quantified to current super frame sub-band surd and sonant degree parameter and obtains index value index;Described initial codebook and condition code This collection is obtained by off-line training;
Step 136, index value is subjected to coding transmission, makes m1=m1+1 and go to step 135;
Step 14, by the corresponding bit stream of voice sub-band parameter compression generation current speech data after real-time quantization;
2nd, data transfer
Live talk back equipment or the speech data for the telephone transmitter input for commanding talk back equipment are sent to commander's talk back equipment Or the telephone receiver of live talk back equipment;
The detailed process of the data transfer comprises the following steps:
Step 21, telephone transmitter send voice superframes when, in the head time slot of each voice superframes place synchronous code with it is complete Whole signaling parameter, the signaling parameter includes all parameters for being used to carry out the foundation of voice call link;
Step 22, when telephone receiver receives signal, detect the signal that receives whether be voice superframes voice head time slot, such as Fruit is then to be transferred to step 23, otherwise, is transferred to step 24;
Step 23, the signaling parameter of carrying is parsed from the voice head time slot, and is entered according to the signaling parameter parsed The foundation of row voice call link;
Step 24, according to the content for receiving signal, signal transacting is carried out;The concrete operations of the step 24 are:If connect The signal received is data head time slot, then sets up data interaction link;If the signal received is data block pdu, skill According to block;If the signal received is high-rise control signaling, the high-rise control signaling of parsing;
3rd, speech data is decoded
Step 31, the corresponding bit stream of current speech data is obtained, and by SILK decoders to current speech data ratio Spy's stream decodes the voice sub-band parameter after being quantified;
Step 32, extract quantify after voice sub-band parameter each sub-band surd and sonant parameter and synthesize pumping signal;The step Rapid 32 comprise the following steps that:
Step 321, decoding end exits timer m2=0 with coding side setting simultaneously, and interval T0 is exited in setting;
Step 322, decoding end judge m2 whether the integral multiple for being T0, if it is search for initial according to index value index Code book obtains superframe sub-band surd and sonant degree parameter;Otherwise the sub-band surd and sonant degree after being quantified using initial codebook to a upper superframe is joined Number is quantified, and obtains index value n, and concentrates selection code book Cn from condition code book according to index value n, is received according to decoding end Index value index search code book Cn obtains superframe sub-band surd and sonant degree parameter;
Step 323, according to superframe sub-band surd and sonant degree parameter, the sub-band surd and sonant degree parameter of each speech frame is obtained, is sent Enter pumping signal synthesis end generation pumping signal;Make m=m+1 and go to step 322;
Step 33, the voice of synthesis is exported in real time.
Preferably, the process that the pure and impure sound decision model for judging y-bend tree construction is trained as:For pure and impure Sound judges each problem that data are concentrated, and is to calculate voiced sound ratio value respectively with no voice training data for answering, and And it is used as root node the problem of selection voiced sound proportional difference maximum;Enter line splitting for voice training data under root node, with Form non-leaf nodes and leaf node.
Preferably, during the pure and impure sound decision model for judging y-bend tree construction is trained,
After division stop condition set in advance is met, stop the division, wherein the division stops bar Part is:Non-leaf nodes or the voice training data of leaf node are less than the first threshold value set in advance;
After division stop condition set in advance is met, stop the division, wherein the division stops bar Part is:The voiced sound proportional difference of non-leaf nodes or leaf node is less than the second threshold value set in advance.
Preferably, the pure and impure sound judges whether data set is vowel, plosive, friction as the phoneme belonging to speech frame Sound, read again, nasal sound, high and level tone, rising tone, upper sound or falling tone, and forward and backward one in speech sentence of phoneme belonging to speech frame Whether phoneme is vowel, plosive, fricative, stressed, nasal sound, high and level tone, rising tone, upper sound or falling tone.
Preferably, interval T0 is exited to be set according to the size of superframe.
Preferably, the speech data sampling point to acquisition is by 8kHz frequency samplings and done by high-pass filtering removal power frequency The voice sampling point disturbed.
Preferably, the quantity y of the condition code book and the quantizing bit number x of sub-band surd and sonant degree parameter relational expression are:y =2x
The beneficial effects of the invention are as follows,
The present invention decodes three processes by encoded speech data, data transfer and speech data and carries out remote speech biography It is defeated, with the interactive voice of this commander's talk back equipment for completing live talk back equipment and remote command center, so as to instruct power transformation Stand field operation, effectively raise code efficiency, and obtain more preferable sense of hearing effect.
Checked by the command methods of the present invention come the primary equipment to substation field operation, it is ensured that to correct Primary equipment is isolated, and verification is checked using flexecutive's real-time instruction, it is ensured that the accuracy of isolation;The present invention Convenient scene and the communication of control centre, can be sent to control centre by the sound at scene, be easy to the technical barrier to scene Quickly solved, administrative staff's remote command site operation personnel is carried out safety operation.
Present invention subband pure and impure tone parameter in sound transmission course has carried out quantification treatment, it is contemplated that continuous superframe The correlation of sub-band surd and sonant degree parameter, the principle of adoption status transfer during to current super frame sub-band surd and sonant metrization, is examined Consider the sub-band surd and sonant degree parameter of a upper superframe, therefore improve the quantitative efficiency of sub-band surd and sonant degree parameter.The present invention The naturalness of synthesis voice is improved, is adapted to low rate and ultralow speed parametric speech coding with below 2400bps.
The present invention during sub-band surd and sonant degree parameter quantification treatment first to voice sub-band parameter in sub-band surd and sonant Judged, it sets pure and impure sound to judge data set;Using voice training data and the pure and impure sound decision problem collection, to y-bend Judge that the pure and impure sound decision model of tree construction is trained, wherein the y-bend judges the non-leaf nodes in tree construction to be described clear The problem of voiced sound decision problem is concentrated, leaf node is pure and impure sound result of determination;Tone testing data are received, and are utilized after training The pure and impure sound decision model judge tone testing data as voiceless sound or voiced sound.As can be seen here, embodiment of the present invention is used Independent pure and impure sound decision model pairing is judged into plain the pure and impure of state of voice middle pitch, is judged into so as to improve pure and impure sound Power, solve synthesis voice voiced sound Qinghua and voiceless sound turbidity caused by the low defect of synthetic effect, improve language The quality of sound synthesis.
The data transfer mode of the present invention can reduce overhead on the premise of system transmission performance is not reduced, System efficiency of transmission is improved, and reduces system processing complexity, the flow of voice transfer link establishment is more rationalized.
Brief description of the drawings
The present invention is further described below in conjunction with the accompanying drawings:
Fig. 1 is flow chart of the method for the present invention;
Fig. 2 is the flow chart of encoded speech data of the present invention;
Fig. 3 is the flow chart of data transfer of the present invention;
Fig. 4 is the flow chart of speech data decoding of the present invention.
Embodiment
For the technical characterstic for illustrating this programme can be understood, below by embodiment, and its accompanying drawing is combined, to this hair It is bright to be described in detail.Following disclosure provides many different embodiments or example is used for realizing the different knots of the present invention Structure.In order to simplify disclosure of the invention, hereinafter the part and setting of specific examples are described.In addition, the present invention can be with Repeat reference numerals and/or letter in different examples.This repetition is that for purposes of simplicity and clarity, itself is not indicated Relation between various embodiments are discussed and/or set.It should be noted that part illustrated in the accompanying drawings is not necessarily to scale Draw.Present invention omits the description to known assemblies and treatment technology and process to avoid being unnecessarily limiting the present invention.
As shown in figure 1, a kind of substation field operation remote command method based on voice transfer of the present invention, the present invention Three processes are decoded by encoded speech data, data transfer and speech data and carry out remote speech transmission, scene is completed with this The interactive voice of commander's talk back equipment of talk back equipment and remote command center, so as to instruct substation field operation, effectively Improve code efficiency, and obtain more preferable sense of hearing effect., command methods, which are somebody's turn to do, include procedure below.
First, encoded speech data
As shown in Fig. 2 the detailed process of encoded speech data comprises the following steps:
Step 11, obtain live talk back equipment or command the speech data of the telephone transmitter input of talk back equipment;
Step 12, the speech data of acquisition is subjected to SILK coded treatments and generates voice sub-band parameter;
Step 13, voice sub-band surd and sonant parameter is quantified;The step 13 is comprised the following steps that:
Step 131, sub-band surd and sonant in voice sub-band parameter is judged;Subband is clear in the sub-band parameter to voice The process that voiced sound is judged is:Pure and impure sound is set to judge data set first;Then voice training data are utilized and described pure and impure Sound judges data set, and the pure and impure sound decision model for judging tree construction to y-bend is trained, and wherein the y-bend is judged in tree construction Non-leaf nodes judge the problem of data are concentrated for the pure and impure sound, leaf node is pure and impure sound result of determination;Finally receive Tone testing data, and tone testing data are judged as voiceless sound or voiced sound using the pure and impure sound decision model after training, when Tone testing data are judged as after voiced sound, and the pumping signal of the tone testing data is assumed to be during phonetic synthesis Shock response sequence;After tone testing data are judged as voiceless sound, the tone testing data during phonetic synthesis Pumping signal is assumed to be white noise;
Step 132, timer m1=0 is exited in coding side setting, and interval T0 is exited in setting;
Step 133, to the speech data sampling point framing in chronological order of acquisition, and sub-band surd and sonant degree ginseng is extracted by frame Number;
Step 134, multiple speech frame composition superframes, sub-band surd and sonant degree parameter composition superframe of multiple speech frames are combined With pure and impure tone parameter;
Step 135, judge m1 whether the integral multiple for being T0, if it is using initial codebook to superframe sub-band surd and sonant degree Parameter carries out vector quantization;Otherwise the sub-band surd and sonant degree parameter after a upper superframe is quantified is quantified using initial codebook, Assuming that quantized value is n, the corresponding code book Cn of the index is chosen from condition codebook set according to quantization index value n, code book is utilized Cn is quantified to current super frame sub-band surd and sonant degree parameter and obtains index value index;Described initial codebook and condition code This collection is obtained by off-line training;
Step 136, index value is subjected to coding transmission, makes m1=m1+1 and go to step 135;
Step 14, by the corresponding bit stream of voice sub-band parameter compression generation current speech data after real-time quantization.
The present invention during sub-band surd and sonant degree parameter quantification treatment first to voice sub-band parameter in sub-band surd and sonant Judged, it sets pure and impure sound to judge data set;Using voice training data and the pure and impure sound decision problem collection, to y-bend Judge that the pure and impure sound decision model of tree construction is trained, wherein the y-bend judges the non-leaf nodes in tree construction to be described clear The problem of voiced sound decision problem is concentrated, leaf node is pure and impure sound result of determination;Tone testing data are received, and are utilized after training The pure and impure sound decision model judge tone testing data as voiceless sound or voiced sound.As can be seen here, embodiment of the present invention is used Independent pure and impure sound decision model pairing is judged into plain the pure and impure of state of voice middle pitch, is judged into so as to improve pure and impure sound Power, solve synthesis voice voiced sound Qinghua and voiceless sound turbidity caused by the low defect of synthetic effect, improve language The quality of sound synthesis.
2nd, data transfer
Live talk back equipment or the speech data for the telephone transmitter input for commanding talk back equipment are sent to commander's talk back equipment Or the telephone receiver of live talk back equipment.As shown in figure 3, the detailed process of data transfer comprises the following steps:
Step 21, telephone transmitter send voice superframes when, in the head time slot of each voice superframes place synchronous code with it is complete Whole signaling parameter, the signaling parameter includes all parameters for being used to carry out the foundation of voice call link;Synchronous code is used for Receiving terminal is synchronous with transmitting terminal;
Step 22, when telephone receiver receives signal, detect the signal that receives whether be voice superframes voice head time slot, such as Fruit is to be transferred to step 23, is otherwise transferred to step 24;
Step 23, the signaling parameter of carrying is parsed from the voice head time slot, and is entered according to the signaling parameter parsed The foundation of row voice call link;
Step 24, according to the content for receiving signal, signal transacting is carried out;The concrete operations of the step 24 are:If connect The signal received is data head time slot, then sets up data interaction link;If the signal received is data block pdu, skill According to block;If the signal received is high-rise control signaling, the high-rise control signaling of parsing.
Signaling parameter described above includes physical layer signaling and high-rise control signaling, physical layer signaling be arranged on synchronous code it Afterwards;High-rise control signaling is arranged in the Deta bearer after Part I signaling.Physical layer signaling parameter include the machine colour coding, Data type mark, time-gap number mark, slot transmission pattern identification and parity check sum flag bit, wherein data type include Speech frame frame head, data frame frame head, data block Sub-PDU and high-rise control signaling.
The data transfer mode of the present invention can reduce overhead on the premise of system transmission performance is not reduced, System efficiency of transmission is improved, and reduces system processing complexity, the flow of voice transfer link establishment is more rationalized.
3rd, speech data is decoded
As shown in figure 4, the detailed process of speech data decoding comprises the following steps:
Step 31, the corresponding bit stream of current speech data is obtained, and by SILK decoders to current speech data ratio Spy's stream decodes the voice sub-band parameter after being quantified;
Step 32, extract quantify after voice sub-band parameter each sub-band surd and sonant parameter and synthesize pumping signal;The step Rapid 32 comprise the following steps that:
Step 321, decoding end exits timer m2=0 with coding side setting simultaneously, and interval T0 is exited in setting;
Step 322, decoding end judge m2 whether the integral multiple for being T0, if it is search for initial according to index value index Code book obtains superframe sub-band surd and sonant degree parameter;Otherwise the sub-band surd and sonant degree after being quantified using initial codebook to a upper superframe is joined Number is quantified, and obtains index value n, and concentrates selection code book Cn from condition code book according to index value n, is received according to decoding end Index value index search code book Cn obtains superframe sub-band surd and sonant degree parameter;
Step 323, according to superframe sub-band surd and sonant degree parameter, the sub-band surd and sonant degree parameter of each speech frame is obtained, is sent Enter pumping signal synthesis end generation pumping signal;Make m=m+1 and go to step 322;
Step 33, the voice of synthesis is exported in real time.
Preferably, the process that the pure and impure sound decision model for judging y-bend tree construction is trained as:For pure and impure Sound judges each problem that data are concentrated, and is to calculate voiced sound ratio value respectively with no voice training data for answering, and And it is used as root node the problem of selection voiced sound proportional difference maximum;Enter line splitting for voice training data under root node, with Form non-leaf nodes and leaf node.
Preferably, during the pure and impure sound decision model for judging y-bend tree construction is trained,
After division stop condition set in advance is met, stop the division, wherein the division stops bar Part is:Non-leaf nodes or the voice training data of leaf node are less than the first threshold value set in advance;
After division stop condition set in advance is met, stop the division, wherein the division stops bar Part is:The voiced sound proportional difference of non-leaf nodes or leaf node is less than the second threshold value set in advance.
Preferably, the pure and impure sound judges whether data set is vowel, plosive, friction as the phoneme belonging to speech frame Sound, read again, nasal sound, high and level tone, rising tone, upper sound or falling tone, and forward and backward one in speech sentence of phoneme belonging to speech frame Whether phoneme is vowel, plosive, fricative, stressed, nasal sound, high and level tone, rising tone, upper sound or falling tone.
Preferably, interval T0 is exited to be set according to the size of superframe.
Preferably, the speech data sampling point to acquisition is by 8kHz frequency samplings and done by high-pass filtering removal power frequency The voice sampling point disturbed.
Preferably, the quantity y of the condition code book and the quantizing bit number x of sub-band surd and sonant degree parameter relational expression are:y =2x
Checked by the command methods of the present invention come the primary equipment to substation field operation, it is ensured that to correct Primary equipment is isolated, and verification is checked using flexecutive's real-time instruction, it is ensured that the accuracy of isolation;The present invention Convenient scene and the communication of control centre, can be sent to control centre by the sound at scene, be easy to the technical barrier to scene Quickly solved, administrative staff's remote command site operation personnel is carried out safety operation.
The quantizing process that carried out to voice sub-band surd and sonant parameter of the present invention has taken into full account subband between successive frame (superframe) The correlation of pure and impure tone parameter, utilizes the correlation, according to previous superframe sub-band surd and sonant degree parameter by state transfer Quantized result, the quantization code book of adaptively selected current super frame sub-band surd and sonant degree parameter improves sub-band surd and sonant parameter Quantitative efficiency.Meanwhile, quantizing process proposes timing escape mechanism, often crosses time T, quantifies code book and restarts source code This, so as to effectively prevent the diffusion of channel error code, makes the influence that error code spreads be confined within time T.This method can be with In low rate, ultralow speed speech coding algorithm effective for below 2400bps, so as to improve its synthetic speech quality.
Simply the preferred embodiment of the present invention described above, for those skilled in the art, Without departing from the principles of the invention, some improvements and modifications can also be made, these improvements and modifications are also regarded as this hair Bright protection domain.

Claims (7)

1. a kind of substation field operation remote command method based on voice transfer, passes through live talk back equipment and remote command Commander's talk back equipment at center carries out interactive voice to instruct substation field operation, it is characterized in that, including procedure below:
First, encoded speech data
Step 11, obtain live talk back equipment or command the speech data of the telephone transmitter input of talk back equipment;
Step 12, the speech data of acquisition is subjected to SILK coded treatments and generates voice sub-band parameter;
Step 13, voice sub-band surd and sonant parameter is quantified;The step 13 is comprised the following steps that:
Step 131, sub-band surd and sonant in voice sub-band parameter is judged;Sub-band surd and sonant in the sub-band parameter to voice The process judged is:Pure and impure sound is set to judge data set first;Then sentenced using voice training data and the pure and impure sound Determine data set, the pure and impure sound decision model for judging tree construction to y-bend is trained, and wherein the y-bend judges non-in tree construction Leaf node judges the problem of data are concentrated for the pure and impure sound, and leaf node is pure and impure sound result of determination;Finally receive voice Test data, and judge that tone testing data, as voiceless sound or voiced sound, work as voice using the pure and impure sound decision model after training Test data is judged as after voiced sound, and the pumping signal of the tone testing data is assumed to be impact during phonetic synthesis Response sequence;After tone testing data are judged as voiceless sound, the excitation of the tone testing data during phonetic synthesis Signal is assumed to be white noise;
Step 132, timer m1=0 is exited in coding side setting, and interval T0 is exited in setting;
Step 133, to the speech data sampling point framing in chronological order of acquisition, and sub-band surd and sonant degree parameter is extracted by frame;
Step 134, multiple speech frame composition superframes are combined, the sub-band surd and sonant degree parameter composition superframe subband of multiple speech frames is clear Voiced sound degree parameter;
Step 135, judge m1 whether the integral multiple for being T0, if it is using initial codebook to superframe sub-band surd and sonant degree parameter Carry out vector quantization;Otherwise the sub-band surd and sonant degree parameter after a upper superframe is quantified is quantified using initial codebook, it is assumed that Quantized value is n, and the corresponding code book Cn of the index is chosen from condition codebook set according to quantization index value n, utilizes Cn pairs of code book Current super frame sub-band surd and sonant degree parameter is quantified and obtains index value index;Described initial codebook and condition code book collection Obtained by off-line training;
Step 136, index value is subjected to coding transmission, makes m1=m1+1 and go to step 135;
Step 14, by the corresponding bit stream of voice sub-band parameter compression generation current speech data after real-time quantization;
2nd, data transfer
Live talk back equipment or the speech data for the telephone transmitter input for commanding talk back equipment are sent to commander's talk back equipment or existing The telephone receiver of field talk back equipment;
The detailed process of the data transfer comprises the following steps:
Step 21, telephone transmitter is when sending voice superframes, placed in the head time slot of each voice superframes synchronous code with it is complete Signaling parameter, the signaling parameter includes all parameters for being used to carry out the foundation of voice call link;
Step 22, when telephone receiver receives signal, detect the signal that receives whether be voice superframes voice head time slot, if It is then to be transferred to step 23, otherwise, is transferred to step 24;
Step 23, the signaling parameter of carrying is parsed from the voice head time slot, and language is carried out according to the signaling parameter parsed The foundation of sound call link;
Step 24, according to the content for receiving signal, signal transacting is carried out;The concrete operations of the step 24 are:If received Signal be data head time slot, then set up data interaction link;If the signal received is data block pdu, data block is solved; If the signal received is high-rise control signaling, the high-rise control signaling of parsing;
3rd, speech data is decoded
Step 31, the corresponding bit stream of current speech data is obtained, and by SILK decoders to current speech data bit stream Decode the voice sub-band parameter after being quantified;
Step 32, extract quantify after voice sub-band parameter each sub-band surd and sonant parameter and synthesize pumping signal;The step 32 Comprise the following steps that:
Step 321, decoding end exits timer m2=0 with coding side setting simultaneously, and interval T0 is exited in setting;
Step 322, decoding end judge m2 whether the integral multiple for being T0, if it is according to index value index search for initial codebook Obtain superframe sub-band surd and sonant degree parameter;Otherwise the sub-band surd and sonant degree parameter after being quantified using initial codebook to a upper superframe is entered Row quantifies, and obtains index value n, and concentrate selection code book Cn, the index received according to decoding end from condition code book according to index value n Value index search code book Cn obtains superframe sub-band surd and sonant degree parameter;
Step 323, according to superframe sub-band surd and sonant degree parameter, the sub-band surd and sonant degree parameter of each speech frame is obtained, feeding swashs Encourage signal synthesis end generation pumping signal;Make m=m+1 and go to step 322;
Step 33, the voice of synthesis is exported in real time.
2. a kind of substation field operation remote command method based on voice transfer according to claim 1, its feature Be, the process that the pure and impure sound decision model for judging y-bend tree construction is trained as:Judge data set for pure and impure sound In each problem, be to calculate voiced sound ratio value respectively with no voice training data for answering, and select voiced sound ratio The problem of example difference is maximum is used as root node;Enter line splitting for voice training data under root node, to form non-leaf section Point and leaf node.
3. a kind of substation field operation remote command method based on voice transfer according to claim 2, its feature It is, during the pure and impure sound decision model for judging y-bend tree construction is trained,
After division stop condition set in advance is met, stop the division, wherein the division stop condition is: Non-leaf nodes or the voice training data of leaf node are less than the first threshold value set in advance;
After division stop condition set in advance is met, stop the division, wherein the division stop condition is: The voiced sound proportional difference of non-leaf nodes or leaf node is less than the second threshold value set in advance.
4. a kind of substation field operation remote command method based on voice transfer according to claim 1, its feature Be, the pure and impure sound judge data set as the phoneme belonging to speech frame whether be vowel, plosive, fricative, read again, nasal sound, Whether high and level tone, rising tone, upper sound or falling tone, and a forward and backward phoneme of the phoneme in speech sentence belonging to speech frame are members Sound, plosive, fricative, stressed, nasal sound, high and level tone, rising tone, upper sound or falling tone.
5. a kind of substation field operation remote command method based on voice transfer according to claim 1, its feature It is to exit interval T0 to be set according to the size of superframe.
6. a kind of substation field operation remote command method based on voice transfer according to claim 1, its feature It is that the speech data sampling point to acquisition is by 8kHz frequency samplings and by the voice sampling point of high-pass filtering removal Hz noise.
7. a kind of substation field operation remote command method based on voice transfer according to claim 1, its feature It is that the quantity y and the quantizing bit number x of sub-band surd and sonant degree parameter of the condition code book relational expression are:Y=2x
CN201710334346.9A 2017-05-12 2017-05-12 A kind of substation field operation remote command method based on voice transfer Pending CN107017007A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710334346.9A CN107017007A (en) 2017-05-12 2017-05-12 A kind of substation field operation remote command method based on voice transfer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710334346.9A CN107017007A (en) 2017-05-12 2017-05-12 A kind of substation field operation remote command method based on voice transfer

Publications (1)

Publication Number Publication Date
CN107017007A true CN107017007A (en) 2017-08-04

Family

ID=59450245

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710334346.9A Pending CN107017007A (en) 2017-05-12 2017-05-12 A kind of substation field operation remote command method based on voice transfer

Country Status (1)

Country Link
CN (1) CN107017007A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN202196988U (en) * 2011-09-16 2012-04-18 河南省电力公司周口供电公司 Voice prompt device for protecting, observing and controlling screen cabinet of transformer substation
CN102855878A (en) * 2012-09-21 2013-01-02 山东省计算中心 Quantification method of pure and impure pitch parameters of narrow-band voice sub-band
CN104143342A (en) * 2013-05-15 2014-11-12 腾讯科技(深圳)有限公司 Voiceless sound and voiced sound judging method and device and voice synthesizing system
CN104469293A (en) * 2014-11-27 2015-03-25 国网山东省电力公司泰安供电公司 Monitoring device for remote electric operation in transformer substation
CN105356606A (en) * 2015-11-21 2016-02-24 国网河南省电力公司平顶山供电公司 Remote operation voice prompting device of transformer substation
CN105592077A (en) * 2011-05-19 2016-05-18 福建联拓科技有限公司 Voice transmission method
CN107256711A (en) * 2017-05-12 2017-10-17 国网山东省电力公司经济技术研究院 A kind of power distribution network emergency maintenance remote commanding system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105592077A (en) * 2011-05-19 2016-05-18 福建联拓科技有限公司 Voice transmission method
CN202196988U (en) * 2011-09-16 2012-04-18 河南省电力公司周口供电公司 Voice prompt device for protecting, observing and controlling screen cabinet of transformer substation
CN102855878A (en) * 2012-09-21 2013-01-02 山东省计算中心 Quantification method of pure and impure pitch parameters of narrow-band voice sub-band
CN104143342A (en) * 2013-05-15 2014-11-12 腾讯科技(深圳)有限公司 Voiceless sound and voiced sound judging method and device and voice synthesizing system
CN104469293A (en) * 2014-11-27 2015-03-25 国网山东省电力公司泰安供电公司 Monitoring device for remote electric operation in transformer substation
CN105356606A (en) * 2015-11-21 2016-02-24 国网河南省电力公司平顶山供电公司 Remote operation voice prompting device of transformer substation
CN107256711A (en) * 2017-05-12 2017-10-17 国网山东省电力公司经济技术研究院 A kind of power distribution network emergency maintenance remote commanding system

Similar Documents

Publication Publication Date Title
JP3366903B2 (en) Method and apparatus for coding an audio signal and method and apparatus for decoding a bitstream
TW318239B (en)
CN108833722A (en) Audio recognition method, device, computer equipment and storage medium
CN103325375B (en) One extremely low code check encoding and decoding speech equipment and decoding method
CN105845135A (en) Sound recognition system and method for robot system
JPS603240A (en) Voice activation detecting method
CN112365882A (en) Speech synthesis method, model training method, device, equipment and storage medium
CN105206271A (en) Intelligent equipment voice wake-up method and system for realizing method
US8571204B2 (en) Apparatus and method for echo control in parameter domain
CN103533129B (en) Real-time voiced translation communication means, system and the communication apparatus being applicable
CN101273403A (en) Scalable encoding apparatus, scalable decoding apparatus, and methods of them
WO2020238058A1 (en) Voice transmission method and apparatus, computer device and storage medium
CN105895082A (en) Acoustic model training method and device as well as speech recognition method and device
CN101719966A (en) Monitoring method of call tone quality, monitoring terminal and system therefor
CN110444224A (en) A kind of method of speech processing and device based on production confrontation network
CN107256711A (en) A kind of power distribution network emergency maintenance remote commanding system
CN107017007A (en) A kind of substation field operation remote command method based on voice transfer
CN117713377A (en) Intelligent voice joint debugging system of dispatching automation master station
CN107464569A (en) Vocoder
CN103854655A (en) Low-bit-rate voice coder and decoder
Yong Study of voice packet reconstruction methods applied to CELP speech coding
CN110533750A (en) A method of it converts the audio into as the sign language animation with customized 3D role
CN106325260A (en) Rolling mill transmission trip fault voice alarm system
KR100984094B1 (en) A voiced/unvoiced decision method for the smv of 3gpp2 using gaussian mixture model
Nabil et al. Formant distortion after codecs for Arabic

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170804