EP1048025A1 - Procede de determination instrumentale de la qualite vocale - Google Patents

Procede de determination instrumentale de la qualite vocale

Info

Publication number
EP1048025A1
EP1048025A1 EP99942871A EP99942871A EP1048025A1 EP 1048025 A1 EP1048025 A1 EP 1048025A1 EP 99942871 A EP99942871 A EP 99942871A EP 99942871 A EP99942871 A EP 99942871A EP 1048025 A1 EP1048025 A1 EP 1048025A1
Authority
EP
European Patent Office
Prior art keywords
spectral
calculated
speech signal
evaluated
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP99942871A
Other languages
German (de)
English (en)
Other versions
EP1048025B1 (fr
Inventor
Jens Berger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Deutsche Telekom AG
Original Assignee
Deutsche Telekom AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Deutsche Telekom AG filed Critical Deutsche Telekom AG
Publication of EP1048025A1 publication Critical patent/EP1048025A1/fr
Application granted granted Critical
Publication of EP1048025B1 publication Critical patent/EP1048025B1/fr
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/69Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Definitions

  • the invention relates to a method for instrumental ("objective") speech quality determination, in which characteristic values for determining the speech quality (speech quality) are derived by comparing properties of a speech signal to be evaluated with properties of a reference speech signal (undisturbed signal).
  • Speech quality determinations of speech signals are generally carried out by means of auditory ("subjective") examinations with test subjects.
  • the aim of instrumental ("objective") methods for determining speech quality is to determine from the properties of the speech signal to be assessed, using suitable computing methods, characteristic values which describe the speech quality of the speech signal to be assessed, without having to resort to judgments from test subjects.
  • the calculated parameters and the underlying method for instrumental language quality determination are considered recognized if a high correlation to the results of auditory comparative examinations is achieved.
  • the language quality values obtained by means of auditory examinations thus represent the target values that are to be achieved by instrumental methods.
  • Known methods for instrumental speech quality determination are based on a comparison of a reference speech signal with the speech signal to be evaluated.
  • the reference speech signal and the speech signal to be evaluated are segmented into short time segments.
  • the spectral properties of the two signals are compared in these segments.
  • the spectral intensity map calculated in this way for each period of time under consideration can be understood as a series of numerical values in which the number of individual values corresponds to the number of frequency bands used, the numerical values themselves represent the calculated intensity values and a continuous index of the frequency bands describes the sequence of the numerical values.
  • the limits of the frequency bands used are kept constant on the frequency axis.
  • the calculated intensities of the speech signal to be evaluated and the reference speech signal in each band are compared with one another.
  • the difference between the two values, or the similarity of the two resulting spectral intensity images, is the basis for the calculation of a quality value
  • a disadvantage of the methods known today in such cases is that when comparing the speech signal to be evaluated with a reference speech signal, differences between the two signal sections in the selected display level flow into the quality characteristic to be calculated, which are not or hardly at all - also perceptible in the auditory test - lead to qualitative impairment.
  • Frequency band limitations and spectral deformations of the speech signal to be evaluated e.g. caused by filter properties of the telephone device or the transmission channel
  • the object of the invention is to reduce the influence of spectral limitations and deformations of the speech signal to be evaluated and of shifts in spectral short-term maxima before comparing the spectral properties of a signal to be tested with a reference speech signal and calculating a quality value in instrumental methods.
  • a spectral weighting function is generated in the invention described here, which is based on medium spectral envelopes, e.g. the average spectral power density, based on the speech signal to be evaluated and the reference speech signal. This also enables the method to be used for non-linear and time-variant transmission.
  • the spectral weighting function is calculated from the quotients of the base values of the mean spectral power density of the signal Phi ⁇ (f) to be evaluated and that of the input signal of the transmission system Phi ⁇ (f) in such a way that the weighting function over
  • the evaluation function a (f) can weight the weighting function W ⁇ (f) differently over the effective range, in the simplest case it is constant 1.
  • the spectral weighting function W ⁇ (f) calculated in this way approximates the mean spectral envelopes of the speech signal and the reference speech signal to be evaluated, so that differences between the two spectral envelopes are only incorporated to a reduced extent in the calculated quality value.
  • the spectral weighting function W ⁇ (f) can be applied to the reference speech signal.
  • the average spectral power density of the reference speech signal is approximated to the signal to be evaluated (FIG. 2a).
  • the spectral weighting function can be applied inverted to the signal to be evaluated. This is equalized and, with regard to its average spectral power density, approximated to the reference speech signal (FIG. 2b).
  • Another part of the invention relates to the correction of shifts in short-term spectral maxima caused by the transmission systems.
  • the intensity is integrated in frequency bands for each time period.
  • the result is a series of intensity values for each spectral representation of a signal section, each individual value representing the intensity in a frequency band.
  • the shifts in short-term spectral maxima can lead to deviating calculated intensities in the frequency bands of the reference speech signal and the speech signal to be evaluated.
  • variable band limits for calculating the spectral intensity mapping is not only limited to the signal in which the described spectral weighting function W ⁇ (f) is also used, but can also be applied to the other signal and even to both signals, ( see FIGS. 2a and 2b).
  • a special exemplary embodiment shows an implementation according to FIG. 3, which is referred to as TOSQA (Telecommunication Objective Speech Quality Assessment). This involves advanced preprocessing of the reference speech signal.
  • TOSQA Telecommunication Objective Speech Quality Assessment
  • speech pauses are recognized here by means of a speech pause recognizer and do not go into the quality measure.
  • the reference speech signal and the speech signal to be evaluated are also filtered with a bandpass 300 ... 3400 Hz and the frequency response of a telephone handset is filtered.
  • the spectral power density is integrated in frequency groups, which form the basis for the calculation of the specific loudness.
  • the calculated loudness patterns are supplemented by an error evaluation function.
  • the calculated quality value is formed from the mean value of the co-correlation coefficients of the specific loudnesses for each short time segment under consideration from the number of evaluated speech segments.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Machine Translation (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
EP99942871A 1998-08-27 1999-08-14 Procede de determination instrumentale de la qualite vocale Expired - Lifetime EP1048025B1 (fr)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
DE19840548A DE19840548C2 (de) 1998-08-27 1998-08-27 Verfahren zur instrumentellen Sprachqualitätsbestimmung
DE19840548 1998-08-27
PCT/EP1999/005972 WO2000013173A1 (fr) 1998-08-27 1999-08-14 Procede de determination instrumentale de la qualite vocale

Publications (2)

Publication Number Publication Date
EP1048025A1 true EP1048025A1 (fr) 2000-11-02
EP1048025B1 EP1048025B1 (fr) 2003-11-05

Family

ID=7879918

Family Applications (1)

Application Number Title Priority Date Filing Date
EP99942871A Expired - Lifetime EP1048025B1 (fr) 1998-08-27 1999-08-14 Procede de determination instrumentale de la qualite vocale

Country Status (6)

Country Link
US (1) US7013266B1 (fr)
EP (1) EP1048025B1 (fr)
AT (1) ATE253765T1 (fr)
CA (1) CA2305652A1 (fr)
DE (2) DE19840548C2 (fr)
WO (1) WO2000013173A1 (fr)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001065543A1 (fr) * 2000-02-29 2001-09-07 Telefonaktiebolaget Lm Ericsson (Publ) Compensation du filtrage lineaire a l'aide de facteurs de ponderation de frequence
EP1241663A1 (fr) * 2001-03-13 2002-09-18 Koninklijke KPN N.V. Procédé et dispositif pour déterminer la qualité d'un signal vocal
EP1292036B1 (fr) * 2001-08-23 2012-08-01 Nippon Telegraph And Telephone Corporation Méthodes et appareils de decodage de signaux numériques
DE10142846A1 (de) * 2001-08-29 2003-03-20 Deutsche Telekom Ag Verfahren zur Korrektur von gemessenen Sprachqualitätswerten
DE10150519B4 (de) * 2001-10-12 2014-01-09 Hewlett-Packard Development Co., L.P. Verfahren und Anordnung zur Sprachverarbeitung
US7305341B2 (en) * 2003-06-25 2007-12-04 Lucent Technologies Inc. Method of reflecting time/language distortion in objective speech quality assessment
DE60305306T2 (de) * 2003-06-25 2007-01-18 Psytechnics Ltd. Vorrichtung und Verfahren zur binauralen Qualitätsbeurteilung
PT1792304E (pt) * 2004-09-20 2008-12-04 Tno Compensação de frequência para análise de percepção de voz
EP2249333B1 (fr) * 2009-05-06 2014-08-27 Nuance Communications, Inc. Procédé et appareil d'évaluation d'une fréquence fondamentale d'un signal vocal
EP2388779B1 (fr) * 2010-05-21 2013-02-20 SwissQual License AG Procédé d'évaluation de la qualité vocale
EP2828853B1 (fr) * 2012-03-23 2018-09-12 Dolby Laboratories Licensing Corporation Méthode et dispositif de détermination d'un niveau de parole corrigé
CN112233693B (zh) * 2020-10-14 2023-12-01 腾讯音乐娱乐科技(深圳)有限公司 一种音质评估方法、装置和设备

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3708002A1 (de) * 1987-03-12 1988-09-22 Telefonbau & Normalzeit Gmbh Messverfahren zum beurteilen der guete von sprachcodierern und/oder uebertragungsstrecken
US4860360A (en) * 1987-04-06 1989-08-22 Gte Laboratories Incorporated Method of evaluating speech
GB9213459D0 (en) 1992-06-24 1992-08-05 British Telecomm Characterisation of communications systems using a speech-like test stimulus
SE517836C2 (sv) * 1995-02-14 2002-07-23 Telia Ab Metod och anordning för fastställande av talkvalitet
NL9500512A (nl) * 1995-03-15 1996-10-01 Nederland Ptt Inrichting voor het bepalen van de kwaliteit van een door een signaalbewerkingscircuit te genereren uitgangssignaal, alsmede werkwijze voor het bepalen van de kwaliteit van een door een signaalbewerkingscircuit te genereren uitgangssignaal.
ES2161965T3 (es) * 1996-05-21 2001-12-16 Koninkl Kpn Nv Dispositivo y procedimiento para la determinacion de la calidad de una señal de salida, para ser generada por un circuito de procesamiento de señal.

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO0013173A1 *

Also Published As

Publication number Publication date
EP1048025B1 (fr) 2003-11-05
US7013266B1 (en) 2006-03-14
CA2305652A1 (fr) 2000-03-09
WO2000013173A1 (fr) 2000-03-09
DE59907623D1 (de) 2003-12-11
ATE253765T1 (de) 2003-11-15
DE19840548A1 (de) 2000-03-02
DE19840548C2 (de) 2001-02-15

Similar Documents

Publication Publication Date Title
DE69401514T2 (de) Vom rechenaufwand her effiziente adaptive bitzuteilung für kodierverfahren und kodiereinrichtung
DE10041512B4 (de) Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen
DE19952538C2 (de) Automatische Verstärkungsregelung in einem Spracherkennungssystem
DE60024501T2 (de) Verbesserung der perzeptuellen Qualität von SBR (Spektralbandreplikation) UND HFR (Hochfrequenzen-Rekonstruktion) Kodierverfahren mittels adaptivem Addieren von Grundrauschen und Begrenzung der Rauschsubstitution
DE60303214T2 (de) Verfahren zur reduzierung von aliasing-störungen, die durch die anpassung der spektralen hüllkurve in realwertfilterbanken verursacht werden
DE69121312T2 (de) Geräuschsignalvorhersagevorrichtung
EP0938831B1 (fr) Evaluation de la qualite, a adaptation auditive, de signaux audio
EP1048025B1 (fr) Procede de determination instrumentale de la qualite vocale
DE69730721T2 (de) Verfahren und vorrichtungen zur geräuschkonditionierung von signalen welche audioinformationen darstellen in komprimierter und digitalisierter form
DE602004010634T2 (de) Verfahren und system zur sprachqualitätsvorhersage eines audioübertragungssystems
DE3043516C2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE2636032B2 (de) Elektrische Schaltungsanordnung zum Extrahieren der Grundschwingungsperiode aus einem Sprachsignal
DE19505435C1 (de) Verfahren und Vorrichtung zum Bestimmen der Tonalität eines Audiosignals
DE60024403T2 (de) Verfahren zur extraktion von klangquellen-informationen
EP1382034B1 (fr) Procede de determination de valeurs caracteristiques d'intensite de bruits de fond dans des pauses de voix de signaux vocaux
DE10157535B4 (de) Verfahren und Vorrichtung zur Reduzierung zufälliger, kontinuierlicher, instationärer Störungen in Audiosignalen
DE69401959T2 (de) Vom rechenaufwand her effiziente adaptive bitzuteilung für kodierverfahren und einrichtung mit toleranz für dekoderspektralverzerrungen
WO2001084536A1 (fr) Procede de calcul d'une decision d'activite vocale (detecteur d'activite vocale)
EP0535425B1 (fr) Procédé d'amplification de signaux acoustiques pour les malentendants et dispositif pour la réalisation du procédé
DE2357949A1 (de) Verfahren zum ermitteln des der periode der anregungsfrequenz der stimmbaender entsprechenden intervalls
DE4437287C2 (de) Verfahren zur Messung der Erhaltung stereophoner Audiosignale und Verfahren zur Erkennung gemeinsam codierter stereophoner Audiosignale
DE2506771C2 (de) Verfahren zur Verbesserung der Sprechererkennung
DE19854420C2 (de) Verfahren und Einrichtung zum Verarbeiten von Schallsignalen
DE19710953A1 (de) Verfahren und Vorrichtung zur Erkennung von Schallsignalen
DE4236315C1 (de) Verfahren zur Sprachcodierung

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

17P Request for examination filed

Effective date: 20000911

RTI1 Title (correction)

Free format text: METHOD FOR OBJECTIVE VOICE QUALITY EVALUATION

RTI1 Title (correction)

Free format text: METHOD FOR OBJECTIVE VOICE QUALITY EVALUATION

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031105

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20031105

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031105

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031105

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031105

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: NV

Representative=s name: ISLER & PEDRAZZINI AG

Ref country code: CH

Ref legal event code: EP

REF Corresponds to:

Ref document number: 59907623

Country of ref document: DE

Date of ref document: 20031211

Kind code of ref document: P

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: GERMAN

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040205

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040205

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040205

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040216

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20040211

NLV1 Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act
REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

ET Fr: translation filed
PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040814

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040831

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040831

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20040806

BERE Be: lapsed

Owner name: DEUTSCHE *TELEKOM A.G.

Effective date: 20040831

REG Reference to a national code

Ref country code: CH

Ref legal event code: PCAR

Free format text: ISLER & PEDRAZZINI AG;POSTFACH 1772;8027 ZUERICH (CH)

BERE Be: lapsed

Owner name: DEUTSCHE *TELEKOM A.G.

Effective date: 20040831

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040405

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 18

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 19

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20180827

Year of fee payment: 20

Ref country code: FR

Payment date: 20180824

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: AT

Payment date: 20180821

Year of fee payment: 20

Ref country code: GB

Payment date: 20180828

Year of fee payment: 20

Ref country code: CH

Payment date: 20180827

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 59907623

Country of ref document: DE

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20190813

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK07

Ref document number: 253765

Country of ref document: AT

Kind code of ref document: T

Effective date: 20190814

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20190813