DE2610439A1

DE2610439A1 - CIRCUIT ARRANGEMENT FOR THE AUTOMATIC DETECTION OF LANGUAGE

Info

Publication number: DE2610439A1
Application number: DE19762610439
Authority: DE
Inventors: Hiroaki Sakoe
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1975-03-12
Filing date: 1976-03-12
Publication date: 1976-09-16
Also published as: NL186885B; NL186885C; NL7602579A; GB1546520A; DE2610439C2

Description

PATENTANWALT Dr. jur. UWE DREISS 7 STUTTGART 1PATENT ADVOCATE Dr. jur. UWE DREISS 7 STUTTGART 1

Diplom-Ingenieur, M. Sc. Sch'ickstraße 2 *? K 1 Π 4 ^ Q Graduate engineer, M. Sc. Sch'ickstrasse 2 *? K 1 Π 4 ^ Q

Telefon (0711) 245734 £ U I U 4 O Telephone (0711) 245734 £ UIU 4 O

Telegrammadresse UDEPATTelegram address UDEPAT

j NACHGEREICHTj SUBMITTED

Anmelder: ^Mein Zeichen:Applicant: ^My reference:

Nippon Electric Co Ltd Ui - 310Nippon Electric Co Ltd Ui - 310

33-1, Shiba Gochome33-1, Shiba Gochome

Minato-kuMinato-ku

Amtl. Akt. Z.:Official Act. Z .:

Tokyo (Japan)Tokyo (Japan)

Prioritäten: 1) 12.3.1975 2) 31.10.1975 Priorities: 1) 3/12/1975 2) 10/31/1975

29 891/1975 132 003/197529 891/1975 132 003/1975

Japan JapanJapan Japan

Schaltungsanordnung zur automatischen Erkennung von SpracheCircuit arrangement for automatic detection of language

Die Erfindung betrifft eine Schaltungsanordnung zur automatischen Erkennung von Sprache, die aus kontinuierlich gesprochenen Worten besteht.The invention relates to a circuit arrangement for automatic Recognition of speech made from continuously spoken Words.

Es besteht ein großer Bedarf nach Spr"~herkennungssystemen als Eingabegeräte zur Eingabe von Daten und Programmen in elektronischen Rechenanlagen; die Praxis erwartet demnächst praktikable Systeme, mit denen Sprache automatisch erkannt werden kann.There is a great need for spray detection systems as Input devices for entering data and programs into electronic computing systems; the practice expects to be practicable soon Systems with which speech can be recognized automatically.

Es ist bekannt, daß man ein Sprachmuster durch eine Folge Q-dimensionaler Herkmalsvektoren darstellen kann. Bekannte Systeme dieser Art sind beschrieben in:It is known that a speech pattern can be represented by a sequence of Q-dimensional heritage vectors. Acquaintance Systems of this kind are described in:

■- 2 —■ - 2 -

60 9838/089760 9838/0897

2 B10 4 32 B10 4 3

P. Jenes and M.V. Mathews, "Spoken Digit Recognition Using Tiue-frequency Pattern flatheing"; "Ine Journal of Acustical Society of America", Vol. 32, no. 11 (November 1960); und U.A. lilder, "On the Feasibility of Voice Input to an One Line Computer Prozessing System", Communication of ACM, Vol. 13, i\o. 6 (Juni 1970). iiei diesen Spracherkennungssystemen wird die Übereinstimmung entsprechender flerkmalsvektoren des zu erkennenden Sprachmusters mit den Merkmalsvektoren eines Jezugsmusters geprüft. Es wird ein Ähnliciikeitsmaß berechnet, das ein iiaß für die Übereinstimmung zwischen dem zu erkennenden ,luster am Eingang und dem Bezugsmuster ist. Dieses Ähmlichkeitsmali wird auf der Grundlage der Gesamtsumme der Größen berechnet, die die Ähnlichkeit zwischen den rlerkmals vektor en an den einander zeitlich entsprechenden Positionen innerhalb ihrer Folgen angeben. Ein zuverlässiges Erkennungsergebnis ist bei diesen Systemen unmöglich, wenn die zeitliche Position der Merkmalsvektoren in der einen Folge relativ zu der Lage der Merkmalsvektoren in der anderen Folge variiert. Dies ist dann der Fall, wenn die Geschwindigkeit, mit der ein Wort geäußert wird, variiert bzw. von einem vorgegebenen Wert abweicht. Die Geschwindigkeit, mit der ein Wort geäußert wird, variiert oft um sogar bis zu 30 %. Das führt bei den bekannten Systemen dazu, daß das Ännlichkeitsmaß für ein Wort, das, von derselben Person gesprochen, mit sich selbst verglichen wird, oft trotz dieser Gleichheit nur einen geringen Wert annimmt. Bei denP. Jenes and MV Mathews, "Spoken Digit Recognition Using Tiue-frequency Pattern flatheing";"Ine Journal of Acustical Society of America", Vol. 32, no. 11 (November 1960); and UA lilder, "On the Feasibility of Voice Input to an One Line Computer Processing System", Communication of ACM, Vol. 13, i \ o. 6 (June 1970). In these speech recognition systems, the correspondence of corresponding feature vectors of the speech pattern to be recognized with the feature vectors of a related pattern is checked. A similarity measure is calculated, which is an indicator of the correspondence between the luster to be recognized at the input and the reference pattern. This similitude penalty is calculated on the basis of the total sum of the quantities which indicate the similarity between the characteristic vectors at the positions within their sequences that correspond to one another over time. A reliable recognition result is impossible with these systems if the position in time of the feature vectors in one sequence varies relative to the position of the feature vectors in the other sequence. This is the case when the speed at which a word is uttered varies or deviates from a predetermined value. The speed at which a word is uttered often varies by as much as 30 %. In the known systems, this means that the degree of similarity for a word which, spoken by the same person, is compared with itself, often only assumes a low value despite this equality. Both

- 3 609838/0897 - 3 609838/0897

herkömmlichen Spracherkennungssystemen muß ferner eine Wortfolge auch Wort für Wort ausgesprochen werden; dies ist für die sprechende Person eine starke Einschränkung und bedeutet auch eine Reduzierung der Sprechgeschwindigkeit.conventional speech recognition systems must also have a word sequence can also be pronounced word for word; this is and means a severe limitation for the person speaking a reduction in the rate of speech.

Um kontinuierliche Sprache, die aus kontinuierlich gesprochenen vVorten zusammengesetzt ist, erkennen zu können, muH. für jedes Wort das Sprachmuster für ein Wort getrennt erkennbar sein. Ein Vorschlag hierfür ist in der US-PS 3 816 72 2 gemacht (Computer for Calculating the Similarity between Patterns and Pattern Recognition System comprising the Similarity Computer), deren Miterfinder der Erfinder der vorliegenden Erfindung ist. i3ei diesem System wird mit dynamischer Programmierung die kontinuierliche Sprache Wort fiir Wort getrennt. Die Trennung von kontinuierlicher Sprache in einzelne '.»'orte (Segmentierung) ist jedoch noch nicht leicht durchführbar.To continuous language that consists of continuously spoken vVorts is composed, to be able to recognize, must. for each word the language pattern for a word can be recognized separately. A A proposal for this is made in US Pat. No. 3,816,722 (Computer for Calculating the Similarity between Patterns and Pattern Recognition System comprising the Similarity Computer), the co-inventor of which is the inventor of the present invention. i3ei this system becomes continuous with dynamic programming Language separated word for word. Separation from continuous Speech in individual '. »' Places (segmentation) is however not yet easily feasible.

Es ist Aufgabe der vorliegenden Erfindung, eine Schaltungsanordnung zur automatischen Erkennung von Sprache zu schaffen, bei der das Maß für die Übereinstimmung von zu erkennenden Worten und Bezugsmuster von der Geschwindigkeit der Sprache unabhängig ist und bei dem eine Trennung der kontinuierlichen Sprache in liorte leicht möglich ist.It is the object of the present invention to provide a circuit arrangement to create automatic recognition of speech, in which the measure for the correspondence of to be recognized Words and reference patterns are independent of the speed of speech and in which there is a separation of the continuous Language in liorte is easily possible.

Erfindungsgemäß wird diese Aufgabe durch die im Patentanspruch angegebenen Merkmale gelöst.According to the invention, this object is achieved by the claims specified features solved.

6 0 9 B 3 P / Π Β 9 76 0 9 B 3 P / Π Β 9 7

Die Erfindung beruht u.a. darauf, daß alle Folgen von ι lüstern, einschließlich der Bezugs-ivortmuster, in allen möglichen Reihenfolgen zueinander angeordnet, die Bezugsmuster bilden. jas zwischen dem zu erkennenden oprachiiiuster am Eingang und diesen iiezugsmus tern gegebene Ähnlichkeitsmaß wird berechnet durch Prüfung der Übereinstimmung (Matching), ohne daß dabei auf ein Üegmentierungsverfahren zurückgegriffen werden müßte. Das iiezugsmus ter, bei dem sich das maximale Ähnlichkeitsmaß ergibt, wird als endgültiges Erkennungsergebnis angenommen. Die Erkennung der kontinuierlichen Sprachmuster erfolgt in drei Schritten. .lan errechnet damit eine praktikable Betriebsgeschwindigkeit. The invention is based, inter alia, on the fact that all sequences of ι lascivious, including the reference word pattern, arranged in all possible sequences with respect to one another, forming the reference pattern. The degree of similarity given between the oprachiiiuster to be recognized at the input and these drawing patterns is calculated by checking the agreement (matching) without having to resort to a segmentation method. The drawing pattern that yields the maximum degree of similarity is assumed to be the final recognition result. The recognition of the continuous speech patterns takes place in three steps. .lan uses this to calculate a practicable operating speed.

I hi ersten Schritt werden die Ahnlichkeitsmaße zwischen den Bezugsmustern und den partiellen Mustern des Musters am Eingang üerechnet. Diese partiellen Muster erstrecken sich von einem ersten gegebenen Zeitpunkt (Startpunkt) bis zu einem zweiten gegebenen Zeitpunkt (Endpunkt). Es ergibt sich somit ein maximales Ähnlichkeitsmaß und ein diesem entsprechendes Tv^rort als partielles Ähnlichkeitsmaß und dementsprechend ein partielles Hrkennungsergebnis jeweils für ein partielles Muster. Die partiellen Ähnlichkeitsmaße und die partiellen Erkennungsergebnisse werden dann an den Start- und Endpunkten gespeichert. Dieses Verfahren wird für alle Start- und Endpunkte wiederholt; die Ergebnisse dieses Verfahrensschrittes werden als Tabelle gespeichert.In the first step, the degree of similarity between the reference patterns and the partial patterns of the pattern at the input are calculated. These partial patterns extend from a first given point in time (starting point) to a second given point in time (end point). This results in a maximum similarity and that corresponding location ^r Tv as a partial similarity measure and accordingly, a partial Hrkennungsergebnis each for a partial pattern. The partial similarity measures and the partial recognition results are then stored at the start and end points. This process is repeated for all start and end points; the results of this process step are saved as a table.

-ό--ό-

609838/0897609838/0897

In einem zweiten Schritt wird diese Tabelle dann ausgewertet. Es wird eine Folge von partiellen Mustern derart ausgewählt, daß eine darin enthaltene Vielzahl von partiellen Mustern so aneinandergefügt werden kann, daß sich keine Überlappungen und keine Zwischenräume ergeben. Innerhalb dieser Folge wird dann die Summe der partiellen Ähnlichkeitsmaße maximiert.This table is then evaluated in a second step. A sequence of partial patterns is selected in such a way that that a plurality of partial patterns contained therein can be joined together so that no overlaps and leave no gaps. The sum of the partial similarity measures is then maximized within this sequence.

Im dritten und letzten Schritt werden danach die partiellen Erkennungsergebnisse aus der Tabelle extrahiert, die den im zweiten Schritt ermittelten partiellen Mustern entsprechen. Sie bilden das Bndergebnis.In the third and final step, the partial recognition results are then extracted from the table that corresponds to the Partial patterns determined in the second step. They form the bond result.

Die Berechnungen, die xvährend des er^ -en und des zweiten Schrittes anfallen, erfolgen mit dynamischer Programmierung. Dieses Verfahren ist im einzelnen in dem Buch "Applied Dynamic Programming", Princeton University Press (1962) , S. 3-29, beschrieben.The calculations made during the first and second Steps that arise are carried out with dynamic programming. This procedure is detailed in the book "Applied Dynamic Programming ", Princeton University Press (1962), Pp. 3-29.

Ausführungsbeispiele der Erfindung und ihrer vorteilhaften Weiterbildungen werden im folgenden unter Bezugnahme auf die beigefügten Zeichnungen beschrieben. Es stellen dar:Embodiments of the invention and its advantageous Further developments are described below with reference to the accompanying drawings. They represent:

Fig. 1 die schematische Darstellung zweier Sprachmuster für den gleichen kontinuierlich gesprochenen Text;1 shows the schematic representation of two speech patterns for the same continuously spoken text;

Fig. 2 eine grafische Darstellung zur Erläuterung der Erfindung;Fig. 2 is a graphical representation to explain the invention;

609838/0897609838/0897

Fig. 3Α u. 3Β Sprachniuster zur Erläuterung der Erfindung;3Α and 3Β language users to explain the invention;

Fig. 4 eine grafische Darstellung zur Erläuterung der Erfindung;Fig. 4 is a graphic representation to explain the invention;

Fig. 5 ein Blockschaltbild eines ersten Ausführungsbeispiels; Fig. 5 is a block diagram of a first embodiment;

Fig. 6 ein Blockschaltbild der in Fig. 5 gezeigten Recheneinheit;FIG. 6 is a block diagram of the computing unit shown in FIG. 5;

Fig. 7 eine grafische Darstellung zur Erläuterung7 is an explanatory diagram

des zweiten Ausführungsbeispiels nach Fig. 8;of the second embodiment according to FIG. 8;

Fig. 8 ein Blockschaltbild eines zweiten Ausführungsbeispiels ; 8 is a block diagram of a second embodiment;

Fig. 9 eine grafische Darstellung zur Erläuterung des zweiten Ausführungsbeispiels, und9 is a diagram for explaining the second exemplary embodiment, and FIG

Fig. 10 ein Blockschaltbild des in Fig. 8 verwendeten zweiten Rechners.FIG. 10 is a block diagram of the second computer used in FIG.

Anhand der Fig. 1 bis 4 wird die fortlaufende Spracherkennung zunächst im Prinzip erklärt.On the basis of FIGS. 1 to 4, the continuous speech recognition is first explained in principle.

Man kann, wie bereits erwähnt, ein Sprachmuster als zeitliche Folge Q-dimensionaler Merkmalsvektoren darstellen. Das Sprachmuster am Eingang seiAs already mentioned, a speech pattern can be represented as a temporal sequence of Q-dimensional feature vectors. That Speech pattern at the entrance

— 7 —- 7 -

609 83 8/0897609 83 8/0897

Dabei ist a. für das Sprachmuster A am Eingang ein i>Ierkmalsvektor im i-ten Zeitpunkt. Es sei fernerWhere a. for the speech pattern A at the input an i> I feature vector in the i-th point in time. It is furthermore

^ai ⁼ (^a1i* ^a2i» ··'» ^aq0 ' ^a i ⁼ ( ^a 1i * ^a 2i »·· '» ^a q0'

Die kontinuierliche und zu erkennende Sprache sei eine Folge von Ziffern (O, 1, 2, ..., n, ..., 9); ein Bezugs-'tv'ortmuster sei ä^a (a =0, ..., 9) mit einer Anzahl von J Merkmalsvektoren. Es sei: The continuous and recognizable language is a sequence of digits (O, 1, 2, ..., n, ..., 9); a reference 'tv' location pattern is ^a (a = 0, ..., 9) with a number of J feature vectors. Let it be:

¹ ^L 3 J_n ¹ ^L 3 Y _n

Dabei ist b¹-¹ für das ßezu<;s-iiortmuster Bⁿ ein Herkmalsvektor im.j-ten Zeitpunkt. £s sei fernerHere, b ¹ - ¹ for the ßzu <; s-location pattern B ^{n is} a traditional vector at the j-th point in time. Let it be further

J ist die Dauerperiode eines Bezugs-Wortmusters.J is the duration period of a reference word pattern.

Der Einfachheit halber wird folgende Darstellung benutzt:For the sake of simplicity, the following representation is used:

(3)(3)

^bj^{= (b}1j' ^b2j» "··» ^b j ^{= (b} 1j ' ^b 2j »" ·· »

609838/0807609838/0807

Das Bezugsrauster für die kontinuierliche Sprache wird nun als Folge von Bezugs-Wortraustern für Ziffern, die Wort für rfort ausgesprochen werden, dargestellt. In anderen Worten: ein Muster für kontinuierliche Sprache, die durch kontinuierliche Aussprache der Ziffern n(1), n(2), ..., n(x), ... n(Y) gegeben ist, wird durch eine Folge von Mustern dargestellt, die aus den entsprechenden Bezugs-Wortmustern wie folgt zusammengesetzt ist:The reference pattern for continuous speech is now a sequence of reference word patterns for digits that word for rfort to be pronounced. In other words, a pattern for continuous language created by continuous Pronunciation of digits n (1), n (2), ..., n (x), ... n (Y) is given, is represented by a sequence of patterns, those from the corresponding reference word patterns as follows is composed:

B = B ^UJ © B *· ^J ... ©Β ^{l J} ... (+J B ^L ^J (4)B = B ^UJ © B * ^J ... © Β ^{l J} ... (+ J B ^L ^J (4)

Für die Folge vonFor the sequence of

B = B^m 0 Bⁿ = b™ , b^m , ..., b¹? , bⁿ, b* ..., b^B = B ^m 0 B ⁿ = b ™, b ^m , ..., b ¹ ? , b ⁿ , b * ..., b ^

m ^{λ Δ} ^Jnm ^{λ Δ} ^J n

Der Einfachheit halber wird das Bezugsmuster B und sein Merkmalsvektor wie folgt dargestellt:For the sake of simplicity, the reference pattern is B and its feature vector shown as follows:

^bj ⁼ ^b j ⁼

(6)(6)

Die Komponenten dieses Vektors gewinnt man z.B. durch Abtasten der Q-Ausgänge eines Q-Kanal-Spektrum-Analysators in einem bestimmten Zeitpunkt.The components of this vector are obtained, for example, by scanning the Q outputs of a Q-channel spectrum analyzer in one specific time.

609838/0897609838/0897

Die MerkmalsVektoren a. und b^ zu den entsprechenden Zeitpunkten in den jeweiligen Folgen für dieselbe Sprache ergeben nun nicht notwendigerweise ein und dasselbe Phonem, da die Geschwindigkeit, mit der Worte, auch von derselben Person, gesprochen werden, verschieden sein kann. Z.ß. sei angenommen, daß die Muster A und B (vgl. Fig. 1) beide die Folge von Phonemen/san-ni-go/ (japanische Lautschrift für die Aussprache der Ziffernfolge "drei-zivei-fünf") darstellen sollen. Beide Folgen seien jedoch mit unterschiedlicher Geschwindigkeit ausgesprochen. Der Vektor a. im Zeitpunkt stellt dann z.B. das Phonem /s/ dar, während der Vektor b. im entsprechenden Zeitpunkt 20' ein davon unterschiedliches Phonem, nämlich /a/, darstellt. Mit herkömmlichen Verfahren zur Berechnung des Ähnlichkeitsmaßes, bei denen über i die Summe der Korrelationskoeffizienten Jf (a. , b.) gebildet wird, ergibt beim in Fig. 1 dargestellten Beispiel eine lediglich geringe Ähnlichkeit beider Muster. Dies kann somit zu einer falschen Feststellung eines Musters führen.The feature vectors a. and b ^ at the corresponding points in time in the respective sequences for the same language do not necessarily result in one and the same phoneme, since the speed with which words are spoken, even by the same person, can be different. Z.ß. it is assumed that the patterns A and B (see FIG. 1) are both intended to represent the sequence of phonemes / san-ni-go / (Japanese phonetic transcription for the pronunciation of the sequence of digits "three-zivei-five"). However, both consequences are pronounced at different speeds. The vector a. in time then, for example, the phoneme / s / represents, while the vector b. at the corresponding point in time 20 'represents a different phoneme, namely / a /. With conventional methods for calculating the degree of similarity, in which the sum of the correlation coefficients Jf (a., B.) Is formed via i, the example shown in FIG. 1 results in only a slight similarity between the two patterns. Thus, this can lead to a false detection of a pattern.

Ganz allgemein kann die Dauer jedes Phonems bei seiner tatsächlichen Äußerung erheblich unterschiedlich sein, ohne daß die Bedeutung der gesprochenen Worte dadurch beeinträchtigt wird. Für die Prüfung der Übereinstimmung (matching) wird daher ein Ähnlichkeitsmaß verwendet, das von diesen Änderungen der Geschwindigkeit der Äußerungen unabhängig ist.In general terms, the duration of each phoneme when it is actually uttered can vary considerably without that the meaning of the spoken words is affected. For checking the conformity (matching) a similarity measure is therefore used which is independent of these changes in the speed of the utterances.

- 10 609838/0897 - 10 609838/0897

In Fig. 2 sind entlang der Abszisse i bzw. der Ordinate j Folgen von Merkmalsvektoren angeordnet. Als Ähnlichkeitsmaß s (a., b-) zwischen den Vektoren a. und b- wird nun folgendes Scalar-Produkt der Merkmals vektoren a- und b- angex\^rendet:In FIG. 2, sequences of feature vectors are arranged along the abscissa i and the ordinate j. As a measure of similarity s (a., B-) between the vectors a. and b- the following scalar product of the feature vectors a- and b- angex \ ^r ends:

s (a., ,T.) = ^ (a_pi xb_pj) (7)s (a.,, _T. _{) = ^ (a pi} xb pj) (7)

P=1P = 1

Nach der Erfindung wird nun das Äimlichkeitsmaß 3 (A, B) zwischen dem Sprachmuster A am Eingang und den Bezugsmustern B für alle Bezugsmuster berechnet. Die Anzahl Y der Beζugs-Wortmuster und die entsprechenden Ziffern n(1), n(2), ..., n(X) , ..., n(Y) , für die man das Maximum des.ÄhnlichkeitsmaßesAccording to the invention, the similitude measure 3 (A, B) between the speech sample A at the input and the reference samples B is calculated for all reference samples. The number Y the inflection word pattern and the corresponding digits n (1), n (2), ..., n (X), ..., n (Y), for which the maximum of the similarity measure

max ( _ "^max ( _ "^

Y, n(1), n(2), ..., n(x) , ..., n(Y) S (A, B) I (8)Y, n (1), n (2), ..., n (x), ..., n (Y) S (A, B) I (8)

erhält, werden berechnet, um zu dem Erkennungsergebnxs zu gelangen.are calculated to arrive at the recognition result.

Eine direkte Berechnung des Maximums des Ähnlichkeitsmaßes gem. Gleichung (8) erfordert viel Zeit und beeinflußt daher sowohl die Kosten als auch notwendige Betriebszeit eines solchen Systems in gewissem Umfang negativ. Um dies zu vermeiden, wird nun, wie in Fig. 3A gezeigt, das Sprachmuster A am Eingang in partielle Muster von einem ersten gegebenen Zeitpunkt (Startpunkt) 1 bis zu einem zweiten gegebenen Zeitpunkt (Endpunkt) m aufgeteilt. Es gilt:A direct calculation of the maximum of the degree of similarity according to equation (8) requires a lot of time and therefore influences Both the cost and necessary uptime of such a system negatively to some extent. To avoid this, Now, as shown in Fig. 3A, the speech pattern A at the input is given in partial patterns of a first one Point in time (starting point) 1 divided up to a second given point in time (end point) m. The following applies:

609838/0897 " ^U "609838/0897 " ^U "

A(I,in) -A (I, in) -

In anderen liorten: wie in Fig. 3B gezeigt, wird das Sprachmuster A am Eingang in Y-Gruppen partieller Muster mit (Y-1) Unterbrechungspunkten 1(1), 1(2), ..., l(X), ···» 1(Y-1) aufgeteilt. Es gilt also:In other liorten: as shown in Fig. 3B, the speech pattern A at the entrance in Y groups of partial patterns with (Y-1) breakpoints 1 (1), 1 (2), ..., l (X), ··· »1 (Y-1) divided up. So the following applies:

A =A =

..0 Α(1_(χ__υ ..0 Α (1 _(χ _ _υ

(10)(10)

/·γ_ -J -ι , 1 /·γ>, J ./ · Γ_ -J -ι, 1 / · γ>, J.

Führt man Gleichungen (4) und (10) in den Ausdruck (8) ein, so erhält man:Introducing equations (4) and (10) into expression (8), one obtains:

< ν<ν

maxMax

1(1), 1(2)··· ¹CX-I)1 (1), 1 (2) ··· ¹ CX-I)

^SCAC1(X-1) ^SCAC1 (X-1)

η (χ)η (χ)

(11)(11)

Die Maximierung des Terms auf der rechten Seite von Ausdruck (11) wird in zwei Schritten ausgeführt. Der erste Schritt ist die Maximierung bezüglich der Ziffern n(1), n(2),'..., n(Y). Der zweite Schritt ist die Maximierung bezüglich der Zahl Y und der Unterbrechungspunkte 1(1), 1(2), ..., 1(Y-1). Das ergibt folgenden Ausdruck:Maximizing the term to the right of Expression (11) is carried out in two steps. The first step is to maximize with respect to the digits n (1), n (2), '..., n (Y). The second step is to maximize the number Y and the breakpoints 1 (1), 1 (2), ..., 1 (Y-1). That results in the following expression:

max 1max 1

(Y-1)(Y-1)

,η (χ), η (χ)

(12)(12)

Nach der Erfindung wird als Ähnlichkeitsmaß S(A(I _f ..-> ,According to the invention, the similarity measure S (A (I _f ..->,

1 _f ^) ßⁿ^^x-|j- ein hinsichtlich eines Zeitabschnittes normali-1 _f ^) ß ⁿ ^ ^x - | j- a normal-

6098 3 8/08976098 3 8/0897

- 12 -- 12 -

siertes Aim Ii dike its maß verwendet. Genauer: Das partielleAim Ii dike its measure used. More precisely: the partial

Muster A(l_{r 1Λ}, I₁- .) wird allgemein dargestellt durch 1.x- ι j ^Xj Pattern A (l _{r 1Λ} , I ₁ - .) Is generally represented by 1.x- ι j ^ Xj

^{C =C =}

^C1' ^C 1 '

i' "· ^C i '"· ^C

(13)(13)

tfird das Bezugsmuster Β¹¹*·^χ-' wie in Gleichung (3) dargestellt, dann ergibt sich das Ähnlichkeitsmaß S(C, B) durh die folgende Gleichung:tfird the reference pattern Β ¹¹ * · ^χ - 'as shown in Equation (3), then there is the similarity measure S (C, B) durh the following equation:

S(C, ß) = " S (C, ß) = "

Dabei ist j = j (i).Where j = j (i).

i-1i-1

Cc₁, b_j(1))Cc ₁ , b _{j (1)} )

(14)(14)

Die Berechnung des Ähnlichkeitsmaßes nach Gleichung (14) ist nun mit Hilfe dynamischer Programmierung in geeigneter tfeise möglich. Die Rekursionskoeffizienten bzw. Kumulationsmengen, die für die Ähnlichkeit repräsentativ sind, werden durch die Cleicnun,"The calculation of the degree of similarity according to equation (14) is now in a suitable manner with the help of dynamic programming possible. The recursion coefficients or cumulation sets that are representative of the similarity are given by the Cleicnun, "

maxMax

(i-1, J)(i-1, J)

(i, j-1) (i-1, j-2)(i, j-1) (i-1, j-2)

(15)(15)

gegeben. Dabei ist 1 ^ j «s J..given. Where 1 ^ j «s J ..

Bei der Berechnung beginnt man mit der AnfangsbedingungThe calculation starts with the initial condition

j - 1j - 1

J(1,1) - S(C₁, O₁)J (1,1) - S (C ₁ , O ₁ )

(16)(16)

609838/0897609838/0897

- 13 -- 13 -

Man gelangt dann schließlich zum letzten Rekurs!onskoeffizienten g(I,J) für i = I und j = J. Dabei wird davon ausgegangen, daß das gemäß der Rekursionsgleichung (15) letzte Rekursionskoeffizient g(I, J) die Berechnung der Gleichung (14) ist. Es gilt also:Finally, one arrives at the last recourse coefficient g (I, J) for i = I and j = J. It is assumed that the last Recursion coefficient g (I, J) is the calculation of equation (14). So the following applies:

S(C, B) = g(I, j) (17)S (C, B) = g (I, j) (17)

Es ist darauf hinzuweisen, daß die Geschwindigkeit der Aussprache in der Praxis höchstens u,.i 30 % schwankt. Der Vektor b., der mit einem Vektor c. korreliert wird, liegt daher in der Nachbarschaft von b. . Daher reicht es aus, wenn der Ausdruck (14) oder (15) für die möglichen Entsprechungen (i, j) berechnet werden soll, die der GleichungIt should be pointed out that in practice the speed of pronunciation fluctuates no more than 30%. The vector b., Which corresponds to a vector c. is correlated, is therefore in the vicinity of b. . Therefore, it suffices if the expression (14) or (15) is to be calculated for the possible correspondences (i, j) that of the equation

j -f ä i£ J + y (18)j -f ä i £ J + y (18)

genügen, die als "Normalisierungs fenster" bezeichnet wird. Die ganze Zahl JA kann man nun so bestimmen, daß sie ungefähr 30 % von I oder J ist. Das Normalisierungs fenster, das nach Gleichung (18) gegeben ist, entspricht einer einschränkenden Bedingung für die Berechnung der Gleichung (14) oder (15), die auch so darzustellen ist, daß sie innerhalb des Bereichs liegt, der durch die beiden geraden Linien 2 5 und 26 begrenzt wird. Sie sind durch die Gleichungensuffice, which is referred to as the "normalization window". The whole number JA can now be determined to be about 30 % of I or J. The normalization window given by equation (18) corresponds to a restrictive condition for the calculation of equation (14) or (15), which is also to be represented in such a way that it lies within the range defined by the two straight lines 2 5 and 26 is limited. You are through the equations

j = 1 + f und j = 1 - f j = 1 + f and j = 1 - f

definiert. Der Bereich schließt diese Grenzen mit ein,Are defined. The area includes these limits,

609838/0897 -14609838/0897 -14

Wie oben beschrieben, kann das Ähnlichkeitsmaß zwischen dem partiellen Muster C und dem Bezugs-ftortmuster B durch Berechnung des Rekursions aus drucks (15) unter den Bedingungen nach Gleichung (16) und (18) von i = j = 1 bis i = I und j = J erfolgen. Das ÄhnlichkeitsmaßAs described above, the degree of similarity between the partial pattern C and the reference location pattern B by calculation of the recursion from expression (15) under the conditions of equations (16) and (18) from i = j = 1 to i = I and j = J take place. The similarity measure

Bⁿ B ⁿ

δ(Α(1_(χ__υ I₀₀ Bⁿ) oder S(A(I, in), B)δ (Α (1 _(χ _ _υ I ₀₀ B ⁿ ) or S (A (I, in), B)

kann also zwischen dem partiellen Muster A(I,- .,-., Ir_xO oder A(I, m) und dem Bezugs-ifortmuster B berechnet werden. Es werden dann die Ähnlichkeitsmaße S(A(I, m), B°) bis S(A(I, m) ,B⁹) berechnet.can therefore be calculated between the partial pattern A (I, -., -., Ir _x O or A (I, m) and the reference ifort pattern B. The similarity measures S (A (I, m), B °) to S (A (I, m), B ⁹ ) calculated.

üin partielles Ähnlichkeitsmaß S 1,m wird als das Maximum dieser Ähnlichkeitsmaße S(A(I, m) B°) bis S(A(I, m) , B⁹) definiert. Das partielle Erkennungsergebnis η 1, m ist die Ziffer, mit der man das partielle Ähnlichkeitsmaß S 1, m erhält. Diese Beziehungen werden also in anderen Worten durch folgende Gleichungen gegeben:üa partial similarity measure S 1, m is defined as the maximum of these similarity measures S (A (I, m) B °) to S (A (I, m), B ⁹ ). The partial recognition result η 1, m is the number with which the partial similarity measure S 1, m is obtained. In other words, these relationships are given by the following equations:

1, m > = max / S(A(I1, m> = max / S (A (I

, m), Bⁿ J, m), B ⁿ J

IlIl

η< 1, m > = arg max Γ S(A(I, in), Bⁿ Jη <1, m> = arg max Γ S (A (I, in), B ⁿ J

Die partiellen Ähnlichkeitsmaße und die partiellen Erkennungsergebnisse werden für alle Kombinationen von 1 und m, die der Bedingung 1< m genügen, berechnet und gespeichert. DieseThe partial similarity measures and the partial recognition results are calculated for all combinations of 1 and m that satisfy the condition 1 <m, calculated and stored. These

- 15 609838/0897 - 15 609838/0897

beschriebene Ermittlung der partiellen ÄhnlichKeitsinaße und der partiellen Erkennuagsergebnisse ist der erste Schritt bei der Spracherkennung gemäß der Erfindung.Determination of the partial similitude and the partial recognition results is the first step in speech recognition according to the invention.

In der Praxis liefert die Berechnung der Gleichung (15) von j = 1 bis j=J alle Ähnlichke its maße S(A(I, m) , Bⁿ) für einen bestimmten Punkt 1, sowie ferner für alle m, die der Gleicnung genügen:In practice, the calculation of equation (15) from j = 1 to j = J yields all similarity measures S (A (I, m), B ⁿ ) for a certain point 1, as well as for all m, those of the equation suffice:

1 + J- t -1£ m* 1 + J + f - 1 (19)1 + J- t -1 £ m * 1 + J + f - 1 (19)

Der Endpunkt m des partiellen Musters A(I, m) für normal gesprochene Worte ist stets in dem durch Gleichung (19) definierten Bereich enthalten. Daher müssen keine partiellen Ähnlichkeitsmaße außerhalb dieses Bereiches berechnet werden. Der Grund ergibt sich aus Fig. 4. Auf der Abszisse ist der Startpunkt 1, auf der Ordinate der Endpunkt m aufgetragen. Die Kreuzungspunkte (1, m) in der 1/m-Ebene entsprechen den Kombinationen von Punkten 1 und m (beides sind ganze Zahlen). Der Bereich um einen Punkt m, in dem das Ähnlichkeitsmaß S(A(I, m) , Bⁿ) für einen bestimmten Punkt 1 berechnet wird, ist durch Gleichung (19) definiert. Dieser Bereich hängt von der Dauer J des Bezugs-Wortmus te rs ab und ergibt sich aus folgender Gleichung:The end point m of the partial pattern A (I, m) for normally spoken words is always included in the range defined by equation (19). Therefore, no partial similarity measures need to be calculated outside of this range. The reason is shown in FIG. 4. Starting point 1 is plotted on the abscissa and end point m is plotted on the ordinate. The crossing points (1, m) in the 1 / m plane correspond to the combinations of points 1 and m (both are whole numbers). The area around a point m in which the degree of similarity S (A (I, m), B ⁿ ) is calculated for a certain point 1 is defined by equation (19). This range depends on the duration J of the reference word pattern and results from the following equation:

1 + minf J_nI-Jf- 1 β. * ύ 1.+ ™ax [ J_n] + JN 1 (20)1 + minf J _n I-Jf- 1 β. * ύ 1. + ™ ax [ _{Y n} ] + JN 1 (20)

- 16 609638/089 7- 16 609638/089 7

dieser Bereich liegt in Fig. 4 zwischen den geraden Linien 32 und 33. nie Ähnlichkeitsmaße S(A(I, m) , B¹¹) werden innerhalb dieses Bereichs berechnet. Man erhält daraus die partiellen Ähnlichkeitsmaße S < 1, m > und die partiellen crkennungsergebnisse η < 1, ra>this area lies in FIG. 4 between the straight lines 32 and 33. Never measures of similarity S (A (I, m), B ¹¹ ) are calculated within this area. The partial similarity measures S <1, m> and the partial cr identification results η <1, ra> are obtained therefrom

Die geraden Linien 32 und 3 3 sind durch die GleichungenThe straight lines 32 and 3 3 are through the equations

nin fJ_n nin fJ _n

- 1 , und- 1, and

α= 1 + min m= 1 + max gegeben.α = 1 + min m = 1 + max given.

Auf diese Art und V/eise T^erden die partiellen Ähnlichkeitsmaße und die partiellen Erkennungsergebnisse für alle Punkte (1, m) in dem gestrichelten Bereich in Fig. 4 berechnet. Ferner erfolgt die Berechnung der Ähnlichkeitsmaße S(A(I, m), Bⁿ) für alle Werte von m und einen bestimmten Punkt 1 nach Gleichung (15). Die ähnlichen Berechnungen für alle Bezugs-Wortmuster ergeben die Ähnlichkeitsmaße, die der Reichweite der Linie 10 (oder einem bestimmten Punkt 1) entsprechen. Man erhält so die partiellen Älmlichkeitsmaße und die partiellen Erkennungsergebnisse in dem Bereich, der durch die Linie 10 gegeben ist. Durch Berechnung für alle Werte von 1 erhält man die partiellen Ähnlichkeitsmaße und die partiellen Erkennungsergebnisse in dem gestrichelten Bereich. Die Berechnung lediglich innerhalb dieses einge-In this way, the partial similarity measures and the partial recognition results for all points (1, m) in the dashed area in FIG. 4 are calculated. Furthermore, the similarity measures S (A (I, m), B ⁿ ) are calculated for all values of m and a specific point 1 according to equation (15). The similar calculations for all reference word patterns give the similarity measures that correspond to the range of line 10 (or a particular point 1). The partial similarity measures and the partial recognition results in the area given by the line 10 are thus obtained. By calculating for all values of 1, the partial similarity measures and the partial recognition results in the dashed area are obtained. The calculation only within this

- 17 609838/0897 - 17 609838/0897

grenzten Bereichs mindert das Ausmaß der notwendigen Rechenarbeiten erheblich.limited area significantly reduces the amount of computational work required.

imin the

Der bis jetzt geschilderte erste Schritt ist durch die in folgenden angegebenen sechs Sub-Schritte definiert:The first step described so far is through the in the following six sub-steps are defined:

(1-1) Setze S<l,m> auf O zurück; setze η auf O; (1-2) Setze 1 auf 1;(1-1) reset S <l, m> to O; set η to O; (1-2) set 1 to 1;

(1-3) Berechne Gleichung (15) innerhalb eines Bereiches von Werten für m, die der Gleichung (19) genügen; erhalte auf diese Weise S(A(I,m),Bⁿ) = g(m,J ).(1-3) Calculate equation (15) within a range of values for m that satisfy equation (19); get in this way S (A (I, m), B ⁿ ) = g (m, J).

Setze m auf 1 + J - Γ - 1; (1-4) Ist S(A(I,m),Bⁿ) * S < l,m > , so springe auf (1-5); Ist S(A(l,m),bⁿ) > S < l,m^ , so setze S (A(l,m) ,Bⁿ)Set m to 1 + J - Γ - 1; (1-4) If S (A (I, m), B ⁿ ) * S <l, m>, then jump to (1-5); If S (A (l, m), b ⁿ )> S <l, m ^, then set S (A (l, m), B ⁿ )

und η auf S <l,m > bzw. auf r. <: l,m > ; (1-5) Setze m auf (m + 1);and η on S <l, m> and on r. <: l, m >; (1-5) set m to (m + 1);

Ist m = I⁺J_n ⁺ Γ~Τ» ^s0 springe nach (1-4), undIf m = I ⁺ J _n ⁺ Γ ~ Τ » ^s0 jump to (1-4), and

ist m ^I⁺J_n ⁺ f-1 j so springe nach (1-6); (1-6) Setze η auf (n + 1) ;if m ^ I ⁺ J _n ⁺ f-1 j then jump to (1-6); (1-6) Set η to (n + 1);

Ist η ^ 9, springe nach (1-2), und ist η > 9, so ENDE.If η ^ 9, jump to (1-2), and if η> 9, so END.

Der zweite Schritt bei der Spracherkennung besteht in der Maximierungsrechnung für Y und die Unterbrechungspunkte ¹CIV ¹f2V '"' ¹TxV ·"» ¹TY-I") ^nack Ausdruck (12). Dies geschieht mit dynamischer Programmierung, wie im folgenden beschrieben.The second step in speech recognition consists in the maximization calculation ^{for Y and the breakpoints 1} CIV ¹ f2V '"' ¹ TxV ·" » ¹ TY-I") ^after expression (12). This is done with dynamic programming, as described below .

- 18 -- 18 -

609838/0897609838/0897

26102610

γ maxγ max

*·· ¹Cx) *'* ¹CY-I)' ^" ^S ^λ * ·· ¹ Cx) * '* ¹ CY-I)' ^ " ^S ^λ

x=lx = l

Nun sei angenommen, daß l_rv> = m ist und daßIt is now assumed that l _rv > = m and that

maxMax

, ¹(2)···¹(χ)·'·¹(Υ-ΐγ 2 ^S<lCx-1), ¹ (2) ¹ (χ) ' ¹ (Υ-ΐγ 2 ^{S <l} Cx-1)

l_Cy)-ml _Cy) -m

sei. Dann kann man den Ausdruck [Z2) durch T CI) darstellen. CI ist die Dauer des Musters A am hingang). Man kann den Ausdruck C22) ferner durch folgende Beziehung darstellen:may be. Then the expression [Z2) can be represented by T CI). CI is the duration of pattern A on the hungang). The expression C22) can also be represented by the following relationship:

S< h,
^L S <h,
^L.

= max S< h, m>
^L = max S <h, m>
^L.

Dies zeigt, daß die Berechnung von Ausdruck C2 2) mit dynamischer Programmierung in geeigneter Weise möglich ist. Die Berechnung des Rekursionsausdrucks ist gegeben durchThis shows that the calculation of expression C2 2) with dynamic Programming is possible in a suitable manner. The computation of the recursion expression is given by

TCm) = max / S «h, m> + TCh) | C23)TCm) = max / S «h, m> + TCh) | C23)

Dabei ist m = 1 ^" I.Where m = 1 ^ "I.

Die Berechnung beginnt mit der AnfangsbedingungThe calculation starts with the initial condition

TCO) = O (24)TCO) = O (24)

und wird durchgeführt bisand will be carried out until

T(I) für m=I. Bei der Berechnung des Rekursionsausdruckes (24) istT (I) for m = I. When calculating the recursion expression (24) is

hCm) - argmax [s<h, m > + TCh)J C25)hCm) - argmax [s <h, m > + TCh) J C25)

CDer Operator "argmax" steht für den Wert von "h", für den der Term in der eckigen Klammer ein Maximum wird).The operator "argmax" represents the value of "h", for which the term in square brackets becomes a maximum).

609838/0897 -19-609838/0897 -19-

Dies gilt für alle tferte m von 1 bis I. Sie werden dann gespeichert. Man erhalt dann den erkannten Unterbrechungspunkt Ij- -. , der den optimalen tfert von L· ^ in Ausdruck (21) darstellt, durch Berechnung des P^e kurs ionsaus drucks, der durchThis applies to all m from 1 to I. You will then saved. The recognized breakpoint is then obtained Ij- -. which finds the optimal t of L ^ in expression (21) represents, by calculating the P ^ e course expression, the by

^T(x) ^{= ]1 (r} Cx+1)^{3 C26)} ^T (x) ^{=] 1 (r} Cx + 1) ^{3 C26)}

gegeben ist, und zwar von der Anfangsbedingung lp^ - I bis zum letzten Rekursionskoeffizienten If₀-) - O. Die Zahl Y, die die Anzahl der Werte im Muster A am Eingang angibt, erhält nan als die Zahl X, bei der l_f ·, gleich O wird.is given, namely from the initial condition lp ^ - I to the last recursion coefficient If ₀ -) - O. The number Y, which indicates the number of values in pattern A at the input, receives nan as the number X, for which l _f ·, Becomes equal to O.

Der letzte Schritt des Erkennungsvorgangs besteht darin, daß nan das endgültige Erkennungsergebnis mit NZiffern erhält. Dieses endgültige Erkennungsergebnis ist gegeben durch:The last step of the recognition process is that nan receives the final recognition result with N-digits. This final recognition result is given by:

Dabei ist If_x-J der erkannte Unterbrechungsprunkt, den man bein Verfahrensschritt erhalten hat, η < 1, m > ist das partielle Erkennungsergebnis, das man im ersten Schritt erhalten hat.Here If _x -J is the recognized interruption point that was obtained in the process step, η <1, m > is the partial recognition result that was obtained in the first step.

Fig. 5 zeigt ein erstes Ausführungsbeispiel. In der Eingangsschaltung 61 findet die Analyse des eingehenden kontinuierlichen Sprachsignals statt. Diese Analyse liefert das Muster AFig. 5 shows a first embodiment. In the input circuit 61 the analysis of the incoming continuous speech signal takes place. This analysis yields pattern A.

- 20 -- 20 -

6 Q 3 B 2 Β / 0 R 9 76 Q 3 B 2 Β / 0 R 9 7

am Hingang als Folge von Merkmalsreflektoren a.. Die Eingangsschaltung 61 wird durch ein Mikrophon und einen Q-Kanal-Spektrum-Analysator gebildet; letzterer kann z.B. durch eine Gruppe von den einzelnen Kanälen zugeordneten Filtern und Analog/Digital-Konvertern (AID-Konvertern) gebildet werden. Der Q-Kanal-Spektrum-Analysator kann z.ß. aus den "Frequency Selectors" aufgebaut sein, die beschrieben sind in: "Automatic Word Recognition", IEEE Spectrum, Vol. 8, No. 8 (August 1971), S. 60, Fig. 3. Das Sprachsignal am Eingang wird mit Hilfe des Mikrophons in ein elektrisches Signal umgewandelt. Dieses Signal gelangt dann an die den einzelnen Kanälen zugeordneten Filter. Sie bewirken eine Frequenzteilung in Q-Signale auf den einzelnen Kanälen. Diese Signale gelangen an die A/D-Konverter, in denen sie synchron mit einem Analysier-Taktimpuls abgetastet und in digitale Signale umgewandelt werden, die einen Merkmalsvektor a. = (a*. , &2i> ···» ^aoi^ darstellen. In der Eingangsschaltung 61 werden auch die Start- und Endsignale erzeugt, die zeitlich den Startpunkt und den Endpunkt (i=1 und i=I) des Musters am Eingang angeben; diese Signale werden erzeugt, wenn die Amplitude des Eingangssignals zum ersten und zum letzten Mal einen vorbestimmten Schwellwert überschreitet. Der Zeitpunkt i des Musters A am Eingang wird durch einen Zählervorgang in einem (nicht gezeigten) Zähler in der Eingangsschaltung 61 bestimmt. Die Rückstellung des Zählers auf 1 er-at the entrance as a series of feature reflectors a .. The input circuit 61 is formed by a microphone and a Q-channel spectrum analyzer; the latter can be formed, for example, by a group of filters and analog / digital converters (AID converters) assigned to the individual channels. The Q-channel spectrum analyzer can e.g. be built up from the "Frequency Selectors" which are described in: "Automatic Word Recognition", IEEE Spectrum, Vol. 8, No. 8 (August 1971), p. 60, Fig. 3. The speech signal at the input is converted into an electrical signal with the aid of the microphone. This signal then reaches the filters assigned to the individual channels. They cause a frequency division into Q signals on the individual channels. These signals are sent to the A / D converter, in which they are sampled synchronously with an analyzing clock pulse and converted into digital signals that have a feature vector a. = (A *. &2i> ··· ^»a oi ^ represent. In the input circuit 61, the start and end signals are generated which time the starting point and the end point (i = 1 and i = I) of the pattern at the input These signals are generated when the amplitude of the input signal exceeds a predetermined threshold value for the first and last time. The time i of the pattern A at the input is determined by a counter operation in a counter (not shown) in the input circuit 61. The reset of the counter to 1

- 21 -- 21 -

609838/0897609838/0897

folgt bei Auftreten des Startsignals. <He Zählung erfolgt dann synchron mit dem Analysier-Taktimpuls. Der Zählerstand beim Auftreten des letzten Signals U ist die Dauer I des Musters A am Eingang. Bis zum Auftreten des letzten Signals U ist das die Dauer des Musters A am Eingang anzeigende Signal I-vorhanden und liegt an der Steuerschaltung 60 an.follows when the start signal occurs. <He counting done then synchronous with the analyzing clock pulse. The count when the last signal U occurs is the duration I des Pattern A at the entrance. The signal I- indicating the duration of the pattern A at the input is present until the last signal U occurs and is applied to the control circuit 60.

Der Merkmalsvektor a_i gelangt ferner von der üingangsschaltung 61 an einen Pufferspeicher 62. Seine iü^azität reicht aus, um alle Merkmalsvektoren a- des Musters A am eingang, d.h. a.. , a.., .·., a - , ... a γ zu speichern.The feature vector a _{i also} arrives at a buffer memory 62 from the input circuit 61. Its iü ^ acidity is sufficient to store all feature vectors a of the pattern A at the input, ie. , a ..,. ·., a -, ... a γ.

Die ßezugs-'i/ortmuster B (n=0~9) werden im Speicher 63 gespeichert. Aus ihm wird das Bezugs-Wortmuster ή ausgelesen das von einem Signal n, das die Steuerschaltung 6ü abgibt, designiert wird. Dieses aus dem Speicher 63 ausgelesene itfort gelangt an die Recheneinheit 64. Sie dient der Berechnung des partiellen Ähnlichkeitsmaßes bis zu Sub-Schritt (1-3) (s. oben). Berechnet wird das Ähnlichkeitsmaß S (A(I, m),B) zwischen einem partiellen Huster A (1, m) des Musters A am Eingang und dem ßezugs-Wortmuster B. Eine Schaltung 65 dient der partiellen Erkennung. Sie führt die Sub-Schritte (1-4) und (1-5) mit den Ähnlichkeitsmaßen S (A(I, m),B) aus, die in der Recheneinheit 64 errechnet \vorden sind. Damit werden in der Schaltung 65 die partiellen Ähnlichkeitsmaße S<1, m> und die partiellen Erkennungsergebnisse n<.l, m> errechnet.The train location patterns B (n = 0 ~ 9) are stored in the memory 63. The reference word pattern ή, which is designated by a signal n which the control circuit 6u outputs, is read from it. This itfort read out from the memory 63 reaches the arithmetic unit 64. It is used to calculate the partial degree of similarity up to sub-step (1-3) (see above). The degree of similarity S (A (I, m), B) between a partial cough A (1, m) of the pattern A at the input and the train word pattern B is calculated. A circuit 65 is used for partial recognition. It executes the sub-steps (1-4) and (1-5) with the similarity measures S (A (I, m), B), which are calculated in the arithmetic unit 64. The partial similarity measures S <1, m> and the partial recognition results n <.l, m> are thus calculated in the circuit 65.

- 22 -- 22 -

609838/0897609838/0897

Die partiellen Ähnlichkeitsmaße S<l,m > für die Werte 1 und ni werden im Speicher 66 (Pufferspeicher) gespeichert. Die partiellen Endergebnisse für 1 und m werden im Speicher gespeichert. Die End-Recheneinheit 68 führt den oben beschriebenen zweiten Schritt des lirkennungsverfahrens durch. Die schließlich am linde vorgesehene Rechenschaltung 69 führt die Lrkennung des Ergebnisses im letzten Schritt aus. Sämtliche Operationen werden bei diesem Ausfuhrunjsbeispiel von der Steuereinheit GO gesteuert.The partial similarity measures S <l, m> for the values 1 and ni are stored in the memory 66 (buffer memory). The final partial results for 1 and m are stored in memory. The end processing unit 68 carries out the above-described second step of the detection method. The computing circuit 69 finally provided on the linde carries out the identification of the result in the last step. In this exemplary embodiment, all operations are controlled by the control unit GO.

Die partiellen Ähnlichkeitsmaße S<l,m«> , die im Speicher 66 gespeichert sind, werden von einem Steuersignal el, das von der Steuerschaltung 60 abgegeben wird, gesteuert. Der Speicher 63 liest die ;33Zugs-V/brtmuster ö (B , B , B , ... oder 3 ), die durch das Signal η designiert worden sind, ebenfalls aus. .Ferner gibt die Steuerschaltung 60 ein Signal 1, das den Startpunkt im partiellen Muster A (1, m) angibt, an die Recheneinheit 64 und an die Schaltung 65 ab, in der die partielle Erkennung durchgeführt wird. Die Recheneinheit 64 berechnet die Ähnlichkeitsmaße S (A (1, m), Bⁿ) zwischen dem partiellen Muster A (1, m) des Musters A am Eingang und dem Bezugs-iiortmuster B, indem sie den Sub-Schritt (1-3) (s.oben) durchführt. Die Ähnlichkeitsmaße S (A (1, m) ,Bⁿ) gelangen von der Recheneinheit 64 an die Schaltung 65; von der Steuerschaltung 60 gelangen an die Schaltung 65 die Signale n, 1 und m,The partial similarity measures S <l, m <>, which are stored in the memory 66, are controlled by a control signal el which is output by the control circuit 60. The memory 63 also reads out the 33 train V / brtmuster δ (B, B, B, ... or 3) which have been designated by the signal η. Furthermore, the control circuit 60 outputs a signal 1, which specifies the starting point in the partial pattern A (1, m), to the arithmetic unit 64 and to the circuit 65 in which the partial recognition is carried out. The arithmetic unit 64 calculates the similarity measures S (A (1, m), B ⁿ ) between the partial pattern A (1, m) of the pattern A at the input and the reference location pattern B by performing the sub-step (1-3 ) (see above). The similarity measures S (A (1, m), B ⁿ ) pass from the computing unit 64 to the circuit 65; The signals n, 1 and m pass from the control circuit 60 to the circuit 65,

- 23 -609838/0897 - 23 - 609838/0897

Aus dem Speicher 66 gelangen die gespeicherten Äimlichkeitsmaße S<1, m> an die Schaltung 65. Die Schaltung 6 5 führt dann die Sub-Schritte (1-4) und (1-5) (s. oben) durch und bringt den Inhalt von S<I,m > und n<»l, m> der Speicher 66 DZVi. 67, wie er sich aus diesen Sub-Schritten ergibt, auf den neuesten Stand. Ist dieses Verfahren ffIr alle tferte von 1 beendet, dann wird das Signal n, das von der Steuerschaltung abgegeben wird, nun seinerseits jeweils um 1 von O bis 9 verändert. Mit dein Ad Schluß dieses Verfahrens für den Wert von η gleich 9 ist der gesamte erste Verfahrensschritt beendet. Bei iieendigunij dieses ersten Schrittes erzeugt die Steuerschaltung 60 Signale U und I-,, die die Beendigung des ersten Verfahrensschrittes bzw. die Dauer (die Azahl der flerkmalsvektoren) des "Musters am Hingang anzeigen. Diese Signale werden an die Hrkennungs-Endschaltung 69 weitergegeben.The stored similitude measures S <1, m> pass from the memory 66 to the circuit 65. The circuit 65 then carries out the sub-steps (1-4) and (1-5) (see above) and brings the content from S <I, m> and n <»l, m> the memory 66 DZVi. 67, as it results from these sub-steps, is up to date. If this method has ended for all times from 1, then the signal n, which is emitted by the control circuit, is in turn changed by 1 from 0 to 9 in each case . With the end of this process for the value of η equal to 9, the entire first process step is ended. When this first step is completed, the control circuit 60 generates signals U and I-, which indicate the completion of the first process step or the duration (the number of characteristic vectors) of the pattern at the entrance.

Der End-Recheneinheit werden die partiellen Ähnlichkeitsmaße S<l,m> zugeführt; sie berechnet aus ihnen den Rekursionsausdruck nach Gleichung (2 3) und damit h (m) aus dem partiellen Ähnlichkeitsmaß S<l,m ?■ . Die End-Recheneinheit 68 berechnet ferner den Rekursionsausdruck nach Gleichung (2 6) und damit h (m) und daraus die Anzahl Y der Worte und der Unterbrechungspunkte 1 , -v » The partial similarity measures S <1, m> are fed to the final processing unit; from them it calculates the recursion expression according to equation (2 3) and thus h (m) from the partial similarity measure S <l, m ? ■ . The end processing unit 68 also calculates the recursion expression according to equation (2 6) and thus h (m) and from this the number Y of words and the breakpoints 1 , -v »

Die Erkennungs-Endschaltung 69 erkennt schließlich das ursprünglich am Eingang aufgetretene Muster A als solches. Es.The recognition end circuit 69 finally recognizes this originally Pattern A occurring at the entrance as such. It.

- 24 609838/ηΒΠ7 - 24 609838 / ηΒΠ7

ist derjenige Ausdruck nach Gleichung (2 7), in dem 1, . und n<l,m > angegeben sind.is the expression according to equation (2 7) in which 1,. and n <l, m> are given.

Die Recheneinheit 64 zur Berechnung der Ähnlichkeitsmaße wird im folgenden unter Bezugnahme auf Fig. 6 im Detail beschrieben. Sie besteht aus einem Speicher 41, der als Pufferspeicher für die Bezugs-Wortmuster dient, einem Rechner 642 zur Berechnung der Ähnlichkeitsmaße, einem Speicher 643, einem Rekursionsrechner 644, einem Operationsregister 645, sowie einer Steuerschaltung 64ü.The arithmetic unit 64 for calculating the similarity measures is described in detail below with reference to FIG described. It consists of a memory 41, which serves as a buffer memory for the reference word patterns, a computer 642 for calculating the similarity measures, a memory 643, a recursion calculator 644, an operation register 645, and a control circuit 64ü.

:)as Bezugs-!/ort muster ii, das aus dem Speicher 63 (Fig. 5) kommt, wird im Speicher 641 gespeichert. Dem Rechner 642 wird von Pufferspeicher 62 das Muster A und vom Speicher 641 das ßezugs-vVortmuster B zugeführt. Er berechnet die Ähnlichkeits-:) The reference! / location pattern ii coming from the memory 63 (FIG. 5) is stored in the memory 641. The computer 642 is supplied with the pattern A from the buffer memory 62 and the ßzug-Vormuster B from the memory 641. He calculates the similarity

Q
maße S (a., b.) = JT (^ap- x bp.) zwischen den MerkmalsvektorenQ
measures S (a., b.) = JT ( ^a p- x bp.) between the feature vectors

P=1
a- und b- für alle Kombinationen (i, j), die den Beziehungen 1 = i = I und 1 = j = J genügen. Die Errechneten Ähnlichkeitsmaße S (a., b.) werden im Speicher 643 gespeichert. Der Speicher liest für die Kombination (i,j), die von den Signalen i und j als Signal 3 designiert wird, die Ähnlichkeitsmaße S (a^, b·) aus. An den Rekursionsrechner 644 gelangt das Signal S aus dem Speicher 643 und die Signale g*, g? und g, aus den Operationsregister 645; der Rekursionsrechner 644 berechnet damit die Summe g_Q des Signals S und dasjenige der Signale g« , g₂, g^,P = 1
a- and b- for all combinations (i, j) that satisfy the relationships 1 = i = I and 1 = j = J. The calculated similarity measures S (a., B.) Are stored in the memory 643. The memory reads out the similarity measures S (a ^, b ·) for the combination (i, j) which is designated by the signals i and j as signal 3. The signal S from the memory 643 and the signals g *, g? and g, from the operation register 645; the recursion calculator 644 thus calculates the sum g _{Q of} the signal S and that of the signals g «, g ₂ , g ^,

- 25 -· 609838/0897- 25 - 609838/0897

das ein Maximum ist. Er berechnet den folgenden Ausdruck:that is a maximum. It calculates the following expression:

^gO ^{= S}(^ai» ^aj) ^{+ max} / Si* S₂, S3 7 (²⁸) ^g O ^{= S} ( ^a i » ^a j) ^{+ max} / Si * S ₂ , S3 7 ( ²⁸ )

Die Summen g~ lierden im Operationsregister 645 gespeichert.The sums are stored in the operation register 645.

Von der Steuerschaltung 646 wird ein Steuersignal an das Operationsregister 645 abgegeben. Dadurch wird die Anfangsbedingung gemäß Ausdruck (16) eingestellt bzw. gesetzt. Die Steuerschaltung 646 erzeugt ein Signal j.., das von j = 1 zunimmt, sowie ein Signal i., das innerhalb des Bereichs nach Ausdruck (19) zunimmt. Das Ähnlichkeitsmaß S (a-, b.)> das der Kombination (i, j,), die von den Signalen i- und j- designiert wird, entspricht, wird aus dem Speicher 643 ausgelesen. Auf die Kombination (i, j) hin, die durch die Signale i* und j., designiert ist, erzeugt das Operationsregister 645 Signale Si» g₂» ^g3> ^d^^e durch die GleichungenA control signal is output from the control circuit 646 to the operation register 645. This sets the initial condition according to expression (16). The control circuit 646 generates a signal j .. which increases from j = 1 and a signal i. Which increases within the range of expression (19). The degree of similarity S (a-, b.)> Which corresponds to the combination (i, j,) which is designated by the signals i- and j- is read out from the memory 643. In response to the combination (i, j) designated by the signals i * and j., The operation register 645 generates signals Si »g ₂ » ^g 3> ^d ^ ^e by the equations

g-l = g(i-1, j)
g₂ = ß(i-1, j-1)
S₃ = sCi-1, J-2)gl = g (i-1, j)
g ₂ = ß (i-1, j-1)
S ₃ = sCi-1, J-2)

gegeben sind. Der P^ekursionsrechner 644 berechnet nun den Rekursionsausdruck nach Gleichung (28) aus den Signalen S, Si» S₂* S3· nas bedeutet, daß der Rekursionsrechner den Rekursionsausdruck nach Ausdruck (15) berechnet. Der errechnete Wert g_Q wird in das Operationsregister 645 ein-given are. The recursion calculator 644 now calculates the recursion expression according to equation (28) from the signals S, Si »S ₂ * S3 · nas means that the recursion calculator calculates the recursion expression according to expression (15). The calculated value g _Q is entered in the operation register 645

- 26 -- 26 -

609838/0897609838/0897

geschrieben. Durch Variation von i und j bis i=I und j=Jwritten. By varying i and j to i = I and j = J

erhält man das Ähnlichkeitsmaß S (A (1, m), oⁿ) - gCl, J).one obtains the similarity measure S (A (1, m), o ⁿ ) - gCl, J).

Anstelle des Rekursionsausdrucks nach (15) kann man folgenden Rekursionsausdruck verwenden:Instead of the recursion expression according to (15) one can use the following recursion expression:

Ka-, b.) + gCi-1, j)Ka-, b.) + GCi-1, j)

SCa₁, bj) + g (i-1, j-1) SCa₁, bj) + 3 C i, j-1)SCa ₁ , bj) + g (i-1, j-1) SCa ₁ , bj) + 3 C i, j-1)

= nax= nax

b..) + SCa₁-I, b.) bj) + g(i-l, j-1) b.) + SCa₁, bpl)b ..) + SCa ₁ -I, b.) bj) + g (il, j-1) b.) + SCa ₁ , bpl)

+ g(i-l, j-2)+ g (i-l, j-2)

Beim ersten Ausführungsbeispiel brauchen die Speicher 66 und 67 eine relativ große Kapazität. Ihre Kapazität MA entspricht der Fläche des gestrichelten .Bereichs in Fig. 4 und istIn the first embodiment, the memories need 66 and 67 a relatively large capacity. Their capacitance MA corresponds to the area of the dashed area in Fig. 4 and is

MA "^Ix C^axMA "^ Ix C ^ ax

Im Falle gesprochener Ziffern ergeben sich folgende typische Beispiele::In the case of spoken digits, the following typical examples result:

inax
ηinax
η

= 25 ,= 25,

= 15= 15

= 7= 7

I = 70 im Falle von 4 digitalen Ziffern.I = 70 in the case of 4 digital digits.

609838/0897609838/0897

- 27 -- 27 -

Das ergibt eine Speicherkapazität MA von 1680 Worten für jeden Speicher. Die Pufferspeicher 66 und 67 benötigen daher eine Kapazität von 3 360 (=1680 χ 2) Worten.This results in a storage capacity MA of 1680 words for each store. The buffer memories 66 and 67 therefore require a capacity of 3 360 (= 1680 χ 2) words.

Beim Ausführungsbeispiel werden die Älmlichkeitsmaße entlang der Linie 10 (Fig· 4) berechnet; entlang dieser Linie ist 1 im ersten Verfahrensschritt konstant. Das berechnete partielle Ähnlichkeitsmaß wird dann entlang Linie 11 (Fig. 4) weiter verarbeitet; entlang dieser Linie ist m konstant. Das erfolgt in dem zweiten Verfahrensschritt. Dies erfordert die Speicherung aller s<l,m> innerhalb des schraffierten Bereiches. Andererseits liegt bei der Berechnung des Ausdrucks nach: Gleichung (25) das partielle Ähnlichkeitsmaß S"*h,ia>. für einen Punkt m entlang der Linie 11. Diese Tatsache wird beim zweiten Ausführungsbeispiel ausgenutzt, um die Speicherkapazität zu verringern.In the exemplary embodiment, the similarity measures are along calculated from line 10 (FIG. 4); along that line is 1 constant in the first process step. The calculated partial similarity measure is then taken along line 11 (Fig. 4) further processed; m is constant along this line. This takes place in the second process step. This requires the storage of all s <l, m> within the hatched area. On the other hand, lies in the calculation of the expression according to: equation (25) the partial similarity measure S "* h, ia>. for a point m along the line 11. This fact is used in the second embodiment to increase the storage capacity to reduce.

Beim zweiten Ausführungsbeispiel werden die Ähnlichkeitsmaße S<l,m> ira ersten Verfahrensschritt entlang der Linie 11 berechnet. Ist das partielle Ähnlichkeitsmaß S<·l,m > bezüglich eines Punktes von m berechnet worden, dann wird der Rekursionsausdruck (2 3) für den zweiten Verfahrensschritt berechnet. Daher erfordert der Speicher zur Speicherung der partiellen Ähnlichkeitsmaße S<-l,m >· eine nur kleine Kapazität, die ausreicht, um S<l,m> innerhalb des Bereiches der Linie 11 zuIn the second exemplary embodiment, the similarity measures S <1, m> are calculated in the first method step along the line 11. If the partial similarity measure S <· 1, m > has been calculated with respect to a point of m, then the recursion expression (2 3) is calculated for the second method step. Therefore, the memory for storing the partial similarity measures S <-l, m> · requires only a small capacity, which is sufficient to store S <l, m> within the range of the line 11

- 28 -- 28 -

609838/0897609838/0897

speichern. Gleichermaßen kann die Kapazität des Speichers für die partiellen Krkennungsergebnisse η < l,m ? reduziert werden,to save. Likewise, the capacity of the memory for the partial identification results can be η <l, m ? be reduced,

Bei diesem Ausführungsbeispiel ist es für die Berechtigung des Rekursionsausdrucks (15) notwendig, seine Zeitachsen i und j zu vertauschen, d.h. es ergibt sich:In this embodiment it is for the authorization of the Recursion expression (15) is necessary to swap its time axes i and j, i.e. it results:

die Anfangsbedingung: g(I,J)=S(C₁, bj) (29)the initial condition: g (I, J) = S (C ₁ , bj) (29)

Rekursionsausdruck:Recursion expression:

g(i,j)=S(C.,b.)+ maxg (i, j) = S (C., b.) + max

Die Berechnung der Ausdrücke (29) und (30) wird im folgenden an Hand von Fig. 7 beschrieben. Ein Punkt 40 hat das Wertepaar (m,J) und gibt die Anfangsbedingung (29). Die geraden Linien 42 und 43 entsprechen dem Ausdruck (18) und werden durch folgende Beziehungen angegeben:The calculation of expressions (29) and (30) is described below with reference to FIG. A point 40 has the pair of values (m, J) and gives the initial condition (29). The straight lines 42 and 43 correspond to expression (18) and become indicated by the following relationships:

j ₊ m - J_n -j ₊ m - J _n -

i - j + m -i - j + m -

(31)(31)

Daher wird die Berechnung des Rekursionsausdrucks (30) in dem Bereich zwischen den Linien 42 und 43, der sich aus der Beziehung (31) ergibt, ausgeführt. Hat die Berechnung j = 1 erreicht, so liegen alle g (1,1) für i=l im Bereich 21 vor.Therefore, the computation of the recursion expression (30) in the area between the lines 42 and 43, which results from the relationship (31). Does the calculation have j = 1 is reached, then all g (1,1) for i = l are in range 21.

- 29 -- 29 -

609838/G897609838 / G897

Ist nunIs now

g(l,1) = S(A(l,m),Bⁿ),g (l, 1) = S (A (l, m), B ⁿ ),

so ergibt sich der Bereich von 1 durch die Punkte 44 und 45, d.h. er entspricht dem Ausdruck (31) für j=1 und wird die Beziehung gegeben:the range of 1 results from points 44 and 45, i.e. it corresponds to expression (31) for j = 1 and becomes the Relationship given:

= 1= 1+Hi-J_n+ ^ (32)= 1 = 1 + Hi-J _n + ^ (32)

Die gleichen Berechnungen werden mit allen B (η=Ο·**9) durchgeführt; so erhält man die partiellen Ahnlichkeitsmaße S <l,m> bezüglich Werten von 1, die der Gleichung (32) genügen. Es istThe same calculations are carried out with all B (η = Ο · ** 9); so one obtains the partial similarity measures S <l, m> with respect to values of 1 that satisfy the equation (32). It is

S< l,m> = max S(A(I, m) , Bⁿ). (33)S <l, m> = max S (A (I, m), B ⁿ ). (33)

Man erhält dann die folgenden partiellen Erkennungsergebnisse n< l,m > :The following partial recognition results n <are then obtained l, m>:

η < l,m> = argmax S(A(I,m), Bⁿ) (34)η <l, m> = argmax S (A (I, m), B ⁿ ) (34)

So erhält man die partiellen Ähnlichkeitsmaße S < l,m > und die partiellen Erkennungsergebnisse n<l,m > entlang Linie in Fig. 4.The partial similarity measures S <l, m > and the partial recognition results n <l, m > along the line in FIG. 4 are thus obtained.

Aus den partiellen Ähnlichkeitsmaßen S< l,m > und den partiellen Erkennungsergebnissen η < l,m > ergibt sich der Rekursionsausdruck (2 3) derart, daß man T (m) und N (m) berechnet, für die gilt:From the partial similarity measures S <l, m> and den partial recognition results η <l, m> the recursion expression (2 3) results in such a way that T (m) and N (m) calculated for which the following applies:

N (m) = η < h(m), m > (35)N (m) = η <h (m), m> (35)

- 30 609838/0897 - 30 609838/0897

261ΓΗ39261-39

In dem letzten "Verfahrensschritt findet die Erkennung auf der Grundlage der Funktionen h(i-i) und N(r-i) statt. In anderen Worten: Die Berechnung des Rekurs ionsaus drucks ist gegeben durchIn the last "procedural step, the recognition takes place based on the functions h (i-i) and N (r-i). In other Words: The computation of the recursion expression is given by

h = h(ra)h = h (ra)

N = N(m) (36)N = N (m) (36)

m = h m = h

.Diese Berechnung wird von der AnfangsbedingungThis calculation is based on the initial condition

m = 1 (37) -m = 1 (37) -

D is m = 0 durchgeführt.D is m = 0 carried out.

Fig. 8 zeigt ein zweites Ausführungsbeispiel, Es enthält die Eingangsschaltung 71, den Pufferspeicher 7 2 für das Muster am Eingang, sowie den Speicher 7 3 für das Bezugswort-Muster. Diese Bauteile entsprechen den Bauteilen 61 bis 63 nach Fi.g Der Speicher 74 speichert die Dauer J (n = 0,...,9) des Bezugsmusters und liest die designierte Dauer J bei Auftreten des Signals η aus. Ein erster Rechner 75 berechnet den Rekursionsausdruck (15). Ein erstes Operationsregister 76 speichert die Terme g(i,j) in Gleichung (15). Ein Speicher 7 7 speichert die partiellen Älinlichkeitsmaße S<l,m > gemäß Ausdruck (33). Ein Speicher 78 speichert die partiellen Erkennungsergebnisse η <l,m> nach Ausdruck (34). Die Speicher 77 und 78 haben eindimensionale Adressen. An der Adresse "1" der Speicher 7 7 und 78 werden die partiellen Ähnlichkeitsmaße S<l,m > und die partiellen Erkennungsergebnisse η < l,m> gespeichert.Fig. 8 shows a second embodiment, It contains the Input circuit 71, the buffer memory 7 2 for the pattern at the input, and the memory 7 3 for the reference word pattern. These components correspond to the components 61 to 63 according to FIG. The memory 74 stores the duration J (n = 0, ..., 9) des Reference pattern and reads the designated duration J when it occurs of the signal η. A first computer 75 calculates the recursion expression (15). A first operation register 76 stores the terms g (i, j) in equation (15). A memory 7 7 stores the partial linearity measures S <l, m > according to expression (33). A memory 78 stores the partial recognition results η <l, m> according to expression (34). The memories 77 and 78 have one-dimensional addresses. At the address "1" of the memories 7, 7 and 78, the partial similarity measures S <1, m > and the partial recognition results η <l, m> are stored.

- 31 609838/0897 - 31 609838/0897

2 6 1 η 4 32 6 1 η 4 3

Jiin Vergleicher 79 berechnet das partielle Xhnlichkeitsiaaß S < l,m > .In comparator 79, the partial similarity value is calculated S <l, m>.

Hin zweiter .Rechner 80 berechnet die Rekurs ions aus drücke (2 3) und (35). Bin Speicher 81 speichert T(m), das man nach (23) erhält. Bin Speicher 82 speichert h(m). Bin Speicher 83 speichert M(m), das man gemäß Ausdruck (3S) erhält. Die Erkennungs-Schaltung 84 berechnet die Ausdrücke (36) und (37), und damit das Endergebnis. Die Steuerschaltung 80 steuert die Funktion der einzelnen Bauteile.The second computer 80 calculates the recursion expressions (2 3) and (35). Bin 81 stores T (m), which can be found according to (23) receives. A memory 82 stores h (m). A memory 83 stores M (m) obtained from Expression (3S). the Detection circuit 84 calculates expressions (36) and (37), and with it the bottom line. The control circuit 80 controls the Function of the individual components.

Wird von der Eingangs schaltung 71 das Ende des Musters A am Eingang festgestellt, dann wird ein Signal I, das die Dauer des. Musters A am Eingang repräsentiert, an die Steuerschaltung 70 abgegeben. Ein Zähler in Steuerschaltung 70 wird auf 1 gesetzt; sein Zählerstand gibt den Endpunkt m des partiellen Musters A(I, m) an. Hat der Endpunkt m einen niedrigen Viert, d.h. liegt er links vom Punkt 90 in Fig. 9, so liegt der Startpunkt im negativen Bereich. Daher beginnt die Operation dieses Systems nicht, bevor der Wert m den Punkt 90 überschritten hat, Der Wert für Punkt 90 auf der i-Achse ist definiert durchIf the input circuit 71 detects the end of the pattern A at the input, then a signal I, the duration Des. Pattern A represents at the input, output to the control circuit 70. A counter in control circuit 70 is set to 1 set; its count indicates the end point m of the partial pattern A (I, m). If the end point m has a low fourth, i.e., to the left of point 90 in Figure 9, is the starting point in the negative area. Therefore, the operation of this system does not begin until the value m has exceeded point 90, The value for point 90 on the i-axis is defined by

m = mmm = mm

überschreitet der Wert den von m, wie er in Gleichung (38) definiert ist, so werden die folgenden Operationen für alleif the value exceeds that of m, as shown in equation (38) is defined so do the following operations for all

ORiGiNAL INSPECTEDORiGiNAL INSPECTED

- 32 -- 32 -

6 0 9 B 3 R / Π Β 9 76 0 9 B 3 R / Π Β 9 7

Werte von m durchgeführt. Die Steuerschaltung 70 variiert dann das Signal η entsprechend den Ziffern 0 bis 9. Je nach der designierten Zahl η werden dann die ersten beiden Verfahrensschritte durchgeführt. Values of m carried out. The control circuit 70 then varies the signal η corresponding to the digits 0 to 9. Depending on the designated number η, the first two method steps are then carried out.

Das Signal J, das die Zeitdauer J des Bezugs-Wortnmsters J3ⁿ angibt, wird bei Auftreten von η erzeugt. Zur selben Zeit wird das Beζugs-Wortmuster Bⁿ als Signal aus dem Speicher 7 3 ausgelesen. Das Signal ß wird zeitlich umgekehrt ausgelesen, d.h. in der Reihenfolge bj, bj_^, ..., b^, b.. Dann wird der Merkmalsvektor a- (i = m) aus Speicher 7 2 ausgelesen und dem ersten Rechner 75 zugeführt. In ihm erfolgt die Berechnung des Rekursionsausdrucks (30). Dabei dient das Operationsregister 76 als Sub-Speicher. Der erste Rechner 7 5 und das Operationsregister 76 können die der Recheneinheit 64 nach Fig. 5 sein. Ist die Berechnung des Rekursionsausdrucks (30) beendet, dann erhält man die Ähnlichkeitsmaße S(A(I, m), Bⁿ) für den durch Gleichung (52) gegebenen Bereich. Der Inhalt des Speichers 77 wird von einem Signal el., das die Steuerschalung 70 vor Beginn der Operation abgibt, auf 0 zurückgestellt. Jedes Ähnlichkeitsmaß S(A(I, ra) , 3ⁿ) wird mit dem Inhalt des Speichers 77 verglichen. Das jeweils größere wird dann eingeschrieben; dadurch wird die Berechnung des Ausdrucks (33) durchgeführt. Das bedeutet: Das Ähnlichkeitsmaß (S(A(I, m), B¹¹), das vom Signal Im, das von der Steuerschaltung 70 abgegeben wird, designiert wird, wird aus dem Operationsregister 76 als Signal g_Q = S < l,m > , dasThe signal J, which indicates the duration J of the reference word window J3 ⁿ , is generated when η occurs. At the same time, the inflection word pattern B ^{n is} read out as a signal from the memory 73. The signal β is read out reversed in time, ie in the order bj, bj_ ^, ..., b ^, b .. Then the feature vector a- (i = m) is read from memory 7 2 and supplied to the first computer 75. The recursion expression (30) is calculated in it. The operation register 76 serves as a sub-memory. The first computer 75 and the operation register 76 can be those of the arithmetic unit 64 according to FIG. 5. When the computation of the recursion expression (30) has ended, the similarity measures S (A (I, m), B ⁿ ) are obtained for the range given by equation (52). The content of the memory 77 is reset to 0 by a signal el., Which the control circuit 70 emits before the start of the operation. Each similarity measure S (A (I, ra), 3 ⁿ ) is compared with the content of the memory 77. The bigger one is then registered; thereby performing the calculation of expression (33). This means: The degree of similarity (S (A (I, m), B ¹¹ ), which is designated by the signal Im, which is output by the control circuit 70, is obtained from the operation register 76 as the signal g _Q = S <1, m >, that

~33-609838/0Π97 ~ 33-609838 / 0Π97

an der Adresse "1" des Speichers 7 7 gespeichert ist, als Signal S₁ aus dem Speicher 3 7 ausgelesen. i)er Vergleicher 39 erzeugt nur dann einen Einschreib impuls wri, wenn das Signal }Zq großer als das Signal S.. ist. Liegt der einschreib inipuls wri vor, dann wird das Signal g_Q, d.h. S(A(I, m) Bⁿ) an der Adresse ¹¹I" im Speicher 7 7 als neues Ähnlichkeitsmaß 3 < l,m J> und der entsprechende iVert von η wird an der Adresse "1" des Speichers als neues n<l,m >· eingelesen. Ist die Berechnung also dann durch Variation von 1 innerhalb des Bereiches nach (32) und unter Variation von η = O bis η = 9 durchgeführt, dann wird das partielle Ähnlichkeitsmaß S<l,m > und die partiellen Erkennungsergebnisse n<l,m> in den Speicher 77 bzw. 78 gespeichert. Dann gibt die Steuerschaltung 70 an den ziieiten Rechner 80 ein Zeiteinstellsignal t. ab. Nun wird der Inhalt des Speichers 81 von dem Signal cl₂, das von der Steuerschaltung 70 im Startpunkt des Musters am Eingang erzeugt wird, auf 0 zurückgestellt. Der zweite Rechner 80 berechnet dann den Ausdruck (23) für das m, das von Signal m₁, welches von der Steuerschaltung 70 bei Auftreten des Zeiteinstellsignals t. abgegeben wird, designiert worden ist.is stored at address "1" of memory 7 7, _{read out as signal S 1} from memory 3 7. i) the comparator 39 generates a write-in pulse wri only when the signal } Zq is greater than the signal S ... If the write-in pulse wri is present, then the signal g _Q , ie S (A (I, m) B ⁿ ) at the address ¹¹ I "in the memory 7 7 as a new measure of similarity 3 < 1, m J> and the corresponding iVert from η is read in at address "1" of the memory as a new n <1, m> · If the calculation is then carried out by varying 1 within the range according to (32) and varying from η = 0 to η = 9 , then the partial similarity measure S <l, m> and the partial recognition results n <l, m> are stored in the memories 77 and 78. Then the control circuit 70 outputs a time setting signal t to the second computer 80. Now the content, reset of the memory 81 from the signal cl _2, which is generated by the control circuit 70 at the start point of the pattern at the entrance to 0. the second computer 80 then calculates the expression (23) m for which m of signal _1, which is output by the control circuit 70 upon occurrence of the time setting signal t. , has been designated.

Der zweite Rechner 80 wird nun an Hand von Fig. 10 beschrieben. Er enthält eine Steuerschaltung 800, ein Register 810, einen Addierer 8 20 und einen Vergleicher 8 30. Das Register wird von einem Rückstellsignal el, auf 0 zurückgestellt, das,The second computer 80 will now be described with reference to FIG. It contains a control circuit 800, a register 810, an adder 8 20 and a comparator 8 30. The register is reset to 0 by a reset signal el, which,

- 34 609838/0897 - 34 609838/0897

2 61 Π 4 32 61 Π 4 3

zeitlich gesteuert von dem Einstellsignal t.. , in der Steuerschaltung 8ÜO erzeugt wird. Der Inhalt des Speichers 7 7 an der Adresse "ii", d.h. S < Ii, ra-> , und der Inhalt des Speichers 73 an der Adresse "h", d.h. η < h,m > , werden als Signale $2 bzw. n₂ ausgelesen. Man erhält dann den Inhalt des Speichers 81 an der Adresse "h", d.h. T(h), als Signal T₁. ])ie Signale T. und S₂ werden im Addierer 3 20 addiert. Man erhält so das Ergebnis S(h+m) + T(h) als Signal X. Das Register 810 und der Vergleicher 8 30 gleichen im Aufbau dem Speicher 7 7 bzw. dem Vergleicher 79, d.h. sie errechnen die Ilaxircierung des Ausdrucks (23). Das heißt: das Signal X als Ausgang des Addierers 820 wird mit dem Inhalt Z des Registers 810 verglichen; ein Einschreibimpuls wr2 wird lediglich dann erzeugt, wenn X>Z ist. Der Einschreibimpuls wr2 bewirkt, daß das Signal X in das Register 810 und an der Adresse 'Ή¹¹ in den Speicher 81 eingeschrieben wird. Die Signale n₂ und h werden in die Speicher 82 und 8 3 jeweils an der Adresse "in" eingeschrieben, die vom Signal n₂ designiert wird. So werden T(r;i), h(m) und K(m) an der Adresse "m¹¹ in den Speichern 81, 82 bzw. 83 gespeichert.timed by the setting signal t .., is generated in the control circuit 8ÜO. The content of the memory 7 7 at the address "ii", ie S <Ii, ra->, and the content of the memory 73 at the address "h", ie η <h, m>, are generated as signals $ 2 and n, respectively ₂ read out. The content of the memory 81 is then obtained at the address “h”, ie T (h), as signal T ₁ . ]) The signals T. and S ₂ are added in the adder 3 20. The result S (h + m) + T (h) is thus obtained as signal X. The structure of the register 810 and the comparator 8 30 is identical to the memory 7 7 and the comparator 79, ie they calculate the Ilaxircation of the expression (23 ). That is to say: the signal X as the output of the adder 820 is compared with the content Z of the register 810; a write-in pulse wr2 is generated only when X> Z. The write-in pulse wr2 has the effect that the signal X is written into the register 810 and into the memory 81 ^{at the address' Ή 11.} The signals n ₂ and h are written into the memories 82 and 8 3 each at the address "in" which is designated by the signal n ₂ . Thus, T (r; i), h (m) and K (m) at the address "m ¹¹ are stored in the memories 81, 82 and 83, respectively.

Der Rechenvorgang ist damit für einen bestimmten Punkt m abgeschlossen. Das Signal wird dann um 1 vermehrt. Dieselbe Rechnung wird solange wiederholt, bis das m den Wert I hat.The calculation process is thus completed for a specific point m. The signal is then increased by 1. Same The calculation is repeated until the m has the value I.

- 35 609838/0897 - 35 609838/0897

1st der Re cli en Vorgang für m = I beendet, dann wird die
Lrkeimunrs schaltung 84 von Zoiteinstellsignal t-,, das die
Steuerschaltung 30 erzeugt, gestartet. JIe in diesem Zustand in den Speichern 82 und Γ,3 gespeicherten Oaten h(u) und A(Hi) sind die für in = 1 - I.· Die lirkennungsschaltung 84 enthält
Schaltmittel, die ein Signal m-, erzeugen, das in designiert. Das Signal m.. gelangt an die Speicher 82 und 35, die die
Daten h(:-".i) und .-J(i.l)dann an die Schalteinheit 84 weiterleben. :^!ic Schalteinheit ü4 berechnet den Ausdruck (37) und erzeugt N als abschließendos Lrlcennun^ser^ebnis.If the recli en process for m = I is ended, then the
Lrkeimunrs circuit 84 of Zoiteinstellsignal t- ,, which the
Control circuit 30 generated, started. The data h (u) and A (Hi) stored in this state in the memories 82 and Γ, 3 are those for in = 1 - I. · The detection circuit 84 contains
Switching means that generate a signal m-, which is designated in. The signal m .. reaches the memory 82 and 35, which the
Data h (: - ". I) and.-J (il) then live on to the switching unit 84.: ^! Ic switching unit 4 calculates the expression (37) and generates N as the final result.

Patentanspruch 1:Claim 1:
SADSAD

609838/Ü897609838 / Ü897

Claims

PATENTANWALTDr. fur. UWE DREISS 7 STUTTGART 1 Diplom-lngenieür, M. Sc. - / SchidcstraBe 2 2610439 «'*' Telefon (0711) 245734 Telegrammadresse UDEPAT j NAGHGEREICHT Anmelder: Mein Zeichen: Nippon Electric Co Ltd Ni - 810 55-1, Siiiba Gochome Minato-ku T ο k γ ο (Japan) απ,η,αιαζ., Prioritäten: 1) 12.3.1975 2) 31.10.1975 29 891/1975 132 003/1975 Japan Japan PatentanspruchPATENT ADVOCATE Dr. for. UWE DREISS 7 STUTTGART 1 Graduate engineer, M. Sc. - / SchidcstraBe 2 2610439 «'*' Telephone (0711) 245734 Telegram address UDEPAT j NAGHGEREICHT Applicant: My reference: Nippon Electric Co Ltd Ni - 810 55-1, Siiiba Gochome Minato-ku T ο k γ ο (Japan) απ, η , αιαζ., Priorities: 1) 3/12/1975 2) 10/31/1975 29 891/1975 132 003/1975 Japan Japan claim

1. Schaltungsanordnung zur automatischen Erkennung von Sprache, die aus kontinuierlich ausgesprochenen Worten besteht und bei der in einer Eingangsschaltung ein Sprachmuster erstellt wird, das durch eine zeitliche Folge von Herkmalsvektoren dargestellt wird, dadurch gekennzeichnet, daß das durch die Merkmalsvektoren (a- ; i = 1 , .·., I) dargestellte Sprachmuster A in einem ersten Speicher (62), und eine Anzahl (n) von Bezugs-Wortmustern (Bⁿ), ebenfalls jewOÜs dargestellt durch eine zeitliche Folge von Merkmalsvektoren (b.; j = 1, ..., J ) in einem zweiten Speicher (63) gespeichert wird, und daß die Bezugs-Wortmuster und partielle Muster (A(I, m)) an eine Recheneinheit (64) ausgelesen werden, wobei sich die partiellen Muster von einem ersten (1) bis zu einem zweiten Zeitpunkt (m; 1 = 1 = m = I) erstrecken und ebenfalls als zeitliche Folge von Merkmalsvektoren1. Circuit arrangement for the automatic recognition of speech, which consists of continuously spoken words and in which a speech pattern is created in an input circuit, which is represented by a time sequence of traditional vectors, characterized in that the feature vectors (a-; i = 1. ·., voice pattern I) shown a in a first memory (62), and a number (n) of reference word patterns (B ^n), also jewOÜs represented by a time sequence of feature vectors (b .; j = 1 , ..., J) is stored in a second memory (63), and that the reference word patterns and partial patterns (A (I, m)) are read out to an arithmetic unit (64), the partial patterns being derived from a first (1) to a second point in time (m; 1 = 1 = m = I) and also as a time sequence of feature vectors

- 2 609838/0897 - 2 609838/0897

J?J?

(a-, ₊i , cL-,₊2, ···, a·, ..., a ) dargestellt sind, und die Recheneinheit (64) zwischen den partiellen Mustern (A(I, m)) und dem Bezugs-Wortmuster (Bⁿ) mit dynamischer Programmierung die Ähnlichkeitsmaße (S(A(I, m), B )) berechnet, und daß eine Schaltung (65) das Maximum partieller Ähnlichkeitsmaße (S<.1, η > ) für eine Anzahl (n) von fVorten bildet und ein partielles Erkennungsergebnis (n < 1, m>) ermittelt, welches ein Wort in der Anzahl (n) der Worte darstellt, und für das das partielle ivhnlichlceitsmaß S(<1, m>) ermittelt worden ist, daß ferner ein Speicher (66) das partielle Äanlichkeitsmaß (S <1, m> ) und das für die genannten Zeitpunkte (1, r.i) ermittelte und durch das genannte Maximum der partiellen Ähnlichkeitsmaße gebildete partielle Erkennungsergebnis (n<l, m>) speichert, und daß ferner eine zweite Rechenschaltung (68) das aus einer Anzahl (Y) Worte bestehende und eine entsprechende Anzahl (Y-1) von Unterbrechungspunkten (l_ri>., I_n-,, ..., L₀, ..., Iv -lO aufweisende LU I^ J -Ό ^1- ■ J(a-, ₊ i, cL-, ₊ 2, ···, a ·, ..., a) are shown, and the arithmetic unit (64) between the partial patterns (A (I, m)) and the reference Word pattern (B ⁿ ) with dynamic programming calculates the similarity measures (S (A (I, m), B)), and that a circuit (65) calculates the maximum of partial similarity measures (S <.1, η > ) for a number ( n) forms from fVorts and determines a partial recognition result (n <1, m>), which represents a word in the number (n) of words and for which the partial similarity measure S (<1, m>) has been determined, that a memory (66) also stores the partial similarity measure (S <1, m>) and the partial recognition result (n <1, m>) determined for the mentioned times (1, ri) and formed by the mentioned maximum of the partial similarity measures , and that further a second arithmetic circuit (68) consists of a number (Y) words and a corresponding number (Y-1) of breakpoints (l _ri >., I _n - ,, ..., L ₀ , .. ., Iv -lO aufwe isende LU I ^ J -Ό ^1- ■ J

Sprachmuster (A) am Eingang in einer der Zahl (Y) der Worte gleiche Zahl partieller Sprachmuster (A(I^_v_-^, ifyO (^ ⁼ "I »2, ..., Y) aufteilt, und daß die erste Recheneinheit (64) bei Vorliegen des partiellen Ähnlichkeitsmaßes (S"^!, m>) und des partiellen ErkennungsergebnissesSpeech pattern (A) at the input into one of the number (Y) of words _{divides the same number of partial speech patterns (A (I ^ v} _- ^, ifyO (^ ⁼ "I» 2, ..., Y), and that the first Computing unit (64) when the partial similarity measure (S "^ !, m>) and the partial recognition result are present

609838/0897609838/0897

SoSo

(n<l, m} die partiellen Ähnlichkeitsmaße (S 0,1,..-.(n <l, m} the partial similarity measures (S 0,1, ..-.

für alle benachbarten £ombinationenfor all neighboring combinations

von Unterbrechungspunkten an eine weitere Rechensciialtuiig C69) ausliest, die das Maximum der partiellen Ähnlichkeits maße berechnet und aus den zugeordneten partiellen j£rkennungserg©bnissen (n<l, m >) die entsprechende Zahl (Y) der iiorte bildet.from breakpoints to a further arithmetic case C69), which calculates the maximum of the partial similarity measures and from the assigned partial j £ r identification results (n <l, m >) the corresponding number (Y) forms the iiorte.

09838/Q8.909838 / Q8.9

S3S3

LeerseiteBlank page