US2243525A - Production of artificial speech - Google Patents

Production of artificial speech Download PDF

Info

Publication number
US2243525A
US2243525A US324286A US32428640A US2243525A US 2243525 A US2243525 A US 2243525A US 324286 A US324286 A US 324286A US 32428640 A US32428640 A US 32428640A US 2243525 A US2243525 A US 2243525A
Authority
US
United States
Prior art keywords
key
resonance
speech
waves
circuit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US324286A
Inventor
Homer W Dudley
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
Bell Telephone Laboratories Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bell Telephone Laboratories Inc filed Critical Bell Telephone Laboratories Inc
Priority to US324286A priority Critical patent/US2243525A/en
Application granted granted Critical
Publication of US2243525A publication Critical patent/US2243525A/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00

Definitions

  • the present invention relates to the artificial production of speech or similar complex waves.
  • The-invention has particular reference to, and will be disclosed in connection with, a key-operdied or finger-operated mechanism iorbuilding up understandable speech waves from waves produced in suitable wave sources and constituting the raw material out of .which the speech is to be constructed.
  • An object of the invention is the artificial production of speech or similar waves in simplified and effective manner.
  • the present invention aims at a simplification in apparatus, pecially in the matter at the irequency selective devices used.
  • the present invention achieves such simplification by imitating more closely the process carried out by the hu-. man speech mechanism. While the system of my prior patent has operated successfully as an arusing, inplace of the large number of fixed filters the ,tip raised so as to provide two resonance principal portions. Moreover, as indicated above','
  • the positions of jaws, tongue and lips used for producing one sound effect change by a continuous movement to the positions necessary to produce the next sound effect, resulting in a continuous transition from one sound to'another, in a large number of cases.
  • the frequency controls are exercised by the air chamber resonances of the mouth comprising P ncipally two variable resonances, and changes aremade mostly by gliding from one sound to another as a result of continuous or gradual, of abrupt, changes.
  • the size of the total mouth cavity is varied lay-movement oi the lower jaw and the lips while the tongue modifies the mouth rosenances in a very important manner by shaping the mouth cavities.
  • the tongue by rising in the middle divides the mouth cavity into two principal chambers with air coupling between themfand the resonances 0! these two chambers are varied between wide limits by varying the respectiveair chamber sizes and degree figure.
  • Key I is the pitch control key
  • key II is of coupling.
  • the tongue may be drawn back and other.
  • v Fig l is a schematic circuit diagram of a. system for artificially producing speech sounds manually under the control of keys in accordance with one form of the invention
  • I 2 is a schematic circuit diagram similar to Fig. 1, as indi- 1 but with certain simplifications according to the invention
  • Fig. 3 is a diagram generally similar to Fig. 1 but showing a modified system according to the invention
  • I v Fig. 4 is a i'ragmentarycircuit schematic showing a modification that may be substituted in the system ofFig. 1, as indicated by the broken lines.
  • Fig. 1 a system for producing artificial speech is shown in which five keys or controls are used, numbered I, II, 111, .IV and V in the the key for selecting between the type of energy source (buzz or keys III and IV are the resonance controls and key V is the stop conso: nant key.
  • keys or controls numbered I, II, 111, .IV and V in the the key for selecting between the type of energy source (buzz or keys III and IV are the resonance controls and key V is the stop conso: nant key.
  • Theaource of .energy simulating the vocal cords and producing the buzz or complex tone wave is shown as a relaxation oscillator comprising gasfilled tube l and associated circuits. This may be of the type shown in United States patent to R. R. Piesz 2,183,248, issued December 12, 1939.
  • Equalizer 4' reduces all of the harmonics to the same amplitude or is designed to produce some other desired relation between them.
  • key I When key I is-in its upper position it makes contact at 2, giving a bias to the grid equal to the sum of the voltages of batteries 3' and 3", which is great enough to prevent production of oscillations.
  • contact As key I is tiepressed, contact is made at 2'.
  • the two parts of the resistance 3 and the resistance adjacent 3" act like a potentiometer so that when contact is made at 2' a-lesser part of the voltage of battery 3" is effective on the grid and the relaxation oscillator starts to oscillate at a low fundamental frequency.
  • a proportionately smaller voltage is applied from source 3" to the grid and the fundamental frequency of the oscillator is caused to increase.
  • the source of continuous spectrum or random frequency noise is shown as comprising gas tube 5, followed by amplifier 8, and equalizer l' for making the energy distribution constant over the utilized frequency band or otherwise suitabLv shaping it.
  • Energy source key II in its upper position connects oscillator l to the conductor 8 and in its lower position it connects the random noise source to conductor 8.
  • the other side of the respective source is shown grounded.
  • Fig. 1 shows use of a ground return although thewgrounds could be replaced by a conductor, if desired.
  • keys III and IV The function of keys III and IV is to control the tuning of the two selective circuit branches shown included in conductor 8.
  • the resonant circuit controlled by key III includes inductances I and II, resistance II and capacity If.
  • the resonant circuit controlled by key IV includes In considering the operation of the circuit of Fig. 1, it will be convenient to have certain frequency relations in mind. It will be assumed for illustration that the random noise covers the human speech band and that for production of normal speech eflects the relaxation oscillator circuit can have its fundamental frequency varied resonance and resistances II and I8 may be proportioned to give the desired degree of flatness even to the possibility of having them vary with the inductances under control of extra arms on keys III and IV not shown.
  • circuit 9, l0, II, II when circuit 9, l0, II, II is adjusted to resonance at 1,000 cycles, it may give ,6 decibels greater attenuation at 250 cycles away from resonance than it has at the resonant frequency; and when circuit l3, l4, l5, I8 is adjusted to resonance at 3,000 cycles it may give 8 decibels greater attenuation at 450 cycles away from resonance than at the resonant frequency.
  • circuit 9, l0, II, II when circuit 9, l0, II, II is adjusted to resonance at 1,000 cycles, it may give ,6 decibels greater attenuation at 250 cycles away from resonance than it has at the resonant frequency; and when circuit l3, l4, l5, I8 is adjusted to resonance at 3,000 cycles it may give 8 decibels greater attenuation at 450 cycles away from resonance than at the resonant frequency.
  • Key V when in its top position transmits the waves at full volume to the variable volume control pad I! and amplifier "leading toloudspeaker I! or other output such as a line. When be determined by trial. Stop consonant effects are introduced by sharp, quick movements of key V.
  • Figs. 1A and 13 different types of resonance controls are shown which may be substituted for the type shown in Fig. 1.
  • Key III in Fig. 1A moves a sliding contact along inductive winding 20, while key IV similarly moves its slider alon inductive winding ii.
  • the slider may move from units. These prevent storage of energy with re- I range.
  • Fig. 2 illustrates a system in which a simplification in the number of controls has been made.
  • the sound wave sources I and 5 and their associated circuit elements may be the same as previously described.
  • Key II of Fig. 1 has been replaced by a relay 24 adapted to be energized from battery 25 whenever key I' bridges over contacts 26, which is its first operative position, the topmost position being -idl'e, corresponding to such a large bias on the grid that no oscillationsare produced.
  • relay 24 When relay 24 is energized its armature closes its lower contact and connects the random noise source toconductor 8. Further downward movement of key I releases relay 24 to connect the tone source I to conductor 8 and such further downward movement of key I also results in controlling the fundamental pitch as previously described.
  • the resonance controls are not independent. When a sound has a certain resonance in the lower frequency range, it will have an associated resonance in the upper frequency This is more or less fundamental in the two parts-for producing resonant efiects. In any case, a certain lower resonance anda certain upper resonance must go together for a paroutputs from the energy sources are shown, eachincluding a "bufler" amplifier 3
  • the stop consonant key V is similar in construction to that shown in Fig. 38 of my copending application for United States patent Serial No. 181,275, filed December 23, 1937. and is arranged to provide for the making of aseries of circuit closures in given sequence and returned to normal position without operating the contacts in reverse order, if such action is desired.
  • the key shank 35 is linked to the slider 36 which in its downwardftravel engages in succession contacts 31, 38, 39, 40 and 4
  • wiper 36 connects conductor 8 and contact 31 directlyt'o the outgoing circuit including volume adjusting network Il, amplifier l8 and loud-speaker l9, When moved into engagement with contact 38, wiper 36 connects the output of the relaxation oscillaticular sound, Because of these facts there is associated with each resonance A, a single corresponding resonance B for the optimum production of a particular sound. Because of this, it is possible to combine these two resonant controls into one multiple resonance control.
  • Fig. 2 This has been done in Fig. 2 by providing a single key III with two sliders operating over resistances 28 and 28, respectively, for controlling the direct current in the inductive windings of the respective resonant branches, similarly to 4 Fig. 1.
  • any type of variation such as increasing or decreasing at various rates can be secured by continuous movement of the slider uniformly in the same direction.
  • the remainder of the system from this point on to the final output may be the same as in Fig. 1,
  • other types could be used such as those shown in Figs. 1A and-1B with suitable mechanical linkage between the two sliders and the single control key.
  • Fig. 3 shows the same system as Fig. 1 except as modified to include a more elaborate stop contor l throughbufier amplifier 33 and network 45 to the outgoing circuit.
  • network 45 is substituted for thevariable resonance control circuits and by. constructing network 45 to have definite characteristics the desired resonance effoot is produced when this is brought into the circuit.
  • wiper 36 brings network 46 into circuit between the output of the resonance control circuits and the outgoing circuit.
  • Network 46 may be constructed to give its individual characteristic shaping to the waves.
  • wiper 36 connects the output of the random noise source 5 through buffer amplifier 34 and shaping network 41 to the outgoing circuit.
  • Fig. 4 the circuit of which is to be substituted in Fig. 1 between lines 2-2 and 11-11. separate resonance control circuits are shown for voice and unvoiced sounds with common control keys 111" and IV"... In this arrangement the control keys.
  • Each resonance control key carries two sliders which operate over separate inductance controls in respective circuits.
  • the resonant branches 3, In, II, I! and I3, I4, l5 and I6 may be the same as in Fig. 1 and controlled in exactly similar manner, as shown. In the present figure, however, these are used only with" respect to the controlled by resonant branches 9', In, H, I2,
  • I4, l5, l6 which may be constructed to have resonant properties similar to or, preferably, different from the resonant circuits for the voicedenergy. Since the unvoiced sounds usually employhigher frequencies than the voiced 'sounds, the resonant branches indicated by primed numerals would be resonant to higher frequencies than the others. Such an arrangement makes for ease in manipulation, for a key canbe made to correspond, in one position, to one frequency condition in the branch carrying waves representing voiced sounds and to some different frequency condition in the consonant sound branches, so that both of these resonant conditions can be obtained without movement of the control key in question. Thus the transition from voiced to unvoiced sounds and vice versa can be made with less extent of movement of the control keys in many instances.
  • a source of electrical wave energy of continuous spectrum for simulating unvoiced sounds, means to select wave energy from either source, means to control variably the waves from the selected source comprising variable resonant circuits simulating the effect of the different resonant cavities of the human mouth.
  • electrical wave generating means for producing electrical wave energy having a spectrum distribution resembling that of vocal cord sounds and other electrical wave energy of continuous spectrum
  • a switch for selecting either of said types .of wave energy at will
  • a shaping circuit comprising a pair of resonant circuits capable of having their resonances separately varied over the major part of the speech frequency range, individual resonance controls for said circuits,
  • a controlling circuit means producing waves consisting of a fundamental and a plurality of harmonic components in the speech frequency range, means producing other waves of random frequency distribution within the speech frequency range, selecting means for artificial speech,
  • a system for the artificial production of speech waves comprising means for generating waves covering the speech frequency range and comprising afundamental frequency and harmonies thereof, means for generating wavesof random frequency distribution, means for producing sound from said waves in sequencessimulating the sound sequences in speech and a resonant system connected to said generating and sound producing means having variable resonance means" for emphasizing sounds in different portions of the speech frequency band.
  • a source of electrical waves comprising a fundamental frequency and harmonics thereof, a source of waves of random frequency distribution, means for translating waves from either source into sound, aresonan-t system for transferring Waves from said sources to said translating means comprising circuit branches of adjustable resonance, and means for varying the resonant frequencies of said branches.
  • the method of producing speech artificially comprising generating electrical waves simulating the vocal cord vibrations and generating other electrical waves simulating unvoiced sounds, re-' producing sounds from said waves and molding the waves which produce said sounds by producing resonance effects upon the electrical waves simulating the action of the resonant air chambers of the mouth and producing continuous variations of said resonance effects to make transitions from one sound to another.
  • a relaxation oscillator adapted to connect said sources to said translating means and manually operated meansfor varying the resonant frequencies of said resonance circuits.
  • a relaxation oscillator for the artificial production of speech, a relaxation oscillator, a random noise source, means for translating electrical waves into sound waves, variable resonance circuits adapted to connect said sources to said translating means, manually operated means for varying the resonant frequencies of said resonance circuits and a stop consonant key having a movable contaoting element,- movable over a succession of stationary contacts for connecting the input of said sound reproducing means in rapid succession to a, succession of shaping networks.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrophonic Musical Instruments (AREA)

Description

May 27, 1941'.
H. W. DUDLEY PRODUCTION OF ARTIFICIAL SPEECH Filed March 16, 1940 4 Sheets -Sheet' l 6A5 FILLED EQUAL/25R AAA l W l M l EQUAL/25R INVEA IT OR H. m DUDLEY A TTORNE Y May 27, 1941.
H. W. DUDLEY PRODUCTION OF ARTIFICIAL SPEECH Filed March 16, 1940 6A). FILLED EQUAL IZER RA NDOMv 4 Sheets-Sheet 2 /-v)v-TOR H. W DUDLEY ATTURNEV y 7, 1941. H. w. DUDLEY 2,243,525
PRODUCTION 0F ARTIFICIAL SPEECH v Filed March 16, 1940 4 Sheets-Sheet s FIG. 3
co/v TRQLS RESONANCE BUFFER AMPLIFIERS gum gg I /NVENTOR gg htw DUDLEY A TTORNEV May 27, 1941. H. w. DUDLEY v ,5
PRODUCTION OF ARTIFICIAL SPEECH Filed March 16, 1 940 4 Sheets-Sheet 4 ll llll hlllll ATTORNEY Patented May 27,1941
UNITED STATE S PATENT OFFICE" amuse rnonucrron' or numeric srnnon Homer W. Dudley, Garden City, N. E, assignor a Bell Telephone Laboratories, Incorporated, New York, N. Y., a co p ration of New York Application March 16, 1940, Serial No. 324,286
' 9 Claims. (01. I'm-d) The present invention relates to the artificial production of speech or similar complex waves. The-invention has particular reference to, and will be disclosed in connection with, a key-operdied or finger-operated mechanism iorbuilding up understandable speech waves from waves produced in suitable wave sources and constituting the raw material out of .which the speech is to be constructed.
An object of the invention is the artificial production of speech or similar waves in simplified and effective manner.
In my prior Patent 2,121,142, June 21, 1938, I
have disclosed a system for artificial speech production from waves of two types, a buzzer-like tone wave and a hiss like noise wave, which are combined in particular manners under control of keys to build up speech waves; In that tern as'disclosed a number of narrow band filters ior example, ten filters, are used in the syn thesizing operation, these filters having fixed nequency limits and being brought into the circuit in such combinations as needed. Usually only asmall number of these filters are needed for any one sound, such as two or three filters, the rest of the group being momentarily unused.
The present invention aims at a simplification in apparatus, pecially in the matter at the irequency selective devices used. The present invention achieves such simplification by imitating more closely the process carried out by the hu-. man speech mechanism. While the system of my prior patent has operated successfully as an arusing, inplace of the large number of fixed filters the ,tip raised so as to provide two resonance principal portions. Moreover, as indicated above','
the positions of jaws, tongue and lips used for producing one sound effect, change by a continuous movement to the positions necessary to produce the next sound effect, resulting in a continuous transition from one sound to'another, in a large number of cases.
According to the present invention the con ditions of nature are more closely approached by disclosed in my prior patent, a small number of resonant circuits, such as two, which are variable and which can be variably controlled to produce sounds that merge, one into the next. as
- the transition is made from one sound to ,an-
tificial speech producer, the use of a large number of fixed filters which are switched into and out of circuit in groups has no'closecounterpart in the human speech mechanism. Rather, in
human speech production the frequency controls are exercised by the air chamber resonances of the mouth comprising P ncipally two variable resonances, and changes aremade mostly by gliding from one sound to another as a result of continuous or gradual, of abrupt, changes.
In talking, the size of the total mouth cavity is varied lay-movement oi the lower jaw and the lips while the tongue modifies the mouth rosenances in a very important manner by shaping the mouth cavities. For example, the tongue by rising in the middle divides the mouth cavity into two principal chambers with air coupling between themfand the resonances 0! these two chambers are varied between wide limits by varying the respectiveair chamber sizes and degree figure. Key I is the pitch control key, key II is of coupling. The tongue may be drawn back and other. a
The nature of the invention and its various objects and features will be made more clear from the following detailed description oi particular embodiments as illustrated in the attached drawinst In the drawings: v Fig l is a schematic circuit diagram of a. system for artificially producing speech sounds manually under the control of keys in accordance with one form of the invention;
Figs. 1A and 1B show modifications respectively that may be substituted for the corresponding portion of the circuit of cated by the brokenlines: I 2 is a schematic circuit diagram similar to Fig. 1, as indi- 1 but with certain simplifications according to the invention;
Fig. 3 is a diagram generally similar to Fig. 1 but showing a modified system according to the invention; and I v Fig. 4 is a i'ragmentarycircuit schematic showing a modification that may be substituted in the system ofFig. 1, as indicated by the broken lines.
Referring to Fig. 1 a. system for producing artificial speech is shown in which five keys or controls are used, numbered I, II, 111, .IV and V in the the key for selecting between the type of energy source (buzz or keys III and IV are the resonance controls and key V is the stop conso: nant key. w
Theaource of .energy simulating the vocal cords and producing the buzz or complex tone wave is shown as a relaxation oscillator comprising gasfilled tube l and associated circuits. This may be of the type shown in United States patent to R. R. Piesz 2,183,248, issued December 12, 1939.
It produces a wave rich in harmonics; the fundamental of which is readily variable by controlling the negative grid bias. Equalizer 4' reduces all of the harmonics to the same amplitude or is designed to produce some other desired relation between them. When key I is-in its upper position it makes contact at 2, giving a bias to the grid equal to the sum of the voltages of batteries 3' and 3", which is great enough to prevent production of oscillations. As key I is tiepressed, contact is made at 2'. The two parts of the resistance 3 and the resistance adjacent 3" act like a potentiometer so that when contact is made at 2' a-lesser part of the voltage of battery 3" is effective on the grid and the relaxation oscillator starts to oscillate at a low fundamental frequency. As the key is depressed further, a proportionately smaller voltage is applied from source 3" to the grid and the fundamental frequency of the oscillator is caused to increase.
The source of continuous spectrum or random frequency noise is shown as comprising gas tube 5, followed by amplifier 8, and equalizer l' for making the energy distribution constant over the utilized frequency band or otherwise suitabLv shaping it.
Energy source key II in its upper position connects oscillator l to the conductor 8 and in its lower position it connects the random noise source to conductor 8. The other side of the respective source is shown grounded. For simplicity of wiring, Fig. 1 shows use of a ground return although thewgrounds could be replaced by a conductor, if desired.
The function of keys III and IV is to control the tuning of the two selective circuit branches shown included in conductor 8. The resonant circuit controlled by key III includes inductances I and II, resistance II and capacity If. The resonant circuit controlled by key IV includes In considering the operation of the circuit of Fig. 1, it will be convenient to have certain frequency relations in mind. It will be assumed for illustration that the random noise covers the human speech band and that for production of normal speech eflects the relaxation oscillator circuit can have its fundamental frequency varied resonance and resistances II and I8 may be proportioned to give the desired degree of flatness even to the possibility of having them vary with the inductances under control of extra arms on keys III and IV not shown. To give illustrative figures, when circuit 9, l0, II, II is adjusted to resonance at 1,000 cycles, it may give ,6 decibels greater attenuation at 250 cycles away from resonance than it has at the resonant frequency; and when circuit l3, l4, l5, I8 is adjusted to resonance at 3,000 cycles it may give 8 decibels greater attenuation at 450 cycles away from resonance than at the resonant frequency. These values are not critical and in practice can be varied widely to produce desired effects.
With these general values of frequencies in mind, it is seen that manipulation of key I produces rising and falling inflection and determines the normal talking fundamental, while keys III and IV mold the output of either of the two energy sources, as chosen by key II, into reco nizable sounds similarly to the way in which the human mouth molds the vocal cord waves and resistance l3, capacity I4 and inductances II and I8. While the resonances of these circuits mightbe controlled in various ways the particular method exemplified in this figure is by varying the saturating current flowing in the windings, which have saturable cores. As either key may be. This current varies the point of operation on the permeability curve.
Key V when in its top position transmits the waves at full volume to the variable volume control pad I! and amplifier "leading toloudspeaker I! or other output such as a line. When be determined by trial. Stop consonant effects are introduced by sharp, quick movements of key V.
While in Fig. 1, as well as in the figures to follow, the use of only two resonance controls is shown it is within the invention to use more than two resonance controls, and additional ones may be provided by substantial duplication of the ones shown. The variation of inductance can be made to follow any desired law within considerable limits by use of cores in the different inductances saturating at different rates or in different directions as disclosed more fully in my copending application Serial No. 324,288 filed March 16, 1940.
In Figs. 1A and 13 different types of resonance controls are shown which may be substituted for the type shown in Fig. 1. Key III in Fig. 1A moves a sliding contact along inductive winding 20, while key IV similarly moves its slider alon inductive winding ii. The slider may move from units. These prevent storage of energy with re- I range.
way the air cavity of the mouth is divided into sultant clicks as the contacts are made or broken. The resistances also give some damping of the circuit resonance.
Fig. 2 illustrates a system in which a simplification in the number of controls has been made. In this figure the sound wave sources I and 5 and their associated circuit elements may be the same as previously described. Key II of Fig. 1 has been replaced by a relay 24 adapted to be energized from battery 25 whenever key I' bridges over contacts 26, which is its first operative position, the topmost position being -idl'e, corresponding to such a large bias on the grid that no oscillationsare produced. When relay 24 is energized its armature closes its lower contact and connects the random noise source toconductor 8. Further downward movement of key I releases relay 24 to connect the tone source I to conductor 8 and such further downward movement of key I also results in controlling the fundamental pitch as previously described.
It is possible to omit the explosive consonant control also. A sudden oncoming of energy of the right frequency distribution will produce a good imitation of the explosive sound desired. This is accomplished in the circuit of Fig, 2 by providing a contact 21 in the path on the slider of key I. When contact is made at 21 by the rising key I a silent condition results which can be terminated by lowering the key again resulting in energy changes such as might occur in the sound 9 of the word ago. Similarly, the stop position may be preceded or followed by the random noise source connection above it. By the proper sequence of key I positions and of adjustment of resonance by key III any of the stop consonants can be produced.
In general, the resonance controls are not independent. When a sound has a certain resonance in the lower frequency range, it will have an associated resonance in the upper frequency This is more or less fundamental in the two parts-for producing resonant efiects. In any case, a certain lower resonance anda certain upper resonance must go together for a paroutputs from the energy sources are shown, eachincluding a "bufler" amplifier 3|, 32, 33 or 34 to isolate the two'separate output branches of the same oscillator and prevent undesired reactions.
The stop consonant key V is similar in construction to that shown in Fig. 38 of my copending application for United States patent Serial No. 181,275, filed December 23, 1937. and is arranged to provide for the making of aseries of circuit closures in given sequence and returned to normal position without operating the contacts in reverse order, if such action is desired. The key shank 35 is linked to the slider 36 which in its downwardftravel engages in succession contacts 31, 38, 39, 40 and 4|. If the key is not allowed to rise until it has been depressed the full limit, its end 36 passes under the lower end of guide trigger 42 which is urged to the left by spring 43, so that on its upward stroke the end 36 passes on the right-hand side of member 42 and makes contact with fixed member 44 on such upward stroke. The manner in which it -is used in the figure is illustrative of one type of control. In its rest position' as shown, wiper 36 connects conductor 8 and contact 31 directlyt'o the outgoing circuit including volume adjusting network Il, amplifier l8 and loud-speaker l9, When moved into engagement with contact 38, wiper 36 connects the output of the relaxation oscillaticular sound, Because of these facts there is associated with each resonance A, a single corresponding resonance B for the optimum production of a particular sound. Because of this, it is possible to combine these two resonant controls into one multiple resonance control.
This has been done in Fig. 2 by providing a single key III with two sliders operating over resistances 28 and 28, respectively, for controlling the direct current in the inductive windings of the respective resonant branches, similarly to 4 Fig. 1. By strapping variously the contacts to points along resistor 29, any type of variation such as increasing or decreasing at various rates can be secured by continuous movement of the slider uniformly in the same direction. The remainder of the system from this point on to the final output may be the same as in Fig. 1, Instead of the particular type of resonance controls shown, other types could be used such as those shown in Figs. 1A and-1B with suitable mechanical linkage between the two sliders and the single control key.
Fig. 3 shows the same system as Fig. 1 except as modified to include a more elaborate stop contor l throughbufier amplifier 33 and network 45 to the outgoing circuit. Thus network 45 is substituted for thevariable resonance control circuits and by. constructing network 45 to have definite characteristics the desired resonance effoot is produced when this is brought into the circuit. When moved to engage contact 39, wiper 36 brings network 46 into circuit between the output of the resonance control circuits and the outgoing circuit. Network 46 may be constructed to give its individual characteristic shaping to the waves. When in contact with member 40, wiper 36 connects the output of the random noise source 5 through buffer amplifier 34 and shaping network 41 to the outgoing circuit. This'may be used to give a pronounced sibilant or hiss or aspirant or other unvoiced effect and'the resonance of network 41 may be chosen to select the particular part of the spectrum to give the desired effect. If depressed only part way the key V retraces its downward path on returning, but if depressed to the full extent it returns wiper 36 on the opposite side of catch 42 so that 36 makes contact withspring 44 which, as shown, has the same electrical eiIect as contact between 36 and 31. By variously wiring the contacts of key .V', various efiects are obtainable, those described being illustrative of some, Moreover, if desired, a plurality of separate keys like key V may be used in parallel and so wired that each key corresponds to a particular consonant or group of consonants.
In Fig. 4, the circuit of which is to be substituted in Fig. 1 between lines 2-2 and 11-11. separate resonance control circuits are shown for voice and unvoiced sounds with common control keys 111" and IV"... In this arrangement the control keys.
- Each resonance control key carries two sliders which operate over separate inductance controls in respective circuits. The resonant branches 3, In, II, I! and I3, I4, l5 and I6 may be the same as in Fig. 1 and controlled in exactly similar manner, as shown. In the present figure, however, these are used only with" respect to the controlled by resonant branches 9', In, H, I2,
and 13, I4, l5, l6 which may be constructed to have resonant properties similar to or, preferably, different from the resonant circuits for the voicedenergy. Since the unvoiced sounds usually employhigher frequencies than the voiced 'sounds, the resonant branches indicated by primed numerals would be resonant to higher frequencies than the others. Such an arrangement makes for ease in manipulation, for a key canbe made to correspond, in one position, to one frequency condition in the branch carrying waves representing voiced sounds and to some different frequency condition in the consonant sound branches, so that both of these resonant conditions can be obtained without movement of the control key in question. Thus the transition from voiced to unvoiced sounds and vice versa can be made with less extent of movement of the control keys in many instances.
What is claimed is: a
1. In a system for producing a source of electrical wave energy of frequency spectrum resembling vocal cord sounds, a source of electrical wave energy of continuous spectrum for simulating unvoiced sounds, means to select wave energy from either source, means to control variably the waves from the selected source comprising variable resonant circuits simulating the effect of the different resonant cavities of the human mouth.
2. In a system for producing artificial speech, electrical wave generating means for producing electrical wave energy having a spectrum distribution resembling that of vocal cord sounds and other electrical wave energy of continuous spectrum, a switch for selecting either of said types .of wave energy at will, a shaping circuit comprising a pair of resonant circuits capable of having their resonances separately varied over the major part of the speech frequency range, individual resonance controls for said circuits,
means to apply the wave energy selected by said switch to said resonant circuits, and a sound producer actuated by the energy transmitted through said resonant circuits.
3.' In mechanism for producing artificial speech and similar effects, a controlling circuit, means producing waves consisting of a fundamental and a plurality of harmonic components in the speech frequency range, means producing other waves of random frequency distribution within the speech frequency range, selecting means for artificial speech,
movement of the human speech mechanism, and means to translate the energy in said controlling circuit in-to sounds simulating speech.
4.;A system for the artificial production of speech waves comprising means for generating waves covering the speech frequency range and comprising afundamental frequency and harmonies thereof, means for generating wavesof random frequency distribution, means for producing sound from said waves in sequencessimulating the sound sequences in speech and a resonant system connected to said generating and sound producing means having variable resonance means" for emphasizing sounds in different portions of the speech frequency band.
5. In a system for producing artificial speech, a source of electrical waves comprising a fundamental frequency and harmonics thereof, a source of waves of random frequency distribution, means for translating waves from either source into sound, aresonan-t system for transferring Waves from said sources to said translating means comprising circuit branches of adjustable resonance, and means for varying the resonant frequencies of said branches.
6. The combination according to claim 5, in-
, eluding manually operated keys for selecting the one of said wave sources to be used and for varying the resonance of said circuit branches.
7. The method of producing speech artificially comprising generating electrical waves simulating the vocal cord vibrations and generating other electrical waves simulating unvoiced sounds, re-' producing sounds from said waves and molding the waves which produce said sounds by producing resonance effects upon the electrical waves simulating the action of the resonant air chambers of the mouth and producing continuous variations of said resonance effects to make transitions from one sound to another.
8. In a system for the artificial production of speech, a relaxation oscillator, a random noise source, means for translating electrical waves into sound waves, variable resonance circuits adapted to connect said sources to said translating means and manually operated meansfor varying the resonant frequencies of said resonance circuits.
9. In a systemfor the artificial production of speech, a relaxation oscillator, a random noise source, means for translating electrical waves into sound waves, variable resonance circuits adapted to connect said sources to said translating means, manually operated means for varying the resonant frequencies of said resonance circuits and a stop consonant key having a movable contaoting element,- movable over a succession of stationary contacts for connecting the input of said sound reproducing means in rapid succession to a, succession of shaping networks.
HQMER W. DUDLEY.
US324286A 1940-03-16 1940-03-16 Production of artificial speech Expired - Lifetime US2243525A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US324286A US2243525A (en) 1940-03-16 1940-03-16 Production of artificial speech

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US324286A US2243525A (en) 1940-03-16 1940-03-16 Production of artificial speech

Publications (1)

Publication Number Publication Date
US2243525A true US2243525A (en) 1941-05-27

Family

ID=23262926

Family Applications (1)

Application Number Title Priority Date Filing Date
US324286A Expired - Lifetime US2243525A (en) 1940-03-16 1940-03-16 Production of artificial speech

Country Status (1)

Country Link
US (1) US2243525A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2458227A (en) * 1941-06-20 1949-01-04 Hartford Nat Bank & Trust Co Device for artificially generating speech sounds by electrical means

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2458227A (en) * 1941-06-20 1949-01-04 Hartford Nat Bank & Trust Co Device for artificially generating speech sounds by electrical means

Similar Documents

Publication Publication Date Title
US2243527A (en) Production of artificial speech
US2183248A (en) Wave translation
Dunn The calculation of vowel resonances, and an electrical vocal tract
US2301871A (en) Electrical musical instrument
Rosen Dynamic analog speech synthesizer
US2938079A (en) Spectrum segmentation system for the automatic extraction of formant frequencies from human speech
CA2019804A1 (en) Gain-shape vector quantization apparatus
US2817711A (en) Band compression system
US2250916A (en) Robot lip action
US2414886A (en) Apparatus for the production of music
US2243525A (en) Production of artificial speech
US2243526A (en) Production of artificial speech
Klatt Acoustic theory of terminal analog speech synthesis
US2233258A (en) Electrical musical instrument
US2694954A (en) Electrical musical instrument
US2458227A (en) Device for artificially generating speech sounds by electrical means
US2243089A (en) System for the artificial production of vocal or other sounds
US2570701A (en) Harmonic-selecting apparatus
US3180918A (en) Tone generator system
US2824906A (en) Transmission and reconstruction of artificial speech
US2983178A (en) Accordion-organ
US1947621A (en) Reverberation circuit
US3573374A (en) Formant vocoder utilizing resonator damping
Fletcher Plucked strings-a review
US3042748A (en) Dynamic analog speech synthesizer