US3542955A - Automatic generation of voiceless excitation in a vocal-tract synthesizer - Google Patents
Automatic generation of voiceless excitation in a vocal-tract synthesizer Download PDFInfo
- Publication number
- US3542955A US3542955A US724931A US3542955DA US3542955A US 3542955 A US3542955 A US 3542955A US 724931 A US724931 A US 724931A US 3542955D A US3542955D A US 3542955DA US 3542955 A US3542955 A US 3542955A
- Authority
- US
- United States
- Prior art keywords
- synthesizer
- signal
- vocal
- tract
- excitation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000005284 excitation Effects 0.000 title description 22
- 230000001755 vocal effect Effects 0.000 description 18
- 230000006870 function Effects 0.000 description 14
- 230000015572 biosynthetic process Effects 0.000 description 11
- 238000003786 synthesis reaction Methods 0.000 description 11
- 230000004044 response Effects 0.000 description 9
- 230000003595 spectral effect Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 2
- 239000003990 capacitor Substances 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003534 oscillatory effect Effects 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 210000001260 vocal cord Anatomy 0.000 description 2
- 241000400041 Proteidae Species 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000005669 field effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000000034 method Methods 0.000 description 1
- 208000024703 mutilating palmoplantar keratoderma with periorificial keratotic plaques Diseases 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 210000000689 upper leg Anatomy 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
Definitions
- This invention relates to speech synthesis and, more particularly, to apparatus for exciting an analog speech synthesizer to generate synthesized speech sounds.
- 664,130 is achieved by further simulating the glottal section of the human articulatory system as a second order oscillatory system.
- the oscillatory system is controlled in accordance with physiological parameters of subglottal pressure, vocal cord tension, and vocal tract shape. Although this system greatly improves the quality of voiced sounds, it does not provide sufficient control of the synthesizer to achieve a like improvement in unvoced sounds.
- Fricative excitation is essential for generation of sounds, such as /s/ and /sh/. Fricatives are generated in the human articulatory system by turbulent air flow (noise) at a constriction in the vocal tract. At some given velocity, the flow through the constriction becomes turbulent and thereby constitutes a localized random noise source. The location of the effective noise source and the magnitude of the noise, however, are difficult to determine.
- Stop excitation is produced by making a complete closure of the vocal tract. Pressure is built up behind the closure and, when released, a step function of pressure is applied to the vocal tract. Generally, stop excitation is followed by some frictative excitation (or aspiration) caused by turbulence generated after the release of the pressure. The position of the stop source, i.e., the step function, and the magnitude of the signal are also difficult to determine.
- Another object of this invention is to generate voiceless excitation in a vocal-tract synthesizer in response to signals present within the synthesizer.
- controllable signals developed to represent the momentary changes in air flow caused by variation in a vocal-tract configuration may be used to control voiceless excitation in a synthesizer to produce natural-sounding artificial speech.
- flow through a constriction may be characterized by Reynolds number, and that Reynolds number is proportional to the particle velocity of the flow through the constriction and to the width of the constriction.
- Reynolds number is proportional to the particle velocity of the flow through the constriction and to the width of the constriction.
- noise pressure at a fixed distance in front of the mouth and, in turn, the noise pressure at a given location, for example, at a constriction within the vocal tract is proportional to the square of Reynolds number'in excess of a critical threshold level.
- voiceless excitation is developed in a vocal-tract synthesizer by use of variable random energy (noise) sources, each having controllable internal impedances.
- the noise is selectively inserted in each of a plurality of sections, which make up the synthesizer.
- Control of the noise source and its'impedance is effected by using signals present within each synthesizer sections, namely, area control signals and current level.
- voiceless excitation is developed in each section of a vocal-tract synthesizer by sampling the current flowing in that sec-,
- a feature of the present invention is the generation of signals in response to instantaneous changes in the shape of and the flow within a vocal-tract synthesizer section to develop voiceless excitation for generating synthetic speech.
- Another feature of the invention permits voiceless excitation signals to be used in conjunction with glottal excitation signals to form voiced fricative synthetic speech sounds.
- FIG. 1 shows in block schematic form a speech synthesis system in which the present invention may be utilized
- FIG. 2 illustrates a section of an analog synthesizer that employs the present invention
- FIG. 3 shows in block schematic form the details of the controllable noise source of FIG. 2;
- FIG. 4 shows in block schematic form the details of the impedance control generator of FIG. 2.
- a spoken word is composed of a sequence of linguistic elements called phonemes.
- phonemes, or sequences of phonemes to gether with suitable stressing must be specified in a manner directly acceptable by a synthesizer.
- the synthesizer must be excited by signals representative of these phonemes, in both the voiced and unvoiced domain, in order to generate naturaLsounding speech sounds.
- FIG. 1 shows in simplified block form a synthesis by rule system suitable for generating natural-sounding speech in response to coded signals.
- Phoneme sequence generator 101 is utilized to produce appropriate signals for initiating the synthesis of speech sounds. Upon request, this information is supplied to vocal-tract area function generator 102 and to glottal excitation signal generator 103.
- Area control signals A through A for controlling the synthesis of voiced sounds by transmission line vocal-tract synthesizer 104 are generated by vocal tract area function generator 102.
- Synthesizer 104 also requires suitable excitation, U which is produced in glottal excitation generator 103.
- Synthesizer 104 responds to the applied control signals and develops aritificial speech sounds which may be used to energize a transducer such as loud speaker 105.
- a synthesis system such as this is described in detail in my copending applicator (Ser. No. 664,130, filed Aug. 29, 1967) cited above. Although that system generates signals which are adequate to excite vocal-tract synthesizer 104 for voiced sounds, it does not afford sufiicient control over synthesizer 104 to generate high-quality unvoiced sounds.
- speech synthesizer 104 may comprise n sections, each of which represents a portion of the human vocal tract.
- speech synthesizer 104 may comprise n sections, each of which represents a portion of the human vocal tract.
- noise signals must be inserted into each of the synthesizer sections. The inserted noise signals must be representative of noise generated by turbulent flow within the human vocal tract.
- zq is the acoustic particle velocity flowing in synthesizer section i;
- W is the effective width of synthesizer section i
- p is the air density
- Equation 2 can be simplified to By simple substitution, Equation 4 reduces to i i
- the area control signal A and the current flowing within the section which is representative of U
- experimentation has indicated that the internal impedance of the localized noise source may be approximated by a series connection of variable resistive elements, namely, R and R
- FIG. 2 shows in schematic form section i of vocal-tract synthesizer 104 comprising a plurality of n sections which respond to signals present within a vocaltract model to excite the synthesizer for generating the unvoiced, i.e., fricative and stop, portion of synthesized speech sounds.
- Inductors L and capacitor C are primary impedance elements which form one T-section of transmission-line type vocal-tract synthesizer 104. The component values of inductors L and capacitor C are controlled in response to vocal-tract area signal A
- a synthesizer T-section of this type and apparatus for controlling the T-section components is described in greater detail in the Rosen patent cited above.
- Current U is developed in T-section i in response to the application of glottal excitation signal U (FIG. 1) to vocal-tract synthesizer 104.
- the resultant current U flowing within section i is sampled at 201.
- Current U normally comprises a DC component and higher spectral components.
- the DC. component is several times greater in magnitude than the higher spectral components. Since the magnitude of the noise signal to be inserted in the synthesizer section and the value of the internal impedance of the noise source are dependent upon the magnitude of U, the system may tend to be unstable.
- U is filtered via low-pass filter 202 to eliminate the higher spectral components, which are generally responsible for such instability.
- filter 202 minimizes the possibility of feeding back high spectral components that are inserted into the synthesizer by noise source 204, or by the noise sources of any of the other sections of the synthesizer, thereby further insuring stability of the system.
- noise source 204 or by the noise sources of any of the other sections of the synthesizer, thereby further insuring stability of the system.
- a low-pass filter having a cutoff frequency of approximately 500 Hz. is sufiicient for this purpose.
- current signal U is applied to full wave rectifier circuit 203 to obtain its absolute value ]U,[.
- any other arrangement for obtaining the absolute value of U may be utilized as desired.
- a signal representative of [U is thereafter applied to controllable noise source 204, to be discussed in greater detail in connection with FIG. 3, and to impedance control generator 205, to be discussed in connection With FIG. 4.
- Area control signal A is also applied to controllable noise source 204 and to impedance control generator 205.
- Signal A is utilized in conjunction with a signal representative of
- the noise signal inserted and the resistive values of R and R cooperate to vary the fiow of current U in the synthesizer section, thus to generate natural-sounding speech sounds.
- FIG. 3 shows in block schematic form details of controllable noise source 204.
- Signals representative of A, and ⁇ U 1 are supplied, in accordance with the invention, to controllable noise source 204 for generating a noise signal P in accordance with Equation 5.
- U is squared in squaring network 301, and thereafter applied to one input of dividing network 302.
- A is applied to another input of dividing network 302.
- the output from divider 302 is applied to amplifier 303, where its amplitude is adjusted.
- the output from amplifier 303 is thereafter applied to control the gain of variable gain amplifier 304.
- a noise signal generated in noise source 305 which may be, for example, a gas tube, is applied to the input of variable gain amplifier 304.
- the output signal P of amplifier 304 is applied to transformer 206 for insertion into the synthesizer section.
- FIG. 4 A block schematic diagram of impedance control generator 205 is shown in FIG. 4.
- Signals A, and [U are supplied to generator 205, wherein they are utilized to generate signals for controlling the impedance values of R and R in accordance with Equation 6 and 7, respectively.
- squaring network 401 A signal representative of [U is applied to a second input of dividing network 402.
- the output signal, (K [U ]A,- from dividing network 402 is representative of Equation 6 and is applied to variable resistor 207, which may be either a rheostat driven by a servomechanism, an appropriately biased field effect transistor, or the like.
- Signal A is also applied for cubing to network 403, where it is converted to A Thereafter A is divided and appropriately scaled in network 404.
- the output from network 404 is representative of Equation 7. This signal, (K4A1 3), is applied to variable resistor 208.
- control signal generator includes:
- Apparatus as defined in claim 1 further including controllable impedance elements connected in series with said inserting means, said impedance elements being responsive to others of said control signals for varying the magnitude of the impedance in said circuit path of said synthesizer section in accordance with a preestablished function of said control signals.
- control signal generator includes:
- apparatus for developing voiceless excitation in each of said sections which comprises:
- a controllable source of noise energy having variable internal impedance elements; means for generating a signal representative of the magnitude of a current developed in a circuit path of said synthesizer section, said current signal being representative of the volume velocity flow in said synthesizer section; means responsive to said vocal-tract area function signal and to said current magnitude signal for generating signals for controlling said noise source; and
- said im- References Cited pedance control signal generator includes: 5 UNITED STATES PATENTS means responsive to said area funct1on signal for gen- 3,042,748 7/1962 Rosen 179 1 eratmg a signal for controlling the impedance value a of one of said impedance elements; and KATHLEEN H CLAFFY, Primary Examiner means responsive to the magnitude of said current signal and to said area function signal for generating a 10 OLMS Assistant Exammer
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Description
Nov. 24, 1970 J. L. FLANAGAN 3,542,955
I AUTOMATIC GENERATION OE VOICELESS EXCITATIQN IN A VOCAL-TRACT SYNTHESIZER Filed April 29, 1968 2 Sheets-Sheet 1 VOCAL TRACT AREA 5 FUNCTION I /o/ GENERATOR 4 I/ /A/ /A/7 'PHONEME G-z@%%a-,-% o
4 GLOTTAL 9 l EXC/TAT/ON VOCAL TRACT I SIGNAL SVNTHES/ZER GENERATOR A j 1 F G. 2 2
204 205 2/0 l CON TROLL/161E IMPEDANCE r NOISE CONTROL sou/n35 GENERATOR x02. i 207 20a i u -206 C W W WT m m O /7 T-SECT/ON C I I Q lNl/NTOR J. L. FLANAGAN ATTORNEY United States Patent O "ice ABSTRACT OF THE DISCLOSURE Natural-sounding synthesized speech is produced in an analog vocal-tract synthesizer by selectively inserting a noise signal, in addition to a glottal excitation signal, into each section of the synthesizer. The magnitude of the noise signal and the internal resistance of the noise source are automatically controlled in response to the current flowing in and the area control signal applied to each synthesizer section.
BACKGROUND OF THE INVENTION This invention relates to speech synthesis and, more particularly, to apparatus for exciting an analog speech synthesizer to generate synthesized speech sounds.
Several systems have been proposed which attempt to produce natural-sounding speech through synthesis. In one of the most promising of these systems an attempt is made to duplicate the transmission properties of the human vocal tract and to supply siutable vocal excitation to a model of the vocal tract constructed to exhibit these properties. Since the exact process by which human speech is generated is not completely understood, speech produced in such a synthesis system is often unnatural sounding. Difficulties have been experienced in duplicating signals representative both of the voiced and unvoiced portions of human speech. Fortunately, it is now possible to develop voiced speech portions of a very thigh quality. The improvement, as described in my copending application, I. L. Flanagan, filed Aug. 29, 1967, Ser. No. 664,130, is achieved by further simulating the glottal section of the human articulatory system as a second order oscillatory system. The oscillatory system is controlled in accordance with physiological parameters of subglottal pressure, vocal cord tension, and vocal tract shape. Although this system greatly improves the quality of voiced sounds, it does not provide sufficient control of the synthesizer to achieve a like improvement in unvoced sounds.
Generation of natural-sounding speech also requires that the synthesizer be properly excited in the voiceless domain. Voiceless excitation is of two types, namely, continuant (fricative) and transient (stop). Fricative excitation is essential for generation of sounds, such as /s/ and /sh/. Fricatives are generated in the human articulatory system by turbulent air flow (noise) at a constriction in the vocal tract. At some given velocity, the flow through the constriction becomes turbulent and thereby constitutes a localized random noise source. The location of the effective noise source and the magnitude of the noise, however, are difficult to determine.
3 ,542,955 Patented Nov. 24, 1970 Stop excitation is produced by making a complete closure of the vocal tract. Pressure is built up behind the closure and, when released, a step function of pressure is applied to the vocal tract. Generally, stop excitation is followed by some frictative excitation (or aspiration) caused by turbulence generated after the release of the pressure. The position of the stop source, i.e., the step function, and the magnitude of the signal are also difficult to determine.
A system which attempts to reproduce speech, in response to an analysis of actual human speech, is disclosed in U.S. Pat. 3,042,748 issued to G. Rosen, July 3, 1962. Rosen employs apparatus for inserting signals into an analog synthesizer to produce synthetic speech sounds in response to a voiced signal. Rosen, however, attempts to control externally the magnitude of and place of insertion of the signals. Signals so generated yield speech sounds having poor quality. In such a system, difficulties are experienced in locating where the noise signals are to be inserted for obtaining satisfactory syntheized sounds. Although such a system may be useful in certain applications, it is unsatisfactory for practical use in the synthesis of human speech in response to coded signals, i.e., to the synthesis of speech by rule.
Therefore, it is an object of this invention to generate natural-sounding speech sounds.
Another object of this invention is to generate voiceless excitation in a vocal-tract synthesizer in response to signals present within the synthesizer.
SUMMARY OF THE INVENTION In accordance with this invention, these and other objects are accomplished by turning to account the flow characteristics of the human articulatory system. It is important to recognize that voiceless sounds are produced in the human articulatory system by the flow of turbulent air through a constriction in the vocal tract, or by a closure of the vocal tract. Thus, in an electrical counterpart to the human system, controllable signals developed to represent the momentary changes in air flow caused by variation in a vocal-tract configuration may be used to control voiceless excitation in a synthesizer to produce natural-sounding artificial speech.
It is known that flow through a constriction may be characterized by Reynolds number, and that Reynolds number is proportional to the particle velocity of the flow through the constriction and to the width of the constriction. Furthermore, it is known that noise pressure at a fixed distance in front of the mouth and, in turn, the noise pressure at a given location, for example, at a constriction within the vocal tract, is proportional to the square of Reynolds number'in excess of a critical threshold level.
Thus, according to the invention, voiceless excitation is developed in a vocal-tract synthesizer by use of variable random energy (noise) sources, each having controllable internal impedances. The noise is selectively inserted in each of a plurality of sections, which make up the synthesizer. Control of the noise source and its'impedance is effected by using signals present within each synthesizer sections, namely, area control signals and current level.
Accordingly, in the practice of this invention, voiceless excitation is developed in each section of a vocal-tract synthesizer by sampling the current flowing in that sec-,
tion and utilizing a signal representative of the magnitude of the current in conjunction with the area control signals already supplied to the section to generate signals representative of the voiceless portion of speech.
Therefore, a feature of the present invention is the generation of signals in response to instantaneous changes in the shape of and the flow within a vocal-tract synthesizer section to develop voiceless excitation for generating synthetic speech.
Another feature of the invention permits voiceless excitation signals to be used in conjunction with glottal excitation signals to form voiced fricative synthetic speech sounds.
These and other objects and advantages of the invention will be more fully understood from the following detailed description of an illustrative embodiment thereof taken in connection with the appended drawings.
BRIEF DESCRIPTION OF THE DRAWING FIG. 1 shows in block schematic form a speech synthesis system in which the present invention may be utilized;
FIG. 2 illustrates a section of an analog synthesizer that employs the present invention;
FIG. 3 shows in block schematic form the details of the controllable noise source of FIG. 2; and
FIG. 4 shows in block schematic form the details of the impedance control generator of FIG. 2.
DETAILED DESCRIPTION OF THE INVENTION A spoken word is composed of a sequence of linguistic elements called phonemes. Generally, to produce synthetic speech, phonemes, or sequences of phonemes, to gether with suitable stressing must be specified in a manner directly acceptable by a synthesizer. The synthesizer must be excited by signals representative of these phonemes, in both the voiced and unvoiced domain, in order to generate naturaLsounding speech sounds.
FIG. 1 shows in simplified block form a synthesis by rule system suitable for generating natural-sounding speech in response to coded signals. Phoneme sequence generator 101 is utilized to produce appropriate signals for initiating the synthesis of speech sounds. Upon request, this information is supplied to vocal-tract area function generator 102 and to glottal excitation signal generator 103. Area control signals A through A for controlling the synthesis of voiced sounds by transmission line vocal-tract synthesizer 104 are generated by vocal tract area function generator 102. Synthesizer 104 also requires suitable excitation, U which is produced in glottal excitation generator 103. Synthesizer 104 responds to the applied control signals and develops aritificial speech sounds which may be used to energize a transducer such as loud speaker 105. A synthesis system such as this is described in detail in my copending applicator (Ser. No. 664,130, filed Aug. 29, 1967) cited above. Although that system generates signals which are adequate to excite vocal-tract synthesizer 104 for voiced sounds, it does not afford sufiicient control over synthesizer 104 to generate high-quality unvoiced sounds.
In order to generate high-quality synthetic speech, a synthesizer must be capable of automatically responding to instanteous changes in the shape of the vocal tract and to changes in flow within the vocal tract. Thus, in order to simulate the human vocal tract, speech synthesizer 104 may comprise n sections, each of which represents a portion of the human vocal tract. To excite vocaltract synthesizer 104 for generating unvoiced and voiced fricative sounds, in addition to voiced sounds, noise signals must be inserted into each of the synthesizer sections. The inserted noise signals must be representative of noise generated by turbulent flow within the human vocal tract.
It is known that flow through a constriction may be characterized by Reynolds number (R). Thus, for a typical section, for example section i of n section synthesizer 104 W iP IL where:
zq is the acoustic particle velocity flowing in synthesizer section i;
W, is the effective width of synthesizer section i;
p is the air density; and
,u is the coefficient of viscosity.
where R zR and K is a constant.
In a practical system, because of the squaring, the threshold term R is not critical and actually may be neglected. Thus, Equation 2 can be simplified to By simple substitution, Equation 4 reduces to i i| P143 (5) Therefore, a localized source of noise may be controlled by signals already present within the synthesizer. For example, the area control signal A, and the current flowing within the section which is representative of U Furthermore, experimentation has indicated that the internal impedance of the localized noise source may be approximated by a series connection of variable resistive elements, namely, R and R These terms are described in greater detail in section 3.52 of my book, Speech Analysis, Synthesis and Perception, Academic Press, 1965, in connection with the vocal cord orifice model.
Thus, FIG. 2 shows in schematic form section i of vocal-tract synthesizer 104 comprising a plurality of n sections which respond to signals present within a vocaltract model to excite the synthesizer for generating the unvoiced, i.e., fricative and stop, portion of synthesized speech sounds. Inductors L and capacitor C are primary impedance elements which form one T-section of transmission-line type vocal-tract synthesizer 104. The component values of inductors L and capacitor C are controlled in response to vocal-tract area signal A A synthesizer T-section of this type and apparatus for controlling the T-section components is described in greater detail in the Rosen patent cited above.
Current U is developed in T-section i in response to the application of glottal excitation signal U (FIG. 1) to vocal-tract synthesizer 104. Thus, the resultant current U flowing within section i is sampled at 201. Current U normally comprises a DC component and higher spectral components. Generally, the DC. component is several times greater in magnitude than the higher spectral components. Since the magnitude of the noise signal to be inserted in the synthesizer section and the value of the internal impedance of the noise source are dependent upon the magnitude of U,, the system may tend to be unstable. Thus, further to stabilize the system, U is filtered via low-pass filter 202 to eliminate the higher spectral components, which are generally responsible for such instability. Moreover, filter 202 minimizes the possibility of feeding back high spectral components that are inserted into the synthesizer by noise source 204, or by the noise sources of any of the other sections of the synthesizer, thereby further insuring stability of the system. Experience indicates that a low-pass filter having a cutoff frequency of approximately 500 Hz. is sufiicient for this purpose. After filtering, current signal U is applied to full wave rectifier circuit 203 to obtain its absolute value ]U,[. Obviously, any other arrangement for obtaining the absolute value of U may be utilized as desired.
A signal representative of [U is thereafter applied to controllable noise source 204, to be discussed in greater detail in connection with FIG. 3, and to impedance control generator 205, to be discussed in connection With FIG. 4. Area control signal A is also applied to controllable noise source 204 and to impedance control generator 205. Signal A, is utilized in conjunction with a signal representative of |U to generate the desired signal P representative of noise, and to generate signals for controlling the resistive values of impedances R l and R Noise signal P; is selectively inserted into synthesizer section 1' via transformer 206. Thus, the noise signal inserted and the resistive values of R and R cooperate to vary the fiow of current U in the synthesizer section, thus to generate natural-sounding speech sounds.
FIG. 3 shows in block schematic form details of controllable noise source 204. Signals representative of A, and {U 1 are supplied, in accordance with the invention, to controllable noise source 204 for generating a noise signal P in accordance with Equation 5. Accordingly, |U is squared in squaring network 301, and thereafter applied to one input of dividing network 302. A is applied to another input of dividing network 302. The output from divider 302 is applied to amplifier 303, where its amplitude is adjusted. The output from amplifier 303 is thereafter applied to control the gain of variable gain amplifier 304. A noise signal generated in noise source 305, which may be, for example, a gas tube, is applied to the input of variable gain amplifier 304. The output signal P of amplifier 304 is applied to transformer 206 for insertion into the synthesizer section.
A block schematic diagram of impedance control generator 205 is shown in FIG. 4. Signals A, and [U are supplied to generator 205, wherein they are utilized to generate signals for controlling the impedance values of R and R in accordance with Equation 6 and 7, respectively. Thus, to generate a signal for controlling the impedance value of R A, is applied to squaring network 401. The output of squaring network of 401, which represents A is applied to one input of dividing network 402. A signal representative of [U is applied to a second input of dividing network 402. The output signal, (K [U ]A,- from dividing network 402 is representative of Equation 6 and is applied to variable resistor 207, which may be either a rheostat driven by a servomechanism, an appropriately biased field effect transistor, or the like. Signal A is also applied for cubing to network 403, where it is converted to A Thereafter A is divided and appropriately scaled in network 404. The output from network 404 is representative of Equation 7. This signal, (K4A1 3), is applied to variable resistor 208.
The above-described arrangements are, of course, merely illustrative of the application of the principles of this invention. Numerous other arrangements may be devised by those skilled in the art without departing from the spirit or scope of the invention.
What is claimed is:
1. In a section of a vocal-tract synthesizer having a 10 nal having a magnitude controlled in accordance with a preestablished relationship with said control signals; and
means for inserting said energy signal into said circuit path of said synthesizer section, thereby to develop voiceless excitation.
2. Apparatus as defined in claim 1, wherein said control signal generator includes:
means responsive to said fiow signal for generating a signal representative of the magnitude of said flow signal squared; and
means responsive to said squared signal and said area function signal for generating a signal representative of the magnitude of random energy to be inserted into said circuit path of said synthesizer section.
3. Apparatus as defined in claim 1, further including controllable impedance elements connected in series with said inserting means, said impedance elements being responsive to others of said control signals for varying the magnitude of the impedance in said circuit path of said synthesizer section in accordance with a preestablished function of said control signals.
4. Apparatus as defined in claim 3, wherein said control signal generator includes:
means responsive to said local-tract area function signal for generating a signal to control the impedance value of one of said impedance elements; and
means responsive to the magnitude of said volume velocity flow signal and to said area function signal for generating a signal for controlling the impedance value of another of said impedance elements.
5. In an analog vocal-tract synthesizer having a plurality of sections, each responsive to an individual vocaltract area function signal, apparatus for developing voiceless excitation in each of said sections, which comprises:
a controllable source of noise energy having variable internal impedance elements; means for generating a signal representative of the magnitude of a current developed in a circuit path of said synthesizer section, said current signal being representative of the volume velocity flow in said synthesizer section; means responsive to said vocal-tract area function signal and to said current magnitude signal for generating signals for controlling said noise source; and
means for inserting a signal developed by said noise source into said circuit path of said synthesizer section, thereby to develop voiceless excitation.
6. Apparatus as defined in claim 5, wherein said current magnitude includes low-pass filter means for eliminating selective frequency components from said current signal; and wherein said control signal means includes means responsive to the magnitude of said current for generating a signal representative of the magnitude of said current signal squared, and means responsive to said squared signal and to said area function signal for generating a signal representative of Reynolds number squared. 7O 7. Apparatus as defined in claim 5, further including controllable impedance elements representative of said variable internal impedance being connected in said circuit path of said synthesizer section; and
means responsive to said vocal-tract area function signal and to said current magnitude signal for generating signals for controlling said impedance elements signal for controlling the impedance value of anto vary selectively the magnitude of the impedance other of said impedance elements. in said circuit path.
8. Apparatus as defined in claim 7, wherein said im- References Cited pedance control signal generator includes: 5 UNITED STATES PATENTS means responsive to said area funct1on signal for gen- 3,042,748 7/1962 Rosen 179 1 eratmg a signal for controlling the impedance value a of one of said impedance elements; and KATHLEEN H CLAFFY, Primary Examiner means responsive to the magnitude of said current signal and to said area function signal for generating a 10 OLMS Assistant Exammer
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US72493168A | 1968-04-29 | 1968-04-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
US3542955A true US3542955A (en) | 1970-11-24 |
Family
ID=24912471
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US724931A Expired - Lifetime US3542955A (en) | 1968-04-29 | 1968-04-29 | Automatic generation of voiceless excitation in a vocal-tract synthesizer |
Country Status (1)
Country | Link |
---|---|
US (1) | US3542955A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4170719A (en) * | 1978-06-14 | 1979-10-09 | Bell Telephone Laboratories, Incorporated | Speech transmission system |
US5528726A (en) * | 1992-01-27 | 1996-06-18 | The Board Of Trustees Of The Leland Stanford Junior University | Digital waveguide speech synthesis system and method |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3042748A (en) * | 1958-08-25 | 1962-07-03 | Rosen George | Dynamic analog speech synthesizer |
-
1968
- 1968-04-29 US US724931A patent/US3542955A/en not_active Expired - Lifetime
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3042748A (en) * | 1958-08-25 | 1962-07-03 | Rosen George | Dynamic analog speech synthesizer |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4170719A (en) * | 1978-06-14 | 1979-10-09 | Bell Telephone Laboratories, Incorporated | Speech transmission system |
US5528726A (en) * | 1992-01-27 | 1996-06-18 | The Board Of Trustees Of The Leland Stanford Junior University | Digital waveguide speech synthesis system and method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Flanagan et al. | Excitation of Vocal‐Tract Synthesizers | |
Flanagan et al. | Synthesis of speech from a dynamic model of the vocal cords and vocal tract | |
Macon et al. | A singing voice synthesis system based on sinusoidal modeling | |
Macon et al. | Concatenation-based midi-to-singing voice synthesis | |
US3542955A (en) | Automatic generation of voiceless excitation in a vocal-tract synthesizer | |
Guérin et al. | A voice source taking account of coupling with the supraglottal cavities | |
Elie et al. | A glottal chink model for the synthesis of voiced fricatives | |
US2824906A (en) | Transmission and reconstruction of artificial speech | |
Kasuya et al. | Joint estimation of voice source and vocal tract parameters as applied to the study of voice source dynamics | |
Oliveira | Estimation of source parameters by frequency analysis. | |
Rothenberg et al. | A three-parameter voice source for speech synthesis | |
AU769036B2 (en) | Device and method for digital voice processing | |
US2339465A (en) | System for the artificial production of vocal or other sounds | |
Estes et al. | Speech synthesis from stored data | |
US3268660A (en) | Synthesis of artificial speech | |
Flanagan et al. | Speech synthesis | |
Sondhi | Articulatory modeling: a possible role in concatenative text-to-speech synthesis | |
US3549807A (en) | Voiced fricative synthesizer | |
Gutiérrez-Arriola et al. | A new multi-speaker formant synthesizer that applies voice conversion techniques | |
JPH09152892A (en) | Voice signal deformation connection method | |
US3042748A (en) | Dynamic analog speech synthesizer | |
Crystal | Model of larynx activity during phonation | |
Palmer et al. | Dynamic voice source synthesis | |
Childers et al. | Articulatory synthesis: nasal sounds and male and female voices | |
JPS58168097A (en) | Voice synthesizer |