Vocoder Guide, Meaning , Facts, Information and Description
A vocoder (name derived from voice coder, formerly also called voder) is a speech analyser and synthesizer. It was originally developed as a speech coder for telecommunications applications in the 1930s, the idea being to code speech for transmission. Its primary use in this fashion is for secure radio communication, where voice has to be digitized, encrypted and then transmitted on a narrow, voice-bandwidth channel. The vocoder has also been used extensively as an electronic musical instrument.
| Table of contents |
|
2 Vocoder alogrithms 3 Musical applications |
How a vocoder works
The human voice consists of sounds generated by the opening and closing of the glottis by the vocal cords, which produces a periodic waveform. This basic sound is then modified by the nose and throat to produce differences in pitch in a controlled way, creating the wide variety of sounds used in speech. There are another set of sounds, known as the unvoiced and plosive sounds, which are not modified by the mouth in the same fashion.
The vocoder examines speech by finding this basic frequency, the fundamental frequency, and measuring how it is changed over time by recording someone speaking. This results in a series of numbers representing these modified frequencies at any particular time as the user speaks. In doing so, the vocoder dramatically reduces the amount of information needed to store speech, from a complete recording to a series of numbers. To recreate speech, the vocoder simply reverses the process, creating the fundamental frequency in an oscillator, then passing it into a modifier that changes the frequency based on the originally recorded series of numbers.
Of course, the actual qualities of speech cannot be reproduced this easily. In addition to a single fundamental frequency, the vocal system adds in a number of resonant frequencies that add character and quality to the voice, known as the formant. Without capturing these additional qualities, the vocoder will never sound "real".
In order to address this, most vocoder systems use what are effectively a number of vocoders, all tuned to different frequencies (using band-pass filters). The various values of these filters are stored not as the raw numbers, which are all based on the original fundamental frequency, but as a series of modifications to that fundamental needed to modify it into the signal seen in that filter. During playback these settings are sent back into the filters and then added together, modified with the knowledge that speech typically varies between these frequencies in a fairly linear way. The result is recognizable speech, although somewhat "mechanical" sounding. Vocoders also often include a second system for generating unvoiced sounds, using a noise generator instead of the fundamental frequency.
Even with the need to record several frequencies, and the additional unvoiced sounds, the compression of the vocoder system is impressive. Standard systems to record speech record a frequency from about 500Hz to 8kHz, where most of the frequencies used in speech lie, which requires 64kbit/s of bandwidth (due to Nyquist frequency). However a vocoder can provide a reasonably good simulation with as little as 2400 bit/s of bandwidth, a 26x improvement.
Vocoder alogrithms
Several vocoder algorithms are used in NSA encryption systems:
- LPC-10, FIPS Pub 137, 2400 bit/s, which uses linear predictive coding
- Code Excited Linear Prediction, (CELP), 2400 and 4800 bit/s, Federal Standard 1016, used in STU-III
- Continuously Variable Slope Delta-modulation (CVSD), 16 Kbit/s, used in wideband encryptors such as the KY-57.
- Adaptive Differential Pulse Code Modulation (ADPCM), ITU-T G.721, 32Kbit/s used in STE secure telephone
- Mixed Excitation Linear Prediction (MELP), MIL STD 3005, 2400 bit/s, used in the Future Narrowband Digital Terminal FNBDT, NSA's 21st century secure telephone.
Musical applications
For musical applications, a source of musical sounds is used as the oscillator, instead of extracting the fundamental frequency. For instance, one could use the sound of a guitar as the input to the filter bank, a technique that became popular in the 1970s.In 1970, electronic music pioneers Wendy Carlos and Robert Moog developed one of the first truly musical vocoders. A 10-band device inspired by the vocoder designs of Homer Dudley, it was originally called a spectrum encoder-decoder, and later referred to simply as a vocoder. The carrier signal came from Carlos' Moog modular synthesizer, and the modulator from a microphone input. The output of the 10-band vocoder was fairly intelligible, but relied on especially articulated speech.
Carlos' and Moog's vocoder was featured in several recordings, including the soundtrack to Stanley Kubrick's A Clockwork Orange, in which the vocoder sang the vocal part of Beethoven's Ninth Symphony. Also featured in the soundtrack was a piece called Timesteps, which featured the vocoder in two main sections. Originally, Timesteps was intended as merely an introduction to vocoders for the "timid listener", but Kubrick choose to include the piece on the soundtrack, much to the surprise of Wendy Carlos.
This is an Article on Vocoder. Page Contains Information, Facts Details or Explanation Guide About Vocoder
