Vocoder - Modern Vocoder Implementations

Modern Vocoder Implementations

Even with the need to record several frequencies, and the additional unvoiced sounds, the compression of the vocoder system is impressive. Standard speech-recording systems capture frequencies from about 500 Hz to 3400 Hz, where most of the frequencies used in speech lie, typically using a sampling rate of 8 kHz (slightly greater than the Nyquist rate). The sampling resolution is typically at least 12 or more bits per sample resolution (16 is standard), for a final data rate in the range of 96-128 kbit/s. However, a good vocoder can provide a reasonable good simulation of voice with as little as 2.4 kbit/s of data.

'Toll Quality' voice coders, such as ITU G.729, are used in many telephone networks. G.729 in particular has a final data rate of 8 kbit/s with superb voice quality. G.723 achieves slightly worse quality at data rates of 5.3 kbit/s and 6.4 kbit/s. Many voice systems use even lower data rates, but below 5 kbit/s voice quality begins to drop rapidly.

Several vocoder systems are used in NSA encryption systems:

LPC-10, FIPS Pub 137, 2400 bit/s, which uses linear predictive coding
Code-excited linear prediction (CELP), 2400 and 4800 bit/s, Federal Standard 1016, used in STU-III
Continuously variable slope delta modulation (CVSD), 16 kbit/s, used in wide band encryptors such as the KY-57.
Mixed-excitation linear prediction (MELP), MIL STD 3005, 2400 bit/s, used in the Future Narrowband Digital Terminal FNBDT, NSA's 21st century secure telephone.
Adaptive Differential Pulse Code Modulation (ADPCM), former ITU-T G.721, 32 kbit/s used in STE secure telephone

(ADPCM is not a proper vocoder but rather a waveform codec. ITU has gathered G.721 along with some other ADPCM codecs into G.726.)

Vocoders are also currently used in developing psychophysics, linguistics, computational neuroscience and cochlear implant research.

Modern vocoders that are used in communication equipment and in voice storage devices today are based on the following algorithms:

Algebraic code-excited linear prediction (ACELP 4.7 kbit/s – 24 kbit/s)
Mixed-excitation linear prediction (MELPe 2400, 1200 and 600 bit/s)
Multi-band excitation (AMBE 2000 bit/s – 9600 bit/s)
Sinusoidal-Pulsed Representation (SPR 300 bit/s – 4800 bit/s)
Tri-Wave Excited Linear Prediction (TWELP 600 bit/s – 9600 bit/s)

Famous quotes containing the word modern:

“As far as modern writing is concerned, it is rarely rewarding to translate it, although it might be easy.... Translation is very much like copying paintings.”
—Boris Pasternak (1890–1960)

Main Site Subjects