Matrix decoder
Encyclopedia
Matrix decoder is an audio technology where a finite number of discrete audio channels (e.g., 2) are decoded into a larger number of channels on play back (e.g., 5). The channels are generally, but not always, arranged for transmission or recording by an encoder
, and decoded for playback by a decoder
.
The function is to allow multichannel audio, such as quadraphonic sound or surround sound
to be encoded in a stereo signal, and thus played back as stereo on stereo equipment, and as surround on surround equipment – this is "compatible" multichannel audio.
2:4
Five discrete channels decoded to six channels would be notated:
5:6
Many matrix decoders take advantage of the Haas effect
, as well as audio cues inherent in the source channels.
The various encoding matrixes are described below.
phase-shift, phase-shift
This above is the basic "4/2" (4 channels in, 2 out) encoding equation and after 1972, wasn't much used by CBS and others... The basic SQ 4/2 code had mono/stereo anomalies as well as encoding/decoding problems - these were heavily criticized by Michael Gerzon and others, so CBS corrected them all with the SQ Position Encoder, the Forward-Oriented Encoder, the Backwards-Oriented Encoder and the "London Box."
Position Encoder: An N/2 encoder that encoded every position in a 360° circle - it had 16 inputs and each could be dialed to the exact direction desired, generating an optimized encode.
Forward-Oriented encoder:
phase-shift, phase-shift
Forward-Oriented encoder: Caused Center Back to be encoded as Center Front and was recommended for live broadcast use for maximum mono compatibility - it also encoded Center Left/Center Right and both diagonal splits in the optimal manner. Could be used to modify existing 2-channel stereo recordings and create 'synthesized SQ' that when played through a Full-Logic or Tate DES SQ decoder, exhibited a 180° or 270° synthesized quad effect. Many stereo FM radio stations broadcasting SQ in the 70's used their Forward-Oriented SQ encoder for this. For SQ decoders, CBS designed a circuit that produced the 270° enhancement using the 90° phase shifters in the decoder. Sansui's QS Encoders and QS Vario-Matrix Decoders had a similar capability.
Backwards-Oriented encoder:
phase-shift, phase-shift
Backwards-Oriented Encoder: Was the reverse of the Forward-Oriented Encoder - it allowed sounds to be placed optimally in the back half of the room, but mono-compatibility was sacrificed. When used with standard stereo recordings it created "extra wide" stereo with sounds outside the speakers.
Some encoding mixers had channel strips switchable between forward-oriented and backwards-oriented encodong.
London Box: Encoded Center Back in such a way that it didn't cancel in mono playback, thus its output was usually mixed with that of a Position Encoder or a Forward Oriented encoder. After 1972, the vast majority of SQ Encoded albums were mixed with either the Position Encoder or the Forward-Oriented encoder.
In addition, CBS created the SQ Ghent Microphone, which was a spatial microphone system using the Neumann QM-69 mic. The signals from the QM-69 were differenced, and then phase-matrixed into 2-channel SQ.
With the Ghent Microphone, SQ was transformed from a Matrix into a Kernel and an additional signal could be derived to provide N:3:4 performance.
In 1976, Ben Bauer integrated matrix and discrete systems into USQ, or Universal SQ. (others had done this with their quad systems too) It was a hierarchical 4-4-4 discrete matrix that used the SQ matrix as the baseband for discrete quadraphonic FM broadcasts using additional difference signals called "T" and "Q". For a USQ FM broadcast, the additional "T" modulation was placed at 38kHz in quadrature to the standard stereo difference signal and the "Q" modulation was placed on a carrier at 76kHz. For standard 2-channel SQ Matrix broadcasts, CBS recommended that an optional pilot-tone be placed at 19kHz in quadrature to the regular pilot-tone to indicate SQ encoded signals and activate the listeners Logic decoder. CBS argued that the SQ system should be selected as the standard for quadraphonic FM because, in FCC listening tests of the various four channel broadcast proposals, the 4:2:4 SQ system, decoded with a CBS Paramatrix decoder, outperformed 4:3:4 (without logic) as well as all other 4:2:4 (with logic) systems tested, approaching the performance of a discrete master tape within a very slight margin. At the same time, the SQ "fold" to stereo and mono was preferred to the stereo and mono "fold" of 4:4:4, 4:3:4 and all other 4:2:4 encoding systems.
The Directional Enhancement System, also known as the Tate DES, was an advanced decoder for SQ (although it could be made to work with any matrix or kernel system) and enhanced the directionality of the basic SQ matrix. It first matrixed the four outputs of the SQ decoder to derive additional signals, then compared their envelopes to detect the predominant direction and degree of dominance. A processor section, implemented outside of the Tate IC chips, applied variable attack/decay timing to the control signals and determined the coefficients of the "B" (Blend) matrices needed to enhance the directionality. These were acted upon by true analog multipliers in the Matrix Multiplier IC's, to multiply the incoming matrix by the "B" matrices and produce outputs in which the directionality of all predominant sounds were enhanced. Since the DES could recognize all three directions of the Energy Sphere simultaneously, and enhance the separation, it had a very open and 'discrete' sounding soundfield. In addition, the enhancement was done with sufficient additional complexity that all non-dominant sounds were kept at their proper levels. Dolby used the Tate DES IC's in their theater processors until around 1986, when they developed the Pro Logic system. Unfortunately, delays and problems kept the Tate DES IC's from the market until the late 70's and only two consumer decoders were ever made that employed them, the Audionics Space & Image Composer and the Fosgate Tate II 101A. The Fosgate used a faster, updated version of the IC, called the Tate II, and additional circuitry that provided for separation enhancement around the full 360 soundfield. Unlike the earlier Full Wave-matching Logic decoders for SQ, that varied the output levels to enhance directionality, the Tate DES cancelled SQ signal crosstalk as a function of the predominant directionality, keeping non-dominant sounds and reverberation in its proper spatial locations at at their correct level.
phase-shift, phase-shift
phase-shift, phase-shift
phase-shift, phase-shift
Note, Dolby MP Matrix is the name of the 4:2:4 encoding matrix. The term "Dolby Surround" refers to both the encoding and decoding in the home environment, while in the theater it is known Dolby Stereo.
phase-shift, phase-shift
The encoding matrix is the same as for Dolby Surround; only the Pro Logic decoder differs. There is no such thing as a Pro Logic encoder.
phase-shift, phase-shift
Encoder
An encoder is a device, circuit, transducer, software program, algorithm or person that converts information from one format or code to another, for the purposes of standardization, speed, secrecy, security, or saving space by shrinking size.-Media:...
, and decoded for playback by a decoder
Decoder
A decoder is a device which does the reverse operation of an encoder, undoing the encoding so that the original information can be retrieved. The same method used to encode is usually just reversed in order to decode...
.
The function is to allow multichannel audio, such as quadraphonic sound or surround sound
Surround sound
Surround sound encompasses a range of techniques such as for enriching the sound reproduction quality of an audio source with audio channels reproduced via additional, discrete speakers. Surround sound is characterized by a listener location or sweet spot where the audio effects work best, and...
to be encoded in a stereo signal, and thus played back as stereo on stereo equipment, and as surround on surround equipment – this is "compatible" multichannel audio.
Process
Matrix encoding does not allow one to encode several channels in fewer channels without losing information: one cannot fit 5 channels into 2 (or even 3 into 2) without losing information, as this loses dimensions: the decoded signals are not independent. The idea is rather to encode something that will both be an acceptable approximation of the surround sound when decoded, and acceptable (or even superior) stereo.Notation
The notation for matrix encoding consists of the number of discrete audio channels separated by a colon from the number of decoded channels. For example, two discrete channels decoded to four-channels would be notated:2:4
Five discrete channels decoded to six channels would be notated:
5:6
Many matrix decoders take advantage of the Haas effect
Haas effect
The Haas effect is a psychoacoustic effect, described in 1949 by Helmut Haas in his Ph.D. thesis. It is often equated with the underlying precedence effect .- Experiments and findings :...
, as well as audio cues inherent in the source channels.
The various encoding matrixes are described below.
SQ matrix encoding (2:4, "Stereo Quadraphonic", CBS SQ)
Encoding matrix | Left Front | Right Front | Left Back | Right Back |
---|---|---|---|---|
Left Total | 1.0 | 0.0 | k0.7 | 0.7 |
Right Total | 0.0 | 1.0 | -0.7 | j0.7 |
phase-shift, phase-shift
This above is the basic "4/2" (4 channels in, 2 out) encoding equation and after 1972, wasn't much used by CBS and others... The basic SQ 4/2 code had mono/stereo anomalies as well as encoding/decoding problems - these were heavily criticized by Michael Gerzon and others, so CBS corrected them all with the SQ Position Encoder, the Forward-Oriented Encoder, the Backwards-Oriented Encoder and the "London Box."
Position Encoder: An N/2 encoder that encoded every position in a 360° circle - it had 16 inputs and each could be dialed to the exact direction desired, generating an optimized encode.
Forward-Oriented encoder:
Encoding matrix | Left Front | Right Front | Left Back | Right Back |
---|---|---|---|---|
Left Total | 1.0 | 0.0 | 0.7 | k0.7 |
Right Total | 0.0 | 1.0 | k0.7 | 0.7 |
phase-shift, phase-shift
Forward-Oriented encoder: Caused Center Back to be encoded as Center Front and was recommended for live broadcast use for maximum mono compatibility - it also encoded Center Left/Center Right and both diagonal splits in the optimal manner. Could be used to modify existing 2-channel stereo recordings and create 'synthesized SQ' that when played through a Full-Logic or Tate DES SQ decoder, exhibited a 180° or 270° synthesized quad effect. Many stereo FM radio stations broadcasting SQ in the 70's used their Forward-Oriented SQ encoder for this. For SQ decoders, CBS designed a circuit that produced the 270° enhancement using the 90° phase shifters in the decoder. Sansui's QS Encoders and QS Vario-Matrix Decoders had a similar capability.
Backwards-Oriented encoder:
Encoding matrix | Left Front | Right Front | Left Back | Right Back |
---|---|---|---|---|
Left Total | -1.0 | 0.0 | k0.7 | 0.7 |
Right Total | 0.0 | 1.0 | -0.7 | j0.7 |
phase-shift, phase-shift
Backwards-Oriented Encoder: Was the reverse of the Forward-Oriented Encoder - it allowed sounds to be placed optimally in the back half of the room, but mono-compatibility was sacrificed. When used with standard stereo recordings it created "extra wide" stereo with sounds outside the speakers.
Some encoding mixers had channel strips switchable between forward-oriented and backwards-oriented encodong.
London Box: Encoded Center Back in such a way that it didn't cancel in mono playback, thus its output was usually mixed with that of a Position Encoder or a Forward Oriented encoder. After 1972, the vast majority of SQ Encoded albums were mixed with either the Position Encoder or the Forward-Oriented encoder.
In addition, CBS created the SQ Ghent Microphone, which was a spatial microphone system using the Neumann QM-69 mic. The signals from the QM-69 were differenced, and then phase-matrixed into 2-channel SQ.
With the Ghent Microphone, SQ was transformed from a Matrix into a Kernel and an additional signal could be derived to provide N:3:4 performance.
In 1976, Ben Bauer integrated matrix and discrete systems into USQ, or Universal SQ. (others had done this with their quad systems too) It was a hierarchical 4-4-4 discrete matrix that used the SQ matrix as the baseband for discrete quadraphonic FM broadcasts using additional difference signals called "T" and "Q". For a USQ FM broadcast, the additional "T" modulation was placed at 38kHz in quadrature to the standard stereo difference signal and the "Q" modulation was placed on a carrier at 76kHz. For standard 2-channel SQ Matrix broadcasts, CBS recommended that an optional pilot-tone be placed at 19kHz in quadrature to the regular pilot-tone to indicate SQ encoded signals and activate the listeners Logic decoder. CBS argued that the SQ system should be selected as the standard for quadraphonic FM because, in FCC listening tests of the various four channel broadcast proposals, the 4:2:4 SQ system, decoded with a CBS Paramatrix decoder, outperformed 4:3:4 (without logic) as well as all other 4:2:4 (with logic) systems tested, approaching the performance of a discrete master tape within a very slight margin. At the same time, the SQ "fold" to stereo and mono was preferred to the stereo and mono "fold" of 4:4:4, 4:3:4 and all other 4:2:4 encoding systems.
The Directional Enhancement System, also known as the Tate DES, was an advanced decoder for SQ (although it could be made to work with any matrix or kernel system) and enhanced the directionality of the basic SQ matrix. It first matrixed the four outputs of the SQ decoder to derive additional signals, then compared their envelopes to detect the predominant direction and degree of dominance. A processor section, implemented outside of the Tate IC chips, applied variable attack/decay timing to the control signals and determined the coefficients of the "B" (Blend) matrices needed to enhance the directionality. These were acted upon by true analog multipliers in the Matrix Multiplier IC's, to multiply the incoming matrix by the "B" matrices and produce outputs in which the directionality of all predominant sounds were enhanced. Since the DES could recognize all three directions of the Energy Sphere simultaneously, and enhance the separation, it had a very open and 'discrete' sounding soundfield. In addition, the enhancement was done with sufficient additional complexity that all non-dominant sounds were kept at their proper levels. Dolby used the Tate DES IC's in their theater processors until around 1986, when they developed the Pro Logic system. Unfortunately, delays and problems kept the Tate DES IC's from the market until the late 70's and only two consumer decoders were ever made that employed them, the Audionics Space & Image Composer and the Fosgate Tate II 101A. The Fosgate used a faster, updated version of the IC, called the Tate II, and additional circuitry that provided for separation enhancement around the full 360 soundfield. Unlike the earlier Full Wave-matching Logic decoders for SQ, that varied the output levels to enhance directionality, the Tate DES cancelled SQ signal crosstalk as a function of the predominant directionality, keeping non-dominant sounds and reverberation in its proper spatial locations at at their correct level.
QS matrix encoding (2:4, "Quadraphonic Sound", QS)
Encoding matrix | Left Front | Right Front | Left Back | Right Back |
---|---|---|---|---|
Left Total | 0.92 | 0.38 | j0.92 | j0.38 |
Right Total | 0.38 | 0.92 | k0.38 | k0.92 |
phase-shift, phase-shift
Ambisonic UHJ kernel encoding (2:3)
Encoding matrix | W (pressure signal) | X (front-back signal) | Y (left-right signal) |
---|---|---|---|
Left Total | 0.470 + k0.171 | 0.093 + j0.255 | +0.328 |
Right Total | 0.470 + j0.171 | 0.093 + k0.255 | -0.328 |
phase-shift, phase-shift
Dolby MP matrix encoding (2:4)
Encoding matrix | Left | Right | Center | Surround |
---|---|---|---|---|
Left Total | ||||
Right Total |
phase-shift, phase-shift
Note, Dolby MP Matrix is the name of the 4:2:4 encoding matrix. The term "Dolby Surround" refers to both the encoding and decoding in the home environment, while in the theater it is known Dolby Stereo.
Dolby Pro Logic matrix (2:4)
Encoding matrix | Left | Right | Center | Surround |
---|---|---|---|---|
Left Total | ||||
Right Total |
phase-shift, phase-shift
The encoding matrix is the same as for Dolby Surround; only the Pro Logic decoder differs. There is no such thing as a Pro Logic encoder.
Dolby Pro Logic II matrix encoding (2:5)
Encoding matrix | Left | Right | Center | Rear Left | Rear Right |
---|---|---|---|---|---|
Left Total | |||||
Right Total |
phase-shift, phase-shift
See also
- Ambisonic UHJ formatAmbisonic UHJ formatAmbisonic UHJ format is a development of the Ambisonic surround sound system designed to allow Ambisonic recordings to be carried by mono- and stereo-compatible media. It is a hierarchy of systems in which the recorded soundfield will be reproduced with a degree of accuracy that varies according to...
- Dolby DigitalDolby DigitalDolby Digital is the name for audio compression technologies developed by Dolby Laboratories. It was originally called Dolby Stereo Digital until 1994. Except for Dolby TrueHD, the audio compression is lossy. The first use of Dolby Digital was to provide digital sound in cinemas from 35mm film prints...
- Dolby Pro LogicDolby Pro LogicDolby Pro Logic is a surround sound processing technology developed by Dolby Laboratories, designed to decode soundtracks encoded with Dolby Surround. Dolby Stereo was originally developed by Dolby in 1976 for analog cinema sound systems...
- Haas effectHaas effectThe Haas effect is a psychoacoustic effect, described in 1949 by Helmut Haas in his Ph.D. thesis. It is often equated with the underlying precedence effect .- Experiments and findings :...
- Quadraphonic sound
- Surround soundSurround soundSurround sound encompasses a range of techniques such as for enriching the sound reproduction quality of an audio source with audio channels reproduced via additional, discrete speakers. Surround sound is characterized by a listener location or sweet spot where the audio effects work best, and...