MPEG-1 Audio Layer II

MPEG-1 Audio Layer 2
File name extension: .mp2
Internet media type: audio/mpeg
Type of format: Audio

MPEG-1 Audio Layer II (MP2, sometimes incorrectly called Musicam) [1] is an audio codec defined by ISO/IEC 11172-3. While MP3 is much more popular for PC and internet applications, MP2 remains a dominant standard for audio broadcasting.

History of development from MP2 to MP3

See also: MP3#History

MP2 was originally designed for digital radio and TV broadcasting (DAB, DMB, DVB) and for use on Video CD, [2] but once standardized by ISO, MPEG Audio was further promoted due to its Layer III (MP3) component. The standard was finalized around 1992, at the same time that the internet was becoming widely used and widely available.

Some important (mostly undocumented) events in the development of MP2 stand out.

  • MP2 (as a psychoacoustic compression algorithm) was refined extensively, especially with regard to audio samples of the glockenspiel (an instrument related to the xylophone). It has been shown to be particularly efficient on high-quality percussive sounds (impulses) thanks to the very efficient time-domain concealment characteristics of its polyphase filter bank. Testing has shown MP2 to be equivalent or superior to much more recent audio codecs, such as Dolby Digital AC-3. [3]
  • MP2 audio remains the preeminent lossy audio coding standard due to its especially high coding performance on highly critical audio material such as triangle, glockenspiel, castanets, symphonic orchestra, and male and female voices. Subjective audio testing by professional experts under very critical listening conditions has shown MP2 to offer transparent audio compression at 256 kbit/s for 16-bit audio. [4]
  • MP2 is based on perceptual coding, using a 32-sub-band filter bank and a model of the human auditory system, either monaural or binaural (the latter in joint stereo with intensity stereo), for encoding.
  • It took some 9 months and one extra layer of codec complexity to turn MP2 into the well-known MP3 format, through the introduction of complementary signal processing tools such as an additional MDCT, entropy coding and joint stereo mode (stereo intensity).
  • Newer audio codecs are still affected by the same fundamental problem in the codec model that the triangle, kabuki, glockenspiel and crysaglott revealed: signals with complex impulses and high-energy transients are poorly reproduced.
    This is the fundamental difference between MP2 and subsequent audio codecs, which put much less focus on time-domain-critical audio sequences (more typical in classical music).
  • MP2 was proposed by the Advanced Television Research Consortium as a candidate for audio coding for the US digital TV standard, ATSC. This proposal existed in the drafts of the ATSC standard but not in the final release version.
    MP2 subsequently lost in the DTV "Grand Alliance" shootout to Dolby AC-3.

Technical Specifications

MPEG-1 Layer II is defined in ISO/IEC 11172-3:

  • Sampling rates: 32, 44.1 and 48 kHz
  • Bitrates: 32, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320 and 384 kbit/s

An extension has been provided in MPEG-2 Layer II and is defined in ISO/IEC 13818-3:

  • Additional sampling rates: 16, 22.05 and 24 kHz
  • Additional bitrates: 8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144 and 160 kbit/s

The format is based on successive digital frames of 1152 sampling intervals with four possible formats:

  • mono format
  • stereo format
  • joint stereo format (stereo irrelevance)
  • dual channel (uncorrelated) format
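
The parameters listed above are all signalled in the 4-byte header that starts each frame. The following Python sketch is an illustration only: it assumes the commonly documented MPEG audio header bit layout and index tables, and the names `Layer2Header` and `parse_layer2_header` are hypothetical, not taken from the standard or from any library. It decodes the bitrate, sampling rate and channel mode, then derives the frame length from the 1152-sample frame size.

```python
from dataclasses import dataclass

SAMPLES_PER_FRAME = 1152  # samples per Layer II frame, as stated above

# Bitrates in kbit/s for header bitrate index 1..14 (index 0 and 15 are rejected here).
MPEG1_L2_KBPS = [32, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320, 384]
MPEG2_L2_KBPS = [8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160]

# Sampling rates in Hz for header sampling-frequency index 0..2 (3 is reserved).
MPEG1_RATES = [44100, 48000, 32000]
MPEG2_RATES = [22050, 24000, 16000]

CHANNEL_MODES = ["stereo", "joint stereo", "dual channel", "mono"]


@dataclass
class Layer2Header:
    mpeg_version: int      # 1 for MPEG-1, 2 for MPEG-2
    bitrate_kbps: int
    sample_rate_hz: int
    padding: int           # 0 or 1 extra byte in this frame
    channel_mode: str

    @property
    def frame_bytes(self) -> int:
        # 1152 samples / 8 bits per byte = 144, so bytes = 144 * bitrate / rate.
        return 144 * self.bitrate_kbps * 1000 // self.sample_rate_hz + self.padding

    @property
    def frame_seconds(self) -> float:
        return SAMPLES_PER_FRAME / self.sample_rate_hz


def parse_layer2_header(data: bytes) -> Layer2Header:
    """Decode the first 4 bytes of a Layer II frame (assumed bit layout)."""
    if len(data) < 4:
        raise ValueError("need at least 4 bytes")
    word = int.from_bytes(data[:4], "big")
    if word >> 21 != 0x7FF:
        raise ValueError("frame sync not found")
    version_bits = (word >> 19) & 0b11   # 0b11 = MPEG-1, 0b10 = MPEG-2
    layer_bits = (word >> 17) & 0b11     # 0b10 = Layer II
    bitrate_index = (word >> 12) & 0b1111
    rate_index = (word >> 10) & 0b11
    padding = (word >> 9) & 0b1
    mode_bits = (word >> 6) & 0b11

    if layer_bits != 0b10:
        raise ValueError("not a Layer II frame")
    if version_bits == 0b11:
        version, kbps_table, rates = 1, MPEG1_L2_KBPS, MPEG1_RATES
    elif version_bits == 0b10:
        version, kbps_table, rates = 2, MPEG2_L2_KBPS, MPEG2_RATES
    else:
        raise ValueError("unsupported MPEG version field")
    if not 1 <= bitrate_index <= 14:
        raise ValueError("free-format or invalid bitrate index")
    if rate_index == 3:
        raise ValueError("reserved sampling-rate index")

    return Layer2Header(
        mpeg_version=version,
        bitrate_kbps=kbps_table[bitrate_index - 1],
        sample_rate_hz=rates[rate_index],
        padding=padding,
        channel_mode=CHANNEL_MODES[mode_bits],
    )


# Example: MPEG-1 Layer II, 192 kbit/s, 48 kHz, stereo, no padding.
header = parse_layer2_header(bytes([0xFF, 0xFD, 0xA4, 0x00]))
print(header.frame_bytes, header.frame_seconds)  # 576 0.024
```

At 48 kHz every frame therefore lasts 24 ms regardless of bitrate; only the number of bytes per frame changes. The sketch ignores the CRC, the mode extension bits and the restrictions on which bitrates may be combined with which channel modes.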

How the MP2 Codec works

  • MP2 is a sub-band audio encoder, which means that compression takes place in the time domain with a low-delay filter bank producing 32 frequency-domain components. By comparison, MP3 is a transform audio encoder with a hybrid filter bank, which means that compression takes place in the frequency domain after a hybrid (double) transformation from the time domain.
  • MPEG Audio Layer II is the core algorithm of the MP3 standard. All psychoacoustic characteristics and frame format structures of the MP3 codec are derived from the basic MP2 algorithm and format.
  • Depending on its encoding mode, the MP2 encoder may exploit inter-channel redundancies. In pure stereophonic mode it does not, which makes MP2 less efficient than MP3 at low bitrates (below 192 kbit/s). For example, audio encoded as 128 kbit/s MP3 usually sounds, to the human ear, truer to the original source than the same audio encoded as 192 kbit/s MP2. However, MP2 can reach encoding performance similar to that of MP3 in stereophonic mode thanks to its joint stereo coding mode, which removes stereo intensity irrelevance.
  • Like MP3, MP2 is a perceptual codec, which means that it removes information that the human auditory system will not be able to perceive. To choose which information to remove, the audio signal is analyzed according to a psychoacoustic model, which takes into account the parameters of the human auditory system. Research into psychoacoustics has shown that if there is a strong signal at a certain frequency, then weaker signals at nearby frequencies cannot be perceived by the human auditory system. This is called frequency masking. Perceptual audio codecs take advantage of frequency masking by discarding information at frequencies that are deemed to be imperceptible, thus allowing more data to be allocated to the reproduction of perceptible frequencies.
  • MP2 splits the input audio signal into 32 sub-bands, and if the audio in a sub-band is deemed to be imperceptible then that sub-band is not transmitted. MP3, on the other hand, transforms the input audio signal to the frequency domain, into 576 frequency components. MP3 therefore has a higher frequency resolution than MP2, which allows the psychoacoustic model to be applied more selectively and gives MP3 greater scope to reduce the bit rate.
  • The use of an additional entropy coding tool and this higher frequency resolution explain why MP3 does not need as high a bit rate as MP2 to reach acceptable audio quality. Conversely, MP2 behaves better than MP3 in the time domain: its lower frequency resolution implies less codec delay (simpler editing) and native robustness against digital recording and transmission errors.
  • Moreover, the MP2 sub-band filter bank provides an inherent transient concealment feature due to the specific temporal masking effect of its mother filter. This characteristic of the MPEG-1 Audio codec family gives very good sound quality on audio signals with rapid energy changes, such as percussive sounds, in both the MP2 and the MP3 codecs, which use the same basic sub-band filter bank.
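
The trade-off between frequency and time resolution described above can be put into numbers. The Python sketch below is a simplified illustration, not an encoder: it assumes a uniform split of the spectrum up to half the sampling rate, the function names are hypothetical, and the level and masking values in the usage example are made up rather than produced by a real psychoacoustic model.

```python
from typing import List, Tuple

SAMPLES_PER_FRAME = 1152


def resolution(sample_rate_hz: int, components: int) -> Tuple[float, float]:
    """Return (width of one component in Hz, input time covered per value in ms).

    Assumes the band from 0 Hz to half the sampling rate is split uniformly
    and that each component yields one value per `components` input samples.
    """
    band_width_hz = (sample_rate_hz / 2) / components
    time_step_ms = 1000 * components / sample_rate_hz
    return band_width_hz, time_step_ms


def transmitted_subbands(level_db: List[float], mask_db: List[float]) -> List[int]:
    """Toy selection rule: keep only sub-bands whose level exceeds the mask."""
    return [i for i, (level, mask) in enumerate(zip(level_db, mask_db)) if level > mask]


# 32 sub-bands (MP2) versus 576 frequency components (MP3) at 48 kHz.
print(resolution(48000, 32))    # 750 Hz wide, about 0.67 ms per sub-band sample
print(resolution(48000, 576))   # about 41.7 Hz wide, 12 ms per set of components

# Each MP2 frame carries 1152 / 32 = 36 samples per sub-band.
print(SAMPLES_PER_FRAME // 32)  # 36

# Hypothetical levels and masking thresholds (dB) for three sub-bands.
print(transmitted_subbands([60.0, 20.0, 45.0], [30.0, 35.0, 45.0]))  # [0]
```

The finer 41.7 Hz grid is what lets MP3 apply the masking model more selectively, while the 0.67 ms step of the 32-band split is what keeps transients well localized in time. A real encoder does not simply drop sub-bands; it distributes bits according to how far each sub-band's signal lies above its masking threshold.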

Applications of MP2

MP2 is part of the DAB digital radio and DVB digital television standards.

MP2 is used internally within the radio industry, for example in NPR's PRSS Content Depot programming distribution system.

All DVD-Video players in PAL countries contain stereo MP2 decoders, making MP2 a possible competitor to Dolby Digital in these markets. DVD-Video players in NTSC countries are not required to decode MP2 audio, although most do. While some DVD recorders store audio in MP2 and many consumer-authored DVDs use the format, commercial DVDs with MP2 soundtracks are rare.

MPEG-1 Layer II is the standard audio format used in the Video CD and Super Video CD formats (SVCD and CVD also support variable bitrate and MPEG Multichannel as added by MPEG-2).

MPEG-1 Layer II is the standard audio format used in the MHP standard for set-top boxes.

MPEG-1 Layer II is the audio format used in HDV camcorders.

Naming and Extensions

The term MP2 and the filename extension .mp2 usually refer to MPEG-1 Audio Layer II data, but can also refer to MPEG-2 Audio Layer II, a mostly backwards-compatible extension defined in ISO/IEC 13818-3 that adds support for multichannel audio, variable bitrate encoding, and additional sampling rates. The abbreviation MP2 is also sometimes erroneously applied to MPEG-2 video or MPEG-2 AAC audio.

References

  1. ^ http://www.chiariglione.org/MPEG/faq/mp1-aud/mp1-aud.htm#16
  2. ^ http://www.chiariglione.org/mpeg/meetings/kurihama89/kurihama_press.htm
  3. ^ Wustenhagen et al., Subjective Listening Test of Multi-channel Audio Codecs, AES 105th Convention Paper 4813, San Francisco, 1998
  4. ^ http://www.faqs.org/faqs/mpeg-faq/part1/ " For medium and high bitrates (120 kbps or more per channel), Layer-2 and Layer-3 scored rather similar, i.e. even trained listeners found it difficult to detect differences between original and reconstructed signal."
    "You can compress the same stereo program down to 256 kbit/s with no loss in discernible quality."
  • Genesis of the MP3 Audio Coding Standard by Hans Georg Musmann, in IEEE Transactions on Consumer Electronics, Vol. 52, No. 3, pp. 1043-1049, August 2006
  • MUSICAM Source Coding by Yves-François Dehery, AES 10th International Conference: Kensington, London, England, (7-9 Sept 1991), pp 71-79.