High-Efficiency Advanced Audio Coding

Not to be confused with MPEG-4 SLS, which is branded HD-AAC.
High-Efficiency Advanced Audio Coding
Internet media type audio/aacp, audio/3gpp, audio/3gpp2
Developed by ISO
Type of format Audio compression format
Contained by 3GP, MP4, .dvb, etc.
Extended from AAC
Standard(s) ISO/IEC 14496-3

High-Efficiency Advanced Audio Coding (HE-AAC) is a lossy data compression scheme for digital audio defined as a MPEG-4 Audio profile in ISO/IEC 14496-3. It is an extension of Low Complexity AAC (AAC LC) optimized for low-bitrate applications such as streaming audio. HE-AAC version 1 profile (HE-AAC v1) uses spectral band replication (SBR) to enhance the compression efficiency in the frequency domain. HE-AAC version 2 profile (HE-AAC v2) couples SBR with Parametric Stereo (PS) to enhance the compression efficiency of stereo signals. It is a standardized and improved version of the AACplus codec.

HE-AAC is used in digital radio standards like DAB+ and Digital Radio Mondiale.

Contents

History

The progenitor of HE-AAC was developed by Coding Technologies under the trade name CT-aacPlus. CT-aacPlus combined MPEG-2 AAC LC with the Coding Technologies invented spectral band replication (SBR). CT-aacPlus is the codec used by XM Radio for their satellite radio service. Subsequently, Coding Technologies submitted their SBR to MPEG as a basis of HE-AAC.

HE-AAC version 1 was standardized as a profile of MPEG-4 Audio in 2003 by MPEG and published as part of MPEG-4 in document ISO/IEC 14496-3:2001/Amd 1:2003.[3] The HE-AAC version 2 profile was standardized in 2006 as ISO/IEC 14496-3:2005/Amd 2:2006.[1][4] A parametric stereo coding tool used in HE-AAC v2 was standardized in 2004 by MPEG and published in document ISO/IEC 14496-3:2001/Amd 2:2004.[5]

HE-AAC version 2 was also standardized under the name Enhanced aacPlus by 3GPP for 3G UMTS multimedia services in September 2004 (3GPP TS 26.401). It is based on the AAC LC, SBR and Parametric Stereo coding tools defined in the MPEG-4 Audio standard (and it refers to ISO/IEC 14496-3:2001/Amd.1:2003 - Bandwidth Extension, ISO/IEC 14496-3:2001/Amd.2:2004 - Parametric Coding for High Quality Audio, ISO/IEC 14496-3:2001 and ISO/IEC 14496-3:2001/Amd.1:2003/DCOR1). In addition it includes further tools such as error concealment, spline resampler, and stereo-to-mono downmix.[6] It was also published by ETSI as TS 126 401 V6.1.0 in December 2004.[7]

Prior to the standardization of HE AAC v2 by MPEG, Coding Technologies submitted the combination of HE-AAC v1 coupled with Parametric Stereo under the name Enhanced AAC+. As a result, aacPlus v2 and eAAC+ are now common trade names that refer to HE-AAC v2. AAC+ and aacPlus are common trade names that refer to HE-AAC v1.

Perceived quality

Scientific testing by the European Broadcasting Union has indicated that HE-AAC at 48 kbit/s was ranked as "Excellent" quality using the MUSHRA scale.[8] MP3 in the same testing received a score less than half that of HE-AAC and was ranked "Poor" using the MUSHRA scale. Data from this testing also indicated that some individuals confused 48 kbit/s encoded material with an uncompressed original.

Testing indicates that material decoded from 64 kbit/s HE-AAC does not yet have similar audio quality to material decoded from MP3 at 128 kbit/s using high quality encoders.[9][10][11][12] The test, taking bitrate distribution and RMSD into account, is a tie between MP3pro, HE AAC and Ogg Vorbis.

Further controlled testing by 3GPP during their revision 6 specification process indicates that HE-AAC and its derivative MPEG-4 HE-AAC v2 provide "Good" audio quality for music at low bit rates (e.g. 24 kbit/s).

MPEG-2 and MPEG-4 AAC LC decoders without SBR support will decode the AAC LC part of the audio, resulting in audio output with only half the sampling frequency, thereby reducing the audio bandwidth. This usually results in the high-end, or treble, portion of the audio signal missing from the audio product.

Support

Encoding

Sony supports HE-AAC encoding since SonicStage version 4. iTunes 9 supports HE-AAC encoding and playback.[13][14] Nero has released a free-of-charge command line HE-AAC encoder,[15] and also supports HE-AAC inside the Nero software suite. Sorenson Media’s Squeeze Compression Suite includes an HE-AACv1 encoder and is available for Mac OS X as well as Windows. The 3GPP consortium released source code of a reference HE-AACv2 encoder that appears to offer competitive quality.[16] Winamp Pro also supports ripping music to HE-AAC. Using a transcoding plugin for Winamp's media library, any file can be transcoded to HE-AAC.[17] XLD, an OS X audio encoding program, offers encoding from any of its supported formats to HE-AAC. Nokia PC Suite may encode audiofiles to eAAC+ format before transmitting them to mobile phone.

Decoding

HE-AAC is supported in the open source FAAD/FAAD2 decoding library (and all players incorporating it): VLC media player, Winamp, foobar2000, Audacious Media Player, and Sony's latest SonicStage version 4. HE-AAC is also used by AOL Radio and Pandora Radio clients to deliver high-fidelity music at low bitrates.

Adobe's Flash Player 9 supports HE-AACv2.[18]

iTunes 9.2 and iOS4 include full decoding of HE-AAC v2 parametric stereo streams. iTunes 9 thru 9.1, iPhone OS 3.1 and Fall 2009 iPods have support for HE-AAC playback for version 1 with no parametric stereo. Other third-party software featuring support for HE-AAC include FStream, VLC (without metadata/title streaming), Songbird, Snowtape, and recent development versions of Audacious Media Player.

Older versions of Apple iTunes, iPod Touch, and iPhone will play HE-AAC files at reduced fidelity because they ignore the spectral-band replication and parametric stereo information, instead playing them as though they were standard AAC-LC files without the high-frequency, or "treble," information that is only present in the SBR part of the signal. These will report the track length as twice its actual length.

Dolby released Dolby Pulse decoders and encoders in September 2008. HE-AAC v2 is the core of Dolby Pulse so files and streams encoded in Dolby Pulse will playback on AAC, HE-AAC v1 and v2 decoders. Conversely files and streams encoded in AAC, HE-AAC v1 or v2 will playback on Dolby Pulse decoders.

Dolby Pulse provides the following additional capabilities beyond HE-AAC v2:

Dolby has additionally released a PC decoder as an SDK suitable for integration into PC applications requiring Dolby Pulse, HE-AAC or AAC playback capabilities.

Promotion aspects

Commercial trademarks and labeling

HE-AAC is marketed under the trademark aacPlus(tm) by Coding Technologies and under the trademark Nero Digital(tm) by Nero AG. Sony Ericsson, Nokia and Samsung use AAC+ to label support for HE AAC v1 and eAAC+ to label support for HE-AAC v2 on their phones. Motorola uses AAC+ to indicate HE AAC v1 and "AAC+ Enhanced" to indicate HE AAC v2.

Licensing and patents

Companies holding patents for HE AAC have formed a patent pool administered by Via Licensing Corporation [19] to provide a single point of license for product makers.

Patent licenses are required for end-product companies that make hardware or software products that include HE AAC encoders and/or decoders.[20] Unlike the MP3 format,[21] content owners are not required to pay license fees to distribute content in HE AAC.

Standards

HE-AAC profile was first standardized in ISO/IEC 14496-3:2001/Amd 1:2003.[3] HE-AAC v2 profile (HE-AAC with Parametric Stereo) was first specified in ISO/IEC 14496-3:2005/Amd 2:2006.[1][4][22] The Parametric Stereo coding tool used by HE-AAC v2 was standardized in 2004 and published as ISO/IEC 14496-3:2001/Amd 2:2004.[5][6]

The current version of the MPEG-4 Audio (including HE-AAC standards) is published in ISO/IEC 14496-3:2009.

Enhanced aacPlus is required audio compression format in 3GPP technical specifications for 3G UMTS multimedia services and should be supported in IP Multimedia Subsystem (IMS), Multimedia Messaging Service (MMS), Multimedia Broadcast/Multicast Service (MBMS) and Transparent end-to-end Packet-switched Streaming Service (PSS).[23][24][25][26] HE-AAC version 2 was standardized under the name Enhanced aacPlus by 3GPP for 3G UMTS multimedia services in September 2004 (3GPP TS 26.401).[27]

HE-AAC and HE-AAC v2 audio coding for DVB applications is standardized by TS 101 154.[28][29] AacPlus v2 by Coding Technologies[30] is also standardized by the ETSI as TS 102 005 for Satellite services to Handheld devices (DVB-SH) below 3 GHz.

In December 2007, Brazil started broadcasting terrestrial DTV standard called International ISDB-Tb that implements video coding H.264 with audio AAC LC on main program (single or multi) and video H.264 with audio HE-AACv2 in the 1Seg mobile sub-program.

Versions

The following is the summary of the different versions of HE-AAC:

Version Common trade names Codec feature Standards
HE-AAC v1 aacPlus v1, eAAC, AAC+, CT-aacPlus AAC LC + SBR ISO/IEC 14496-3:2001/Amd 1:2003
HE-AAC v2 aacPlus v2, eAAC+, AAC++, Enhanced AAC+ AAC LC + SBR + PS ISO/IEC 14496-3:2005/Amd 2:2006

See also

References

  1. ^ a b c ISO/IEC JTC1/SC29/WG11/N7016 (2005-01-11) (DOC), Text of ISO/IEC 14496-3:2001/FPDAM 4, Audio Lossless Coding (ALS), new audio profiles and BSAC extensions, http://www.itscj.ipsj.or.jp/sc29/open/29view/29n6475t.doc, retrieved 2009-10-09 
  2. ^ Fraunhofer IIS (PDF), MPEG-4 Audio and Video Technology, http://www.itu.int/ITU-R/study-groups/seminars/rsg6-eng/docs/(7)%20Nikolaus%20Rettelbach.pdf, retrieved 2009-10-15 
  3. ^ a b ISO (2003). "Bandwidth extension, ISO/IEC 14496-3:2001/Amd 1:2003". ISO. http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=38148. Retrieved 2009-10-13. 
  4. ^ a b ISO (2006). "Audio Lossless Coding (ALS), new audio profiles and BSAC extensions, ISO/IEC 14496-3:2005/Amd 2:2006". ISO. http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=43026. Retrieved 2009-10-13. 
  5. ^ a b ISO (2004). "Parametric coding for high-quality audio, ISO/IEC 14496-3:2001/Amd 2:2004". ISO. http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=39382. Retrieved 2009-10-13. 
  6. ^ a b 3GPP (2004-09-30). "3GPP TS 26.401 V6.0.0 (2004-09), General Audio Codec audio processing functions; Enhanced aacPlus General Audio Codec; General Description (Release 6)" (DOC). 3GPP. http://www.3gpp.org/ftp/Specs/archive/26_series/26.401/26401-600.zip. Retrieved 2009-10-13. 
  7. ^ 3GPP (2005-01-04). "ETSI TS 126 401 V6.1.0 (2004-12) - Universal Mobile Telecommunications System (UMTS); General audio codec audio processing functions; Enhanced aacPlus general audio codec; General description (3GPP TS 26.401 version 6.1.0 Release 6)". 3GPP. http://webapp.etsi.org/workprogram/Report_WorkItem.asp?wki_id=21806. Retrieved 2009-10-13. 
  8. ^ http://tech.ebu.ch/docs/techreview/trev_305-moser.pdf
  9. ^ Results of 64kbit/s Listening Test
  10. ^ Multiformat Listening Test @ 48 kbps - FINISHED - Hydrogenaudio Forums
  11. ^ 80 kbps personal listening test (summer 2005) - Hydrogenaudio Forums
  12. ^ - MP3 - WMA - AAC - OGG - qualité à 96 kbps (évaluation) - Traitement Audio - Video & Son - FORUM HardWare.fr
  13. ^ http://www.apple.com/itunes/features/#importing
  14. ^ http://www.apple.com/itunes/features/#radio
  15. ^ "Nero AAC Codec". http://www.nero.com/ena/technologies-aac-codec.html. Retrieved 2009-11-23. 
  16. ^ Bouvigne, Gabriel (2006-03-20). "48kbps AAC public test results". MP3'Tech. http://www.mp3-tech.org/tests/aac_48/results.html. Retrieved 2008-09-05. 
  17. ^ Winamp Transcoder
  18. ^ Adobe HD video: Supported technologies
  19. ^ Via Licensing Corporation
  20. ^ AAC Licensing FAQ
  21. ^ Thomson. "Thomson/FhG MP3 Licensing". http://www.mp3licensing.com/royalty/. 
  22. ^ Mihir Mody (2005-06-06). "Audio compression gets better and more complex". Embedded.com. http://www.embedded.com/columns/specialreports/164300682?_requestid=60812. Retrieved 2009-10-13. 
  23. ^ ETSI (2009-04) ETSI TS 126 234 V8.2.0 (2009-04); 3GPP TS 26.234; Transparent end-to-end Packet-switched Streaming Service (PSS); Protocols and codecs Page 58. Retrieved on 2009-06-02.
  24. ^ ETSI (2009-01) ETSI TS 126 140 V8.0.0 (2009-01); 3GPP TS 26.140; Multimedia Messaging Service (MMS); Media formats and codes Page 11. Retrieved on 2009-06-02.
  25. ^ ETSI (2009-01) ETSI TS 126 141 V8.0.0 (2009-01); 3GPP TS 26.141; IP Multimedia System (IMS) Messaging and Presence; Media formats and codecs Page 10. Retrieved on 2009-06-02.
  26. ^ 3GPP (2009). "ETSI TS 126 346 V8.3.0 (2009-06); 3GPP TS 26.346; Multimedia Broadcast/Multicast Service (MBMS); Protocols and codecs". ETSI. p. 85. http://www.3gpp.org/ftp/Specs/html-info/26346.htm. Retrieved 2009-10-13. 
  27. ^ 3GPP (2004). "3GPP TS 26.401 - General audio codec audio processing functions; Enhanced aacPlus general audio codec; General description". 3GPP. http://www.3gpp.org/ftp/Specs/html-info/26401.htm. Retrieved 2009-10-13. 
  28. ^ ETSI TS 101 154 v1.5.1: Specification for the use of Video and Audio Coding in Broadcasting Applications based on the MPEG-2 Transport Stream
  29. ^ ETSI (2009-03-31). "TS 101 154 version 1.9.1 - Digital Video Broadcasting (DVB); Specification for the use of Video and Audio Coding in Broadcasting Applications based on the MPEG-2 Transport Stream". ETSI. http://webapp.etsi.org/workprogram/Report_WorkItem.asp?wki_id=30945. Retrieved 2009-10-13. 
  30. ^ http://www.codingtechnologies.com/products/assets/CT_aacPlus_whitepaper.pdf

External links