Windows Media Audio

From Wikipedia, the free encyclopedia

Windows Media Audio
WMA Extension Icon
File name extension .wma
Internet media type audio/x-ms-wma
Uniform Type Identifier com.microsoft.windows-?media-wma
Developed by Microsoft
Type of format Audio file format

Windows Media Audio (WMA) is an audio data compression technology developed by Microsoft. The name can be used to refer to its audio file format or its audio codecs. It is a proprietary technology which forms part of the Windows Media framework. WMA consists of four distinct codecs. The original WMA codec, known simply as WMA, was conceived as a competitor to the popular MP3 and RealAudio codecs.[1][2] Today it is one of the most popular codecs, together with MP3 and MPEG-4 AAC. In 2003 it came second after MP3 in terms of standalone players supporting it.[3] WMA Pro, a newer and more advanced codec, supports multichannel and high resolution audio.[4] A lossless codec, WMA Lossless, compresses audio data without loss of audio fidelity.[4] And WMA Voice, targeted at voice content, applies compression using a range of low bit rates.[4]

Contents

[edit] Development history

The first WMA codec was based on the previous work from Henrique Malvar and his team.[5] According to the published article, the technology was transferred over to the Windows Media team at Microsoft.[5] Malvar was a senior researcher and manager of the Signal Processing Group at Microsoft Research,[6] whose team worked on the project called MSAudio.[7] The first finalized codec was at first referred as MSAudio 4.0.[8][9] It was later officially released as Windows Media Audio,[10] as part of Windows Media Technologies 4.0. Microsoft initially claimed that WMA delivers the same quality of MP3 at half the bit rate;[11] Microsoft also claimed that WMA delivers "CD-quality" audio at 64 kbit/s.[11] The former claim however was rejected by some audiophiles according to EDN.[12] RealNetworks also challenged Microsoft's claims regarding WMA's superior audio quality compared to RealAudio.[2]

Newer versions of WMA became available: Windows Media Audio 2 in 1999,[13] Windows Media Audio 7 in 2000,[14] Windows Media Audio 8 in 2001,[15] and Windows Media Audio 9 in 2003.[4] Microsoft first announced its plans to license WMA technology to third-parties in 1999.[16] Although earlier versions of Windows Media Player played WMA files, support for WMA file creation was not added until the seventh version.[17] In 2003, Microsoft released new audio codecs which were not compatible with the original WMA codec. These codecs were Windows Media Audio 9 Professional,[4] Windows Media Audio 9 Lossless,[4] and Windows Media Audio 9 Voice.[4]

[edit] Container format

A WMA file is in most circumstances encapsulated, or contained, in the Advanced Systems Format (ASF) container format,[18] featuring a single audio track in one of following codecs: WMA, WMA Pro, WMA Lossless, or WMA Voice. These codecs are technically distinct and mutually incompatible. The ASF container format specifies how metadata about the file is to be encoded, similar to the ID3 tags used by MP3 files. Metadata may include song name, track number, artist name, and also audio normalization values.

This container can optionally support digital rights management (DRM) using a combination of elliptic curve cryptography key exchange, DES block cipher, a custom block cipher, RC4 stream cipher and the SHA-1 hashing function.

[edit] Codecs

[edit] Windows Media Audio

Windows Media Audio (WMA) is the most common codec of the four WMA codecs. Colloquial usage of the term WMA, especially in marketing materials and device specifications, usually refers to this codec only. The first version of the codec released in 1999 is regarded as WMA 1. In the same year, the bit stream syntax, or compression algorithm, was altered in minor ways and became WMA 2.[13] Since then, newer versions of the codec were released, but the decoding process remained the same, ensuring compatibility between codec versions.[13] WMA is a lossy audio codec based on the study of psychoacoustics. Audio signals which are deemed to be imperceptible to the human ear encoded with reduced resolution during the compression process.

WMA can sample audio signals at up to 48000 times per second (48 kHz) with up to two discrete channels (stereo). WMA 9 introduced variable bit rate (VBR) and average bit rate (ABR) coding techniques into the MS encoder although both were technically supported by the original format,[13]. WMA 9.1 also added support for low-delay audio,[19] which reduces latency for encoding and decoding.

Fundamentally, WMA is a transform coder based on modified discrete cosine transform (MDCT), somewhat similar to AAC and Vorbis. The bit stream of WMA is composed of superframes, each containing 1 or more frames of 2048 samples. If the bit reservoir is not used, a frame is equal to a superframe. Each frame contains a number of blocks, which are 64, 128, 256, 512, 1024, or 2048 samples long after being transformed into the frequency domain via the MDCT. In the frequency domain, masking for the transformed samples is determined, and then used to requantize the samples. Finally, the floating point samples are decomposed into coefficient and exponent parts and independently huffman coded. Stereo information is typically mid/side coded. At low bit rates, line spectral pairs (typically less than 17 kbit/s) and a form of noise coding (typically less than 33 kbit/s) can also be used to improve quality.

Like AAC and Ogg Vorbis, WMA was intended to address perceived deficiencies in the MP3 standard. Given their common design goals, it's not surprising that the three formats ended up making similar design choices. All three are pure transform codecs. Furthermore the MDCT implementation used in WMA is essentially a superset of those used in Ogg and AAC such that WMA iMDCT and windowing routines can be used to decode AAC and Ogg Vorbis almost unmodified. However, quantization and stereo coding is handled differently in each codec. The primary distinguishing trait of the WMA Standard format is its unique use of 5 different block sizes, compared to MP3, AAC, and Ogg Vorbis which each restrict files to just two sizes.

WMA is one of the most popular audio codecs. Certified PlaysForSure devices, as well as a large number of uncertified devices, ranging from portable hand-held music players to set-top DVD players, support the playback of WMA files. Most PlaysForSure-certified online stores distribute content using this codec only. In 2005, Nokia announced its plans to support WMA playback in future Nokia handsets.[20] In the same year, an update was made available for the PlayStation Portable (version 2.60) which allowed WMA files to be played on the device for the first time.[21]

[edit] Windows Media Audio Professional

Screenshot of Windows Media Encoder 9 Series, displaying new encoding options for Windows Media Audio 10 Professional.
Screenshot of Windows Media Encoder 9 Series, displaying new encoding options for Windows Media Audio 10 Professional.

Windows Media Audio Professional (WMA Pro) is a newer and more advanced lossy audio codec. It is based on a new compression algorithm which is not only superior to WMA in terms of quality, efficiency, and features, but also scales well at low bit rates. Its main competitors include AAC, HE-AAC, Vorbis, Dolby Digital, and DTS. It can support audio resolutions of up to 96 kHz and 24-bit, for up to eight discrete channels (7.1 channel surround).[22] WMA Pro also supports dynamic range compression, which reduces the volume difference between the loudest and quietest sounds in the audio track. According to Microsoft's Amir Majidimehr, WMA Pro can technically go beyond 7.1 surround sound and support "an unlimited number of channels."[23]

The codec's bit stream syntax was frozen at the first version, WMA 9 Pro.[24] Later versions of WMA Pro introduced low-bit rate encoding, low-delay audio,[25] frequency interpolation mode,[26] and an expanded range of sampling rate and bit-depth encoding options. A WMA 10 Pro file compressed with frequency interpolation mode comprises a WMA 9 Pro track encoded at half the original sampling rate, which is then restored using a new compression algorithm.[26] In this situation, WMA 9 Pro players which have not been updated to the WMA 10 Pro codec can only decode the lower quality WMA 9 Pro stream. Starting with WMA 10 Pro, eight channel encoding starts at 128 kbit/s, and tracks can be encoded at the native audio CD resolution (44.1 kHz, 16-bit), previously the domain of WMA Standard.

Despite a growing number of supported devices and its superiority over WMA, WMA Pro still enjoys little hardware and software support. Some notable exceptions to this are the Microsoft Zune (limited to stereo),[27] Xbox 360,[28] Windows Mobile-powered devices with Windows Media Player 10 Mobile,[29] and newer Toshiba Gigabeat and Motorola devices.[30][31] In addition, WMA Pro is a requirement for the WMV HD certification program.[32] On the software side, Verizon utilizes WMA 10 Pro for its V CAST Music Service,[33] and Windows Media Player 11 has promoted the codec as an alternative to WMA for copying audio CD tracks.[34] However, the Microsoft Silverlight platform does not support WMA Pro.[35] In the absence of the appropriate audio hardware, WMA Pro can automatically downmix multichannel audio to stereo or mono, and 24-bit resolution to 16-bit during playback.

[edit] Windows Media Audio Lossless

Label for 5.1 surround sound, the maximum channel configuration for Windows Media Audio Lossless.
Label for 5.1 surround sound, the maximum channel configuration for Windows Media Audio Lossless.

Windows Media Audio Lossless (WMA Lossless) is a lossless audio codec that competes with ATRAC Advanced Lossless, Dolby TrueHD, DTS-HD Master Audio, Apple Lossless, Shorten, Monkey's Audio, FLAC, and WavPack (the last two have the advantage of being open source software and available for nearly any operating system). Designed for archival purposes,[36] it compresses audio signals without loss of quality from the original using VBR. When decompressed, the audio signal is an exact replica of the original. The first version of the codec, WMA 9 Lossless, and its revisions support up to 96 kHz, 24-bit audio for up to 6 discrete channels (5.1 channel surround) with dynamic range compression control. While Microsoft claims that it provides a compression ratio of up to 3:1 for audio CD tracks,[36] tests performed by ExtremeTech have found the realizable compression ratio to be closer to 2:1.[37]

Hardware support for the codec is available on the Bang & Olufsen Serenata [38], Sony Walkman NWZ-A and NWZ-S series, Zune 4, 8, 80 and 30 (with firmware version 2.2 or later),Xbox 360,[28] Windows Mobile-powered devices with Windows Media Player 10 Mobile,[29] Toshiba Gigabeat S and V models, the Meizu M3, and Best Buy's Insignia NS-DV, Pilot, and Sport music players. Contrary to some claims, the Archos make of media devices do not support WMA Lossless, nor does the SONOS system. Like WMA Standard, WMA Lossless is being used by a few online stores to distribute music online.[39][40] Similar to WMA Pro, WMA Lossless can perform downmixing when capable audio hardware is not present.

[edit] Windows Media Audio Voice

Windows Media Audio Voice (WMA Voice) is a lossy audio codec that competes with Speex (used in Microsoft's own Xbox Live online service[41]), ACELP, and other codecs. Designed for low-bandwidth, voice playback applications,[42] it employs low-pass and high-pass filtering of sound outside the human speech frequency range to achieve higher compression efficiency than WMA. It can automatically detect sections of an audio track containing both voice and music and use the standard WMA compression algorithm instead.[24] WMA Voice supports up to 22.05 kHz for a single channel (mono) only.[42] Encoding is limited to constant bit rate (CBR) and up to 20 kbit/s. The first and only version of the codec is WMA 9 Voice.

Windows Mobile-powered devices with Windows Media Player 10 Mobile have native support for WMA 9 Voice playback.[29] In addition, BBC World Service has employed WMA Voice for its Internet radio streaming service.[43]

[edit] Sound quality

See codec listening test for a table of double-blind listening test results.

Microsoft claims that audio encoded with WMA sounds better than MP3 at the same bit rate; Microsoft also claims that audio encoded with WMA at lower bit rates sound better than MP3 at higher bit rates.[44] Double blind listening tests with other lossy audio codecs have shown varying results, from failure to support Microsoft's claims about its superior quality to supremacy over other codecs. One independent test conducted in May 2004 at 128 kbit/s showed that WMA was roughly equivalent to LAME MP3; inferior to AAC and Vorbis; and superior to ATRAC3 (software version).[45] Another test performed by ExtremeTech showed different results, however, placing WMA at the top of the list in terms of quality.[37]

Some conclusions made by recent studies:

  • At 32 kbit/s, WMA Standard was noticeably better than LAME MP3, but not better than other modern codecs in a collective, independent test in July 2004.
  • At 48 kbit/s, WMA 10 Pro was ranked second after Nero HE-AAC and better than WMA 9.2 in an independent listening test organized and supported by Sebastian Mares and Hydrogenaudio Forums in December 2006. This test, however, used CBR for WMA 10 Pro and VBR for the other codecs.
  • At 64 kbit/s, WMA Pro outperformed Nero HE-AAC in a commissioned, independent listening test performed by the National Software Testing Labs in 2005. Out of 300 participants, "71% of all listeners indicated that WMA Pro was equal to or better than HE AAC."
  • At 80 kbit/s and 96 kbit/s, WMA had lower quality than HE-AAC, AAC-LC, and Vorbis; near-equivalent quality to MP3, and better quality than MPC in individual tests done in 2005.
  • At 128 kbit/s, there was a four-way tie between aoTuV Vorbis, LAME MP3, WMA 9 Pro and AAC in a large scale test in January 2006, with each codec sounding close to the uncompressed music file for most listeners.
  • At 768 kbit/s, WMA 9 Pro delivered full-spectrum response at half the bit rate required for DTS in a comparative test done by EDN in October 2003. The test sample was a 48 kHz, 5.1 channel surround audio track.

[edit] Players

Screenshot of Windows Media Player 11, built into most Windows Vista operating systems which supports all Windows Media Audio codecs.
Screenshot of Windows Media Player 11, built into most Windows Vista operating systems which supports all Windows Media Audio codecs.

Apart from Windows Media Player, the WMA format can be played using MPlayer, RealPlayer, Winamp (with certain limitations—DSP plugin support and DirectSound output is disabled using the default WMA plugin), and many other software media players. The Microsoft Zune media management software supports most WMA codecs, but uses a variation of Windows Media DRM which is used by PlaysForSure.

The FFmpeg project has reverse-engineered and re-implemented the WMA codecs (except WMA Pro) to allow their use on POSIX-compliant operating systems such as Linux. The rockbox project further extended this codec to be suitable for embedded cores, allowing playback on portable MP3 players and cell phones running open source software. RealNetworks has announced plans to support playback of DRM-unprotected WMA files in RealPlayer for Linux.[46] On the Macintosh platform, Microsoft released a PowerPC version of Windows Media Player for Mac OS X in 2003,[47] but further development of the software has ceased. Microsoft currently endorses the third-party Flip4Mac WMA, a QuickTime component which allows Macintosh users to play WMA files in any player that uses the QuickTime framework.[48] Flip4Mac however does not currently support the Windows Media Audio Voice codec.

[edit] Encoders

Software that can export audio in WMA format include Windows Media Player, Windows Movie Maker, Microsoft Expression Encoder, RealPlayer,[49] Adobe Premiere Pro,[50] Adobe Audition,[51] and Adobe Soundbooth.[52] Microsoft Office OneNote supports encoding in all WMA codecs,[53] and Windows Media Encoder supports all available bit rate and resolution options as well.

[edit] Digital rights management

Main article: Windows Media DRM

While none of the WMA codecs themselves contain any DRM facilities, the ASF container format, in which a WMA track may be encapsulated, can. Windows Media DRM, which can be used in conjunction with WMA, supports time-limited music subscription services such as those offered by unlimited download services, including MTV's URGE, Napster, Rhapsody, Yahoo! Music Unlimited and Virgin Digital. Windows Media DRM, a component of PlaysForSure and Windows Media Connect, is supported on many modern portable audio devices and streaming media clients such as Roku, SoundBridge, Xbox 360 and Wii. Players which support the WMA format but not Windows Media DRM lists protected titles as unplayable.

[edit] Criticism

[edit] Sound quality claims

WMA has been subjected to a number of complaints. "Some audiophiles challenge Microsoft's claims regarding WMA's quality," according to a published article from EDN.[12] Another article from MP3 Developments wrote that Microsoft's claim about CD-quality audio at 64 kbit/s with WMA was "very far from the truth."[54] At the early stages of WMA's development, a representative from RealNetworks claimed that WMA was a "clear and futile effort by Microsoft to catch up with RealAudio 8"[55]

Microsoft has sometimes claimed that the sound quality of WMA at 64 kbit/s equals or exceeds that of MP3 at 128 kbit/s (commonly considered to be near-transparent). In a 1999 study funded by Microsoft, NSTL found that listeners preferred WMA at 64 kbit/s to MP3 at 128 kbit/s (as encoded by MusicMatch Jukebox).[56] However, a September 2003 public listening test conducted by Roberto Amorim found that listeners preferred 128 kbit/s MP3 to 64 kbit/s WMA audio with greater than 99% confidence. This conclusion applied equally to other codecs at the same bitrate, leading him to conclude that:

No codec delivers the marketing plot of same quality as MP3 at half the bitrates.[57]

It is important to note that both MP3 and WMA encoders have undergone active development and improvement for many years, so their relative quality may change over time.

A July 2007 public listening test by Sebastian Mares found that 64 kbit/s HE-AAC audio (encoded by Nero Digital) was statistically tied with 64 kbit/s WMA Pro audio, in terms of listener preference.[58]

See also codec listening test for a table of public listening test results.

[edit] DRM

With regards to the DRM of WMA, the loss of the ability to restore licenses for WMA files in the Windows Media Player 11 was not positively received in an article from The Inquirer.[59] In addition, the publisher criticized the Microsoft Zune for not supporting the standard Windows Media DRM system, rendering protected WMA files unplayable on the device.[60]

[edit] See also

  • Codec – The technical term for compressor and decompressor
  • Windows Media – A digital media platform emcompassing individual Windows Media technologies
  • Windows Media Video – A video file format and codec developed by Microsoft
  • JPEG XR / HD Photo – An image file format and codec developed by Microsoft
  • Windows Media DRM – A digital rights management component of Windows Media used to control rights of accessing media content
  • WMV HD – The marketing name for high definition videos encoded using the Windows Media Video 9 codec
  • Surround sound – Audio with more than two channels
  • MPlayer – A third-party, open source, cross-platform media player capable of playing many Windows Media files using FFmpeg
  • FFmpeg – A third-party, cross-platform, free software codec library which implements Windows Media Video codecs
  • Lossy data compression – Data compression with loss of information
  • Lossless data compression – Data compression without loss of information
  • List of audio formats
  • Comparison of audio codecs

[edit] External links

[edit] References

  1. ^ Smith, Tony (1999-03-12). Microsoft readies MP3-killer digital music format. Retrieved on 2007-08-16.
  2. ^ a b Analysis of the Microsoft Audio Codec. RealNetworks. Retrieved on 2007-08-16.
  3. ^ Rosoff, Matt (2004-01-19). PC At Center of Microsoft's Digital Media Hub Strategy. Retrieved on 2007-08-16.
  4. ^ a b c d e f g Windows Media 9 Series Capabilities and Benefits Overview (DOC). International Narcotics Control Board. Retrieved on 2007-08-16.
  5. ^ a b Hinchberger, Bill (2001-09-09). Riding the Malvar Wave. Retrieved on 2007-08-16.
  6. ^ Press Kit: Henrique Malvar. Microsoft Research. Retrieved on 2007-08-16.
  7. ^ Communication, Collaboration, and Signal Processing. Microsoft Research. Retrieved on 2007-08-16.
  8. ^ Microsoft Windows Media Technologies Gains Support for Downloadable Music from Top Music Sites, Independent Labels, Popular Bands And Innovative Developers. Microsoft PressPass. Retrieved on 2007-08-16.
  9. ^ Barry, Richard (1999-04-14). MS Audio 4.0 will eat MP3.... Retrieved on 2007-08-16.
  10. ^ Windows Media Technologies 4 Delivers Cutting-Edge CD-Quality Audio On the Internet. Microsoft PressPass. Retrieved on 2007-08-16.
  11. ^ a b MS Windows Media Technologies Features. Microsoft TechNet. Retrieved on 2007-08-16.
  12. ^ a b The Internet-audio (r)evolution. Retrieved on 2007-08-16. “some audiophiles challenge Microsoft's claims regarding WMA's quality”
  13. ^ a b c d Broadcom Corporation: Audio Codecs.
  14. ^ Microsoft Announces Windows Media Technologies 7. Microsoft. Retrieved on 2007-08-16.
  15. ^ Microsoft Releases Windows Media Audio and Video 8. CDRInf. Retrieved on 2007-08-16.
  16. ^ Microsoft Wins Major ISV Support for Windows Media Technologies 4.0. Microsoft PressPass. Retrieved on 2007-08-16.
  17. ^ Thurrott, Paul (2005-04-30). SuperSite for Windows Media Player 7 Review. Retrieved on 2007-08-16.
  18. ^ The Difference Between ASF and WMV/WMA Files. Microsoft. Retrieved on 2007-08-16.
  19. ^ Windows Media Format 11 SDK Low-Delay Audio. Microsoft MSDN. Retrieved on 2007-08-16.
  20. ^ Microsoft and Nokia Collaborate to Help Ensure Consumers Can Enjoy Digital Music Anywhere.
  21. ^ Carnoy, David (2005-03-23). Sony PSP review. Retrieved on 2007-08-16.
  22. ^ Windows Media Audio Codecs: Windows Media Audio 10 Professional Windows Media Audio Codecs: Windows Media Audio 9 Professional. Microsoft. Retrieved on 2007-08-16.
  23. ^ Paul Thurrott's SuperSite for Windows: Windows Media 9 Series reviewed.
  24. ^ a b Windows Media Audio & Video 9 Series.
  25. ^ Windows Media Developer Center: Low-Delay Audio
  26. ^ a b Smith, Tony (2007-02-21). Best Practices for Windows Media Encoding. Retrieved on 2007-08-16.
  27. ^ Zune.net: How-To - Provide Content for Zune.
  28. ^ a b Spring '07 Video Playback FAQ.
  29. ^ a b c Windows Media Player Mobile FAQ.
  30. ^ Motorola and Microsoft Plan to Bring More Choice to Mobile Music Fans.
  31. ^ Motorola Dis Apple, Expect More Microsoft Music Phones: 3GSM.
  32. ^ WMV HD DVD Encoding Profile Guidelines.
  33. ^ Verizon Wireless Chooses Microsoft Windows Media to Power Its New V CAST Music Service.
  34. ^ Windows Vista Features Explained Windows Media Player 11.
  35. ^ Clarkezone James Clarke's Weblog.
  36. ^ a b Windows Media Audio Codecs: Windows Media Audio 9 Lossless Windows Media Audio Codecs: Windows Media Audio 9 Lossless. Microsoft. Retrieved on 2007-08-16.
  37. ^ a b ExtremeTech Audio Codec Quality Shootout.
  38. ^ [1]
  39. ^ Is This Digital Music's Future.
  40. ^ Online Stores in Windows Media Player.
  41. ^ Ralph Giles of Xiph.org explained that Xbox Live uses Speex for voice compression in June 6, 2005 interview on LugRadio: http://www.lugradio.org/episodes/29
  42. ^ a b Windows Media Audio Codecs: Windows Media Audio Voice Windows Media Audio Codecs: Windows Media Audio 9 Voice. Microsoft. Retrieved on 2007-08-16.
  43. ^ Roku - SoundBridge Internet Radio.
  44. ^ Windows Media: Music.
  45. ^ Results of Multiformat at 128 kbit/s public Listening Test.
  46. ^ Real to plug Windows media support into Linux (Stephen Shankland, CNET News.com, 17 August 2006)
  47. ^ Windows Media Player 9 for Mac OS X.
  48. ^ Important information for Windows Media Player for Mac users.
  49. ^ RealPlayer Customer Support: What formats can I use to record tracks from a CD with RealPlayer?.
  50. ^ Supported file formats in Adobe Premiere Pro 2.0,
  51. ^ Supported file formats (Adobe Audition 2.0).
  52. ^ Soundbooth CS3 supported file formats.
  53. ^ General information about the audio functionality in OneNote 2003.
  54. ^ Lossy Audio Formats. MP3Developments. Retrieved on 2007-08-16.
  55. ^ Codec Rivalry Spurs Development. streamingmedia.com Codec. Retrieved on 2007-08-16.
  56. ^ Microsoft's summary of the study. Full report from NSTL.
  57. ^ Results of 64kbit/s Listening Test
  58. ^ Results of Public, Multiformat Listening Test @ 64 kbps (July 2007)
  59. ^ Demerjian, Charlie (2006-09-21). Microsoft Media Player shreds your rights. Retrieved on 2007-08-16.
  60. ^ Demerjian, Charlie (2006-09-19). Zune will not play MS DRM infected files. Retrieved on 2007-08-16.