Audio bit depth
From Wikipedia, the free encyclopedia
The current version of the article or section is written like a magazine article; it does not use the direct, balanced tone expected of an encyclopedia. Please discuss this issue on the talk page. Editing help is available. |
This article includes a list of references or external links, but its sources remain unclear because it lacks in-text citations. You can improve this article by introducing more precise citations. |
In digital audio, bit depth describes the number of bits of information recorded for each sample. Bit depth directly corresponds to the resolution of each sample in a set of digital audio data. Common examples of bit depth include CD audio, which is recorded at 16 bits, and DVD-Audio, which can support up to 24-bit audio.
Contents |
[edit] Digital audio
A set of digital audio samples contains data that, when converted into an analog signal, provides the necessary information to reproduce the sound wave. In pulse-code modulation (PCM) sampling, the bit depth will limit quantities such as dynamic range and signal-to-noise ratio. The bit depth will not limit frequency range, which is limited by the sample rate.
By increasing the sampling bit depth, smaller fluctuations of the audio signal can be resolved (also referred to as an increase in dynamic range). The 'rule-of-thumb' relationship between bit depth and dynamic range is, for each 1-bit increase in bit depth, the dynamic range will increase by 6 dB (see Signal-to-noise ratio#Fixed point). 24-bit digital audio has a theoretical maximum dynamic range of 144 dB, compared to 96 dB for 16-bit; however, current digital audio converter technology is limited to dynamic ranges of ~115 dB because of 'real world' limitations in integrated circuit design.[citation needed][1]
Technically speaking, bit depth is only meaningful when applied to pure PCM devices. Non-PCM formats, such as DSD or lossy compression systems like MP3, have bit depths that are not defined in the same sense as PCM. This is particularly true for lossy audio compression, where bits are allocated to other types of information, and the bits actually allocated to individual samples are allowed to fluctuate within the constraints imposed by the allocation algorithm. Recently, many lossy formats such as DTS and WMA Pro have been promoted as 24-bit. However, this is not correct.[citation needed] A lossy file will not actually contain 24 bits worth of information per sample, but is actually a file that was originally mastered at 24 bits and then compressed.
[edit] Dynamic range
The importance of bit depth in PCM audio is that it determines the maximum possible dynamic range of the signal, or the difference between the loudest possible sounds and the lowest possible noise. For a typical PCM recording, in which no noise shaping is employed and the frequency range extends most of the way to the Nyquist limit, the dynamic range in dB is equal to 1.76 + 6.02 * bits. This formula is often simplified to 6 dB per bit, which yields the common value of 96 dB for 16-bit CD audio.
It should be restated that this is only valid for PCM sampling without post-processing. Systems such as DSD use a different modulation technique where the SNR is not determined exclusively by the bit depth and the audio band does not extend to the Nyquist Limit.
[edit] What is a 'bit' of data?
In computing parlance, 'bit' is the abbreviation for a single 'binary digit', represented by a 0 or a 1. Since a digital 'word' is simply a binary number, the number of bits per word is simply the number of digits that make up the corresponding number.
Binary numerics are base-2; thus, each digit can only be a '0' or a '1'. In comparison, traditional decimal numerics are base-10, having digits that can only be 0 through 9. For example, the 16-bit binary number '1001011011001010' is equivalent to the 5-digit decimal number 38602.
[edit] Bit rate
Bit rate refers to the amount of data, specifically bits, transmitted or received per second.
One of the most common bit rates given is that for compressed audio files. For example, an MP3 file might be described as having a bit rate of 160 kbit/s or 160000 bits/second. This indicates the amount of compressed data needed to store one second of music.
The standard audio CD is said to have a data rate of 44.1 kHz/16, implying the audio data was sampled 44,100 times per second, with a bit depth of 16. CD tracks are usually stereo, using a left and right track, so the amount of audio data per second is double that of mono, where only a single track is used. The bit rate is then 44100 samples/second * 16 bits/sample * 2 = 1,411,200 bit/s or 1.4 Mbit/s.
This explains why, for example, a Minidisc recorder, which uses ATRAC compression, can store files lasting twice as long on a disc, if the default, recording in 2 channel stereo, is set to single channel mono recording.
To fully define a sound file's digital audio bit rates, the sampling rate, word size, number of channels, (e.g. mono, stereo, four-track), and format of the data also need to be known.
[edit] Calculating values
There is an easy way to determine a file's bit rate when given sufficient information. In fact, as long as you know any three of the following four values, you can calculate the missing value.
Bit rate = (bit depth) x (sampling rate) x (number of channels)
For a recording with a 44.1 kHz sampling rate, 2 channels (stereo) and a 16 bit depth:
16 x 44100 x 2 = 1411200 bits per second, or, 1411.2 kbit/s
[edit] Sources
Much of the information in this article can be found in Principles of Digital Audio, 4th Edition (Pohlmann, McGraw Hill) with some contributions made by one or more users knowledgeable in the area of digital audio; the book was not the specific reference for this article. Nevertheless, it is one of possibly many printed sources for this information.