AES3

AES3 (also known as AES/EBU) is a standard for the exchange of digital audio signals between professional audio devices. AES3 was jointly developed by the Audio Engineering Society (AES) and the European Broadcasting Union (EBU). An AES3 signal can carry two channels of PCM audio over several transmission media including balanced lines, unbalanced lines, and optical fiber.^[1] The standard was first published in 1985 and has been revised in 1992 and 2003.

AES3 has been incorporated into the International Electrotechnical Commission's standard IEC 60958, and is available in a consumer-grade variant known as S/PDIF.

History and development

The development of standards for digitising analog audio, as used to interconnect both professional and domestic audio equipment, began in the late 1970s^[2] in a joint effort between the Audio Engineering Society and the European Broadcasting Union, and culminated in the publishing of AES3 in 1985. Early on, the standard was frequently known as AES/EBU. Both AES and EBU versions of the standard exist. Variants using different physical connections—essentially consumer versions of AES3 for use within the domestic "Hi-Fi" environment using connectors more commonly found in the consumer market—are specified in IEC 60958. These variants are commonly known as S/PDIF.

The standard has been revised in 1992 and 2003 and is published in AES and EBU versions. Worldwide, it is the most commonly used method for digitally interconnecting audio equipment.

Hardware connections

The AES3 standard parallels part 4 of the international standard IEC 60958. Of the physical interconnection types defined by IEC 60958, three are in common use.

IEC 60958 Type I—Balanced, XLR

XLR connectors, used for IEC 60958 Type I connections.

Type I connections use balanced, 3-conductor, 110-ohm twisted pair cabling with XLR connectors. Type I connections are most often used in professional installations and are considered the AES3 standard connector. The hardware interface is usually implemented using RS-422 line drivers and receivers.

Type I connector ends
	Cable end	Device end
Input	XLR male plug	XLR female jack
Output	XLR female plug	XLR male jack

IEC 60958 Type II—Unbalanced, RCA

RCA connectors, used for IEC 60958 Type II connections.

Type II connections use unbalanced, 2-conductor, 75-ohm coaxial cable with RCA connectors. Type II connections are used in most often in consumer audio installations and are often called coaxial S/PDIF connections.

Type II connector ends
	Cable end	Device end
Input	RCA male plug	RCA female jack
Output	RCA male plug	RCA female jack

IEC 60958 Type III Optical—Fiber, F05/TOSLINK

F05/TOSLINK connector, used for IEC 60958 Type III connections.

Type III Optical connections use optical fiber—usually plastic, but occasionally glass—with F05 connectors, which are more commonly known by their Toshiba brand name, TOSLINK. Like Type II, Type III Optical connections are also used in consumer audio installations and are often called optical S/PDIF connections.

Type III Optical connector ends
	Cable end	Device end
Input	F05/TOSLINK male plug	F05/TOSLINK female jack
Output	F05/TOSLINK male plug	F05/TOSLINK female jack

Other connections

BNC connector, used for AES-3id connections.

The AES-3id standard defines a 75-ohm BNC electrical variant of AES3. This uses the same cabling, patching and infrastructure as analogue or digital video, and is thus common in the broadcast industry.

AES3 digital audio format can also be carried over an Asynchronous Transfer Mode network. The standard for packing AES3 frames into ATM cells is AES47.

For information on the synchronization of digital audio structures, see the AES11 standard. The ability to insert unique identifiers into an AES3 bit stream is covered by the AES52 standard.

Relation to S/PDIF

The precursor of the IEC 60958 Type II specification was the Sony/Philips Digital Interface, or S/PDIF. S/PDIF and AES3 are similar in many ways and are interchangeable at the protocol level, but at the physical level they specify different electrical signaling levels and impedances, which may be significant in some applications.

Protocol

Simple representation of the protocol for both AES3 and S/PDIF

The low-level protocol for data transmission in AES3 and S/PDIF is largely identical, and the following discussion applies for S/PDIF, except as noted.

AES3 was designed primarily to support stereo PCM encoded audio in either DAT format at 48 kHz or CD format at 44.1 kHz. No attempt was made to use a carrier able to support both rates; instead, AES3 allows the data to be run at any rate, and encoding the clock and the data together using biphase mark code (BMC).

Each bit occupies one time slot.

Each audio sample (of up to 24 bits) is combined with four flag bits and a synchronisation preamble which is four time slots long to make a subframe of 32 time slots.

Two subframes (A and B, normally used for left and right audio channels) make a frame. Frames contain 64 time slots and are produced once per sample time. This determines the clock rate.

At the highest level, each 192 consecutive frames are grouped into an audio block. While samples repeat each frame time, metadata is only transmitted once per audio block.

At the default 48 kHz sample rate, there are 250 audio blocks per second, and 3,072 kilobits per second with a biphase clock of 6.144 MHz ^[3]

The 32 time slots of each subframe are assigned as follows:

AES3 subframe
Time slot	Name	Description
0–3	Preamble	A synchronisation preamble (biphase mark code violation) for audio blocks, frames, and subframes.
4–7	Auxiliary sample (optional)	A low-quality auxiliary channel used as specified in the channel status word, notably for producer talkback or recording studio-to-studio communication.
8–27, or 4–27	Audio sample	One sample stored with most significant bit (MSB) last. If the auxiliary sample is used, bits 4–7 are not included. Data with smaller sample bit depths always have MSB at bit 27 and are zero-extended towards the least significant bit (LSB).
28	Validity (V)	Unset if the audio data are correct and suitable for D/A conversion. During the presence of defective samples, the receiving equipment may be instructed to mute its output. It is used by most CD players to indicate that concealment rather than error correction is taking place.
29	User data (U)	Forms a serial data stream for each channel (with 1 bit per frame), with a format specified in the channel status word.
30	Channel status (C)	Bits from each frame of an audio block are collated giving a 192-bit channel status word. Its structure depends on whether AES3 or S/PDIF is used.
31	Parity (P)	Even parity bit for detection of errors in data transmission. (I.e. bits 4–31 have an even number of ones.)

Synchronisation preamble

This is a specially coded preamble that identify the subframe and its position within the audio block. They are not normal BMC-encoded data bits, although they do still have zero DC bias.

Three preambles are possible :

X (or M) : 11100010₂ if previous time slot was 0, 00011101₂ if it was 1. (Equivalently, 10010011₂ NRZI encoded.) Marks a word for channel A (left), other than at the start of an audio block.
Y (or W) : 11100100₂ if previous time slot was 0, 00011011₂ if it was 1. (Equivalently, 10010110₂ NRZI encoded.) Marks a word for channel B (right).
Z (or B) : 11101000₂ if previous time slot was 0, 00010111₂ if it was 1. (Equivalently, 10011100₂ NRZI encoded.) Marks a word for channel A (left) at the start of an audio block.

They are called X, Y, Z in the AES3 standard; and M, W, B in IEC 958 (an AES extension).

The 8-bit preambles are transmitted in time allocated to the first four time slots of each subframe (time slots 0 to 3). Any of the three marks the beginning of a subframe. X or Z marks the beginning of a frame, and Z marks the beginning of an audio block.

 | 0 | 1 | 2 | 3 |  | 0 | 1 | 2 | 3 | Time slots
  _____       _            _____   _
 /     \_____/ \_/  \_____/     \_/ \ Preamble X
  _____     _              ___   ___
 /     \___/ \___/  \_____/   \_/   \ Preamble Y
  _____   _                _   _____
 /     \_/ \_____/  \_____/ \_/     \ Preamble Z
  ___     ___            ___     ___ 
 /   \___/   \___/  \___/   \___/   \ All 0 bits BMC encoded
  _   _   _   _        _   _   _   _
 / \_/ \_/ \_/ \_/  \_/ \_/ \_/ \_/ \ All 1 bits BMC encoded
 
 | 0 | 1 | 2 | 3 |  | 0 | 1 | 2 | 3 | Time slots

In two-channel AES3, the preambles form a pattern of ZYXYXYXY…, but it is straightforward to extend this structure to additional channels (more subframes per frame), each with a Y preamble, as is done in the MADI protocol.

Channel status word

As stated before there is one channel status bit in each subframe, making one 192 bit word for each channel in each block. This 192 bit word is usually presented as 192/8 = 24 bytes. The contents of the channel status word are completely different between the AES3 and S/PDIF standards, although they agree that the first channel status bit (byte 0 bit 0) distinguishes between the two. In the case of AES3, the standard describes in detail how the bits have to be used. Here is a summary of the channel status word:^[1]

byte 0: basic control data: sample rate, compression, emphasis
- bit 0: A value of 1 indicates this is AES3 channel status data. 0 indicates this is S/PDIF data.
- bit 1: A value of 0 indicates this is linear audio PCM data. A value of 1 indicates other (usually non-audio) data.
- bits 2–4: Indicates the type of signal preemphasis applied to the data. Generally set to 100₂ (none).
- bit 5: A value of 0 indicates that the source is locked to some (unspecified) external time sync. A value of 1 indicates an unlocked source.
- Bits 6–7: Sample rate. These bits are redundant when real-time audio is transmitted (the receiver can observe the sample rate directly), but are useful if AES3 data is recorded or otherwise stored. Options are unspecified, 48 kHz (the default), 44.1 kHz, and 32 kHz.
byte 1: indicates if the audio stream is stereo, mono or some other combination.
- bits 0–3: Indicates the relationship of the two channels; they might be unrelated audio data, a stereo pair, duplicated mono data, music and voice commentary, a stereo sum/difference code.
- bits 4–7: Used to indicate the format of the user channel word.
byte 2: audio word length
- bits 0–2: Aux bits usage. This indicates how the aux bits (time slots 4–7) are used. Generally set to 000₂ (unused) or 001₂ (used for 24-bit audio data).
- bits 3–5: Word length. Specifies the sample size, relative to the 20- or 24-bit maximum. Can specify 0, 1, 2 or 4 missing bits. Unused bits are filled with 0, but audio processing functions such as mixing will generally fill them in with valid data without changing the effective word length.
- bits 6–7: Unused
byte 3: used only for multichannel applications
byte 4: Additional sample rate information.
- bits 0–1: indicate the grade of the sample rate reference, per AES11.
- bit 2: reserved
- bits 3–6: Extended sample rate. This indicates other sample rates, not representable in byte 0 bits 6–7. Values are assigned for 24, 96, and 192 kHz, as well as 22.05, 88.2, and 176.4 kHz.
- bit 7: This "sampling frequency scaling flag", if set, indicates that the sample rate is multiplied by 1/1.001 to match NTSC video frame rates.
byte 5: reserved
bytes 6–9: Four ASCII characters for indicating channel origin. Widely used in large studios.
bytes 10–13: Four ASCII characters indicating channel destination, to control automatic switchers. Less often used.
bytes 14–17: 32-bit sample address, incrementing block-to-block by 192 (because there are 192 frames per block). At 48 kHz, this wraps every 24h51m18.485333s.
bytes 18–21: as above, but offset to indicate samples since midnight.^[4]
byte 22: contains information about the reliability of the channel status word.
- bits 0–3: reserved
- bit 4: if set, bytes 0–5 (signal format) are unreliable.
- bit 5: if set, bytes 6–13 (channel labels) are unreliable.
- bit 6: if set, bytes 14–17 (sample address) are unreliable.
- bit 7: if set, bytes 18–21 (timestamp) are unreliable.
byte 23: CRC. This byte is used to detect corruption of the channel status word, as might be caused by switching mid-block. (Generator polynomial is x⁸+x⁴+x³+x²+1, preset to 1.)

Embedded timecode

SMPTE timecode timestamp data can be embedded within AES3 digital audio signals. It can be used for synchronization and for logging and identifying audio content. According to John Ratcliff's Timecode: A user's guide, it is embedded as a 32-bit binary word in bytes 18 to 21 of the channel status data.^[5]

References

1 2 "Specification of the AES/EBU digital audio interface (The AES/EBU interface)" (PDF). European Broadcast Union. 2004. Retrieved 2014-01-07.
↑ "About AES Standards". Audio Engineering Society. Retrieved 2014-01-07. In 1977, stimulated by the growing need for standards in digital audio, the AES Digital Audio Standards Committee was formed.
↑ Robin, Michael (1 September 2004). "The AES/EBU digital audio signal distribution standard". Broadcastengineering.com. Archived from the original on 2012-07-09. Retrieved 2012-05-13.
↑ "Specification of the AES/EBU digital audio interface (The AES/EBU interface)" (PDF). European Broadcast Union. 2004. p. 12. Retrieved 2014-01-07. Bytes 18 to 21, Bits 0 to 7: Time of day sample address code. Value (each Byte): 32-bit binary value representing the first sample of current block. LSBs are transmitted first. Default value shall be logic "0". Note: This is the time-of-day laid down during the source encoding of the signal and shall remain unchanged during subsequent operations. A value of all zeros for the binary sample address code shall, for the purposes of transcoding to real time, or to time codes in particular, be taken as midnight (i.e., 00 h, 00 mm, 00 s, 00 frame). Transcoding of the binary number to any conventional time code requires accurate sampling frequency information to provide the sample accurate time.
↑ Ratcliff, John (1999). Timecode: A user's guide. Focal Press. pp. 226, 228. ISBN 0-240-51539-0.

External links

Download page for AES standards

List of International Electrotechnical Commission standards
IEC standards	IEC 60027 IEC 60034 IEC 60038 IEC 60062 IEC 60063 IEC 60068 IEC 60112 IEC 60228 IEC 60269 IEC 60297 IEC 60309 IEC 60320 IEC 60364 IEC 60446 IEC 60559 IEC 60601 IEC 60870 IEC 60870-5 IEC 60870-6 IEC 60906-1 IEC 60908 IEC 60929 IEC 60958 AES3 S/PDIF IEC 61030 IEC 61131 IEC 61131-3 IEC 61158 IEC 61162 IEC 61334 IEC 61346 IEC 61355 IEC 61400 IEC 61499 IEC 61508 IEC 61511 IEC 61850 IEC 61883 IEC 61960 IEC 61968 IEC 61970 IEC 62014-4 IEC 62056 IEC 62061 IEC 62196 IEC 62262 IEC 62264 IEC 62304 IEC 62325 IEC 62351 IEC 62365 IEC 62366 IEC 62379 IEC 62386 IEC 62455 IEC 62680 IEC 62682 IEC 62700
ISO/IEC standards	ISO/IEC 646 ISO/IEC 2022 ISO/IEC 5218 ISO/IEC 6429 ISO/IEC 6523 ISO/IEC 7810 ISO/IEC 7811 ISO/IEC 7812 ISO/IEC 7813 ISO/IEC 7816 ISO/IEC 7942 ISO/IEC 8613 ISO/IEC 8632 ISO/IEC 8652 ISO/IEC 8859 ISO/IEC 9126 ISO/IEC 9293 ISO/IEC 9592 ISO/IEC 9593 ISO/IEC 9899 ISO/IEC 9945 ISO/IEC 9995 ISO/IEC 10021 ISO/IEC 10116 ISO/IEC 10165 ISO/IEC 10179 ISO/IEC 10646 ISO/IEC 10967 ISO/IEC 11172 ISO/IEC 11179 ISO/IEC 11404 ISO/IEC 11544 ISO/IEC 11801 ISO/IEC 12207 ISO/IEC 13250 ISO/IEC 13346 ISO/IEC 13522-5 ISO/IEC 13568 ISO/IEC 13818 ISO/IEC 14443 ISO/IEC 14496 ISO/IEC 14882 ISO/IEC 15288 ISO/IEC 15291 ISO/IEC 15408 ISO/IEC 15444 ISO/IEC 15445 ISO/IEC 15504 ISO/IEC 15511 ISO/IEC 15693 ISO/IEC 16262 ISO/IEC 17024 ISO/IEC 17025 ISO/IEC 18000 ISO/IEC 18004 ISO/IEC 18014 ISO/IEC 19752 ISO/IEC 19757 ISO/IEC 19770 ISO/IEC 19788 ISO/IEC 20000 ISO/IEC 21000 ISO/IEC 21827 ISO/IEC 23003 ISO/IEC 23270 ISO/IEC 23360 ISO/IEC 24707 ISO/IEC 24727 ISO/IEC 24744 ISO/IEC 26300 ISO/IEC 27000 ISO/IEC 27000-series ISO/IEC 27002 ISO/IEC 27040 ISO/IEC 29119 ISO/IEC 33001 ISO/IEC 42010 ISO/IEC 80000 ISO/IEC 38500 ISO/IEC 4909
Related	International Electrotechnical Commission

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.