Multimedia Container Format

From Wikipedia, the free encyclopedia

MCF is an open (the specifications are available for everybody, free of charge), free (no royalties) data storage format called Multimedia Container Format (or Movie Container Format). The group has promised that the format and all software developed by them for will also stay free; it won't be turned into a commercial project once it's popular.

Essentially it is a file format, like Microsoft's AVI. However, it is also much more than that, having streaming and broadcasting features in the same format. It is not video or audio compression algorithm, but instead just a container that can hold any media inside it. This includes MPEG-4 (XviD and DivX), AC3, Ogg Vorbis, MP3 and others.

Contents

[edit] History

The reason this project was started is that in year 2000 the only usable movie format on PC was AVI, which had seen its best days and couldn't really be extended any further. The project started with brainstorming on what should an optimal video format do. At that time, the project was just a text file on the home computer of the current project leader, Lasse Kärkkäinen (Tronic). He wrote a very primitive specification and then contacted the definite number one AVI hobbyist, Avery Lee, best known of his free video editing application VirtualDub, for comments. After a few e-mails back and forth, the format began to get its shape. At that time the format was basically just a better AVI, with only some simple improvements such as timecodes, tagging and reliability against incomplete file transfers and other errors.

In 2001, the specification was written in HTML and the format was publicly published for the first time, asking for comments. Nobody really seemed to understand what the project was about, because there had not been any alternative container formats on PC before. It was very difficult to get any developers to even have a look at it. Finally, at a certain web forum, that no longer exists, some people paid attention and asked questions. However, they were quite cautious about such new format and many asked why not use QuickTime, Ogg or simply AVI instead. At this point everything was only on paper and thus it was difficult to gain respect for the project. This led to Lasse Kärkkäinen writing the first software for the format. In two days, he had a parser library and a simple ASCII art video player, supporting subtitles, released. This attracted a team together and the real development could finally begin. One of those to join was a C++ coder named Ingo Ralf Blum (ingo). Another talented C++ programmer, Steve Lhomme (robUx4), heard of the project few months later (January 23, 2002 is the exact date, found on his forum post). This man was to have an important role on the project.

Ingo Ralf Blum and Steve Lhomme were soon writing another version of the library, now in C++ (called, more properly, libmcf). Parts of this library are still in use in libebml. However, the format itself really started evolving, as the project was exponentially gathering more interest and thus good ideas on it. Because of that, the library was never quite up to date, on par with the specifications. At some point Ingo had some personal trouble and just disappeared from the net, leaving the code on Steve's hands.

During year 2002 the format matured a lot and the team started looking to get it running and thus into proper testing. However, at the same time, features were still pouring in and the specification could not be frozen. Pretty soon the libmcf was so much behind the specifications (especially thanks to the invention of Elements, which changed the format a lot) that it was decided to discontinue its development and to eventually write a new library, once the specification gets frozen.

In the end of 2002 Steve Lhomme started to experiment with completely new way of storing data, Extensible Binary Meta Language (EBML). He took the MCF format structure and replaced most of its simple binary fields by these EBML fields. After some discussion (he was proposing to replace the almost ready for release MCF with this new system), he started his own project (the date was December 6, 2002), Matroska, to experiment more with his system. Around the same time, Lasse Kärkkäinen, still leading the project and writing the specifications, had to go to military service and because the developers still left weren't comfortable enough for coding software, the MCF development was stalled for six months.

After a six month hibernation, in summer 2003, Matroska had got a four-month head-start in writing the software. All those who were in a hurry had moved their interests towards Matroska. This brought off the pressure of rushed release for MCF, and instead suggested that the format could be fine-tuned further. The experiments with Matroska proved to be useful feedback and thus MCF got many improvements that it would not have got without the split. This includes proper multi-segment support (virtual addressing) and even lower overhead.

As of April 8, 2004, the format was finalizing its specifications and the C++ library for handling it (libmcf) was roughly half done. However, the development is very slow, as the university takes most of Lasse's time.

[edit] Key features

Strict standard: doesn't allow users to add extensions by their own, like AVI, OGM and Matroska do. There is no technical reason to limit this, but this increases overall compatibility a lot and this is very important because the project is really aiming for hardware support. It's much better to have a single format that has the important features of all others. To make writing software easy, MCF uses fixed binary structures or otherwise simple methods whenever possible. One thing not found in MCF is XML (however, there might still be some stored inside MCF).

Also, because of MCF's strictly defined binary structure and offsets of few headers, software on a machine of the appropriate endianness can read them with single read command; this is very fast and really counts when you want to make inventory of all the files on your hard disk, on a network drive, or even of those on some FTP site (just read the first 2.5 KB of each file and you have all the data you need: no seeking and only a very little transfer).

Menus, chapters, subtitles and everything in the same file/stream.

ASCII-looking binary: MCF uses a lot of human-readable ASCII as identifiers instead of some more opaque binary. This allows easy debugging files with any hex editor. Also, the beginning of each MCF file contains some information readable with a text editor.

Variable framerates: MCF allows changing the frame rate of video in each scene. For instance, NTSC and PAL video use different frame rates, and MCF allows them to be intercut. Also, some codecs are tuned for a specific amount of motion per frame. An MCF encoder can lower the frame rate in scenes with less motion, reducing the amount of bandwidth required and improving quality with such codecs.

Seamless multi-segment: MCF supports dividing a long movie to several files, which can be recorded on several discs. If all segments are present at the same time, the user won't even notice the crossing of boundaries (most of the dirty work is done transparently in the parser, so players or codecs don't really need to bother with it). The user can also join the segments into a single big file or split it into segments of different sizes. Also, every segment is playable alone too, and you can define any overlap time you wish on them.

Full CRC32 protection: data can be protected, and if an error occurs (broken resume on download being the most common reason), the broken part is skipped (no more frozen frames or unplayable files). The parser can also tell where exactly the error happened, so you can re-download this specific part of the file. Another related feature is the playback of incomplete downloads without slow index regeneration or a requirement for a smart player with a smart parser.

Digital signatures: allow authenticating an author's releases against changes. If anyone changes the content on route, the digital signature can tell which parts have been changed, removed or added after signing it. One movie can also be signed by several different people, signing different parts of it, with different keys. This system uses commonly known public key algorithms, so it should be hard to break. Unlike a signature on an entire AVI file, MCF signs only the content, not the container. So, one can remux it (to better suit streaming, maybe), divide it into segments in a different way (or combine all segments), or basically do anything that doesn't change the actual contents and the signature will stay valid. Or he can add or remove tracks (some languages he doesn't need, for instance), and the digital signature still validates the parts that haven't been removed (but also tells which are missing or added).

No standardized digital rights management (DRM) or copy protection bits. The maintainers intend never to add support for these features trying to limit what users can do with the files they have. If you want to protect your content from being freely distributed, don't give copies of it for people you can't trust in the first place! The technical reasons for this are obvious too — if you can view something and you have specifications to the copy limiting system or sources for the player, you can break any protection in no time. The only systems that could work even for a moment (and probably not much longer) would be closed-source or hardware protection. Both of those also limit who can watch it (i.e. do I need a Fritz chip on my computer, with a complete Fritz-protected hardware/OS/software chain, or maybe Microsoft Windows with some commercial player). And in the end, someone will still break it, like happened with DVD's CSS. Not to mention that one could just make a copy through analog reconversion no matter how strong the digital protection used is. However, implementors are free to use a proprietary DRM layer around the whole MCF file.

[edit] Limits

Some limits of MCF are based on the limits of human perception, which cannot change. Humans can't notice a timing error of 0.5 ms between video and audio, and they don't benefit from over 1000 frame/s framerate either. This makes these borders available for it — the format can be simplified by using one millisecond precision in timecodes because we can be absolutely sure that it won't cause trouble in future either (except maybe for writers of some special editing software which needs 100% accuracy).

File sizes (also applies to size of a single stream without any breaks, or to total size of a movie split over several files) are limited by the 64-bit addressing, that cannot be changed without breaking compatibility. If we assume that the current exponential growth of 1000 times the amount every 10 years to continue, this should be enough for another 30 years. However, it is likely that even that growth cannot be sustained.

Frame sizes are limited by 32-bit addressing, so it is possible to have single frames up to four gigaoctets in size.

The timecode system allows a single movie to last for almost 35 years, without breaks. This means 40-bit integer for milliseconds, but actually a combination of 32-bit and 16-bit integers is used, to save space.

The number of different tracks, that can put in a single stream/file, is 65536. It is possible to fit a video stream in one, multi-channel audio in the second one and then there will still be 65534 to spare.

The number of segments one movie can be divided one movie into is 255.

Every MCF file requires around 3 KB of headers, so you really need to store at least 20 KB of data per file for it to be even remotely efficient (however, file systems would lose even much larger amounts of data because of allocation unit waste space).

The minimal possible overhead in normal situations is mostly dictated by frame overhead. This is only 7 octets in MCF! OpenDML AVI's overhead is around 40 octets (with full indices and legacy AVI compatibility) and Matroska's is around 10 octets.

When streaming and looking for lowest possible latency, the smallest possible transfer unit is one frame (with 25 octet overhead each, if using such minimal size units). The overhead is still only a half of that of OpenDML AVI file, and MCF can offer checksum protection in that! However, Matroska can do this with just about 16 octets.

[edit] Design goals

  • User-oriented: if there is a conflict between content producers' and users' view on something, users will win.
  • Feature rich: supports almost everything you really need and more features are added if required.
  • Compatible and easy to implement: wild extensions not allowed, simple data structures used whenever possible.
  • Platform independent: streams, broadcasts and files are no different from each-other
  • Extensible: nearly everything in MCF can be extended by adding new fields, still keeping the high space efficiency (most other extensible formats suffer from severe overhead when some extensions are added).
  • Efficient: general space efficiency and the speed of features such as seeking are important
  • Safe: the data structures used aren't as vulnerable to "unchecked buffer" or other common programming flaws as other systems could be, but also protect against some problems that cannot be fixed in parsers. MCF's design goal is that the parser should never hang for a long time, not even when reading broken data.
  • Primarily targeted at high quality movie distribution: extremely low bitrate, low latency streaming or editing capabilities aren't MCF's priorities, but MCF can pretty much get them for free too.

[edit] See also

[edit] External links

In other languages