High dynamic range imaging

Tone-mapped HDR image of a New York City nighttime cityscape.
Tone-mapped image made from nine exposures of Trenčín, Slovakia at night.

In image processing, computer graphics, and photography, high dynamic range imaging (HDRI or just HDR) is a set of techniques that allows a greater dynamic range of luminances between light and dark areas of a scene than normal digital imaging techniques. The intention of HDRI is to accurately represent the wide range of intensity levels found in real scenes ranging from direct sunlight to shadows.

High dynamic range imaging was originally developed in the 1930s and 1940s by Charles Wyckoff. Wyckoff's detailed pictures of nuclear explosions appeared on the cover of Life magazine in the mid-1940s. The process of tone mapping together with bracketed exposures of normal digital images, giving the end result a high, often exaggerated dynamic range, was first reported in 1993,[1] and resulted in a mathematical theory of differently exposed pictures of the same subject matter that was published in 1995.[2] In 1997 this technique of combining several differently exposed images to produce a single HDR image was presented to the computer graphics community by Paul Debevec.

This method was developed to produce a high dynamic range image from a set of photographs taken with a range of exposures. With the rising popularity of digital cameras and easy-to-use desktop software, the term HDR is now popularly used[3] to refer to this process. This composite technique is different from (and may be of lesser or greater quality than) the production of an image from a single exposure of a sensor that has a native high dynamic range. Tone mapping is also used to display HDR images on devices with a low native dynamic range, such as a computer screen.

History

The idea of using several exposures to fix a too-extreme range of luminance was pioneered as early as the 1850s by Gustave Le Gray to render seascapes showing both the sky and the sea. Such a rendering was impossible at the time using standard methods, the luminosity range being too extreme. Le Gray used one negative for the sky, and another with a longer exposure for the sea, and combined the two into a single positive print.[4]

The desirability of HDR has been recognised for decades, but its wider usage was, until quite recently, precluded by the limitations of available computer processing power. Probably the first practical application of HDRI was in the movie industry in the late 1980s. In 1985, Gregory Ward had created the Radiance RGBE image file format, the first (and still the most commonly used) high dynamic range imaging file format. In 1997 the technique of combining several differently exposed images to produce a single HDR image was presented to the public by Paul Debevec, and his research has since contributed significantly to the popularization of HDRI.

HDRI is commonly used in real estate and architectural photography[citation needed] because it can properly expose both window and room areas simultaneously.

Until recently there were no "pure" examples of HDR-based cinematography, since the effects were most commonly used during composited sequences in films. However, with the advent of low-cost prosumer digital cameras, many amateurs began posting HDR time-lapse videos on sites such as Blip.tv and Vuze.com. In 2008 Mobius/Quark Films released "Silicon Valley Timelapse", which is said to feature almost 1.1 million frames of tone-mapped HDR, making it the largest single source of tone-mapped HDR footage available to date.

Comparison with traditional digital images

Fountain in Dupont Circle, Washington DC using HDRI.

Information stored in high dynamic range images usually corresponds to the physical values of luminance or radiance that can be observed in the real world. This is different from traditional digital images, which represent colors that should appear on a monitor or a paper print. Therefore, HDR image formats are often called "scene-referred", in contrast to traditional digital images, which are "device-referred" or "output-referred". Furthermore, traditional images are usually encoded for the human visual system (maximizing the visual information stored in the fixed number of bits), which is usually called "gamma encoding" or "gamma correction". The values stored for HDR images are often linear, which means that they represent relative or absolute values of radiance or luminance (gamma 1.0).
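
As a minimal sketch of this distinction (in Python with NumPy, assuming a plain power-law gamma of 2.2 rather than the exact piecewise sRGB curve), gamma encoding can be written as:

    import numpy as np

    def linear_to_gamma(radiance, gamma=2.2, white=1.0):
        """Encode scene-referred linear radiance as 8-bit, output-referred
        pixels using a simple power-law curve (real sRGB differs slightly)."""
        normalized = np.clip(radiance / white, 0.0, 1.0)  # limit to display range
        encoded = normalized ** (1.0 / gamma)             # gamma "correction"
        return (encoded * 255.0 + 0.5).astype(np.uint8)   # quantize to 8 bits

    def gamma_to_linear(pixels, gamma=2.2, white=1.0):
        """Invert the encoding, recovering (clipped) linear radiance."""
        return ((pixels.astype(np.float64) / 255.0) ** gamma) * white

    # Half of the 8-bit code values cover only the darkest ~22% of linear
    # radiance, matching the eye's greater sensitivity in the shadows:
    print(gamma_to_linear(np.uint8(128)))  # ~0.22, not 0.5

This is why equal steps in stored gamma-encoded values correspond to perceptually similar, rather than physically equal, steps in brightness.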

HDR images require a higher number of bits per color channel than traditional images, both because of the linear encoding and because they need to represent values from 10⁻⁴ to 10⁸ (the range of visible luminance values) or more. 16-bit ("half precision") or 32-bit floating point numbers are often used to represent HDR pixels. However, when the appropriate transfer function is used, HDR pixels for some applications can be represented with as few as 10–12 bits for luminance and 8 bits for chrominance without introducing any visible quantization artifacts.[5]
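
The appeal of floating point for scene-referred data can be illustrated with a short sketch (Python with NumPy): the relative quantization error of a 16-bit "half" float stays roughly constant across many orders of magnitude, whereas fixed-point formats spend their precision unevenly. (Half floats cover only roughly 6×10⁻⁵ to 6.5×10⁴ as normal values, so absolute luminances are typically stored in scaled, relative form.)

    import numpy as np

    # The relative error of rounding to 16-bit "half" floats stays roughly
    # constant (on the order of 5e-4) across widely differing values:
    for luminance in [1e-4, 1.0, 1e4]:
        stored = np.float16(luminance)
        rel_err = abs(float(stored) - luminance) / luminance
        print(f"{luminance:>8g} -> stored {float(stored):.6g}, "
              f"relative error {rel_err:.2e}")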

Synthetic HDR images

Main article: High dynamic range rendering

Computer-created HDR images were first produced with various renderers, notably Radiance. This allowed for more realistic renditions of modelled scenes because the quantities computed were actual physical units, e.g. watts/steradian/m². It made it possible for the lighting of a real scene to be simulated and the output to be used to make lighting choices (assuming the geometry, lighting, and materials were an accurate representation of the real scene).

At the 1997 SIGGRAPH, Paul Debevec presented his paper entitled "Recovering High Dynamic Range Radiance Maps from Photographs".[6] It described photographing the same scene many times with a wide range of exposure settings and combining those separate exposures into one HDR image. This HDR image captured a higher dynamic range of the viewed scene, from the dark shadows all the way up to bright lights or reflected highlights.
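
This procedure is implemented, for example, in OpenCV's photo module; a minimal sketch (in Python, with placeholder file names and exposure times) might look like:

    import cv2
    import numpy as np

    # Bracketed exposures of the same scene (placeholder file names);
    # the exposure times, in seconds, must match how the shots were taken.
    files = ["dark.jpg", "mid.jpg", "bright.jpg"]
    times = np.array([1 / 250, 1 / 30, 1 / 4], dtype=np.float32)
    images = [cv2.imread(f) for f in files]

    # Recover the camera's response curve, then merge the exposures into
    # one scene-referred floating-point radiance map, following Debevec's
    # method.
    response = cv2.createCalibrateDebevec().process(images, times)
    hdr = cv2.createMergeDebevec().process(images, times, response)

    cv2.imwrite("radiance_map.hdr", hdr)  # saved in the Radiance RGBE format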

A year later at SIGGRAPH '98, Debevec presented "Rendering Synthetic Objects into Real Scenes: Bridging Traditional and Image-Based Graphics with Global Illumination and High Dynamic Range Photography".[7] In this paper he used his previous technique to photograph a shiny chrome ball to produce what he called a "light probe", essentially an HDR environment map. This light probe could then be used in the rendering of a synthetic scene. Unlike a normal environment map that simply provides something to show in reflections or refractions, the light probe also provided the light for the scene. In fact, it was the only light source. This added an unprecedented level of realism, supplying real-world lighting data to the whole lighting model.
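
The geometry behind such a light probe can be sketched as follows (Python with NumPy; this assumes an idealized orthographic camera looking along +z at a perfectly mirrored sphere, and is an illustration of the mapping rather than Debevec's exact pipeline):

    import numpy as np

    def mirror_ball_direction(u, v):
        """Map normalized mirror-ball image coordinates (u, v) in [-1, 1]
        to the world-space direction whose light reaches that pixel."""
        c2 = 1.0 - u * u - v * v
        if c2 < 0.0:
            return None                    # pixel falls outside the ball
        c = np.sqrt(c2)                    # z-magnitude of the surface normal
        view = np.array([0.0, 0.0, 1.0])   # incoming view ray
        normal = np.array([u, v, -c])      # normal faces the camera
        # Mirror reflection: r = d - 2 (d . n) n
        return view - 2.0 * np.dot(view, normal) * normal

    print(mirror_ball_direction(0.0, 0.0))    # [0, 0, -1]: back at the camera
    print(mirror_ball_direction(0.999, 0.0))  # nearly [0, 0, 1]: behind the ball

Because grazing pixels near the ball's rim reflect directions behind the ball, a single photograph of the sphere captures nearly the full sphere of incoming light.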

HDRI lighting plays a major role in filmmaking whenever computer-generated 3D objects must be integrated into live-action scenes.

Tone mapping

An HDR image made from three exposures and tone mapped into an 8-bit JPEG image.
Main article: Tone mapping

One problem with HDR has always been in viewing the images. Typical computer monitors (CRTs, LCDs), prints, and other methods of displaying images only have a limited dynamic range. Thus various methods of converting HDR images into a viewable format have been developed, generally called "tone mapping".

Early methods of tone mapping were simple: they showed a "window" of the full dynamic range, clipping values outside set minimum and maximum bounds. More recent methods attempt to compress the dynamic range into one reproducible by the intended display device. The most complex of these draw on research into how the human eye and visual cortex perceive a scene, trying to show the whole dynamic range while retaining realistic colour and contrast.
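
The contrast between these two approaches can be sketched in a few lines (Python with NumPy; the curve below is the simplest global form of the Reinhard operator, omitting its normalization by the image's log-average luminance):

    import numpy as np

    def tonemap_clip(hdr, lo=0.0, hi=1.0):
        """Early-style tone mapping: show one 'window' of the dynamic
        range and clip everything outside it."""
        return np.clip((hdr - lo) / (hi - lo), 0.0, 1.0)

    def tonemap_reinhard(hdr):
        """Global operator L / (1 + L): compresses the full range smoothly,
        so no region is forced to pure black or pure white."""
        return hdr / (1.0 + hdr)

    # Luminances spanning six orders of magnitude:
    scene = np.array([0.001, 0.1, 1.0, 10.0, 1000.0])
    print(tonemap_clip(scene))      # [0.001 0.1 1. 1. 1.] - highlights lost
    print(tonemap_reinhard(scene))  # every value remains distinguishable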

Images with too much "HDR" processing have their range over-compressed, creating a surreal low-dynamic-range rendering of a high-dynamic-range scene.

Image sensors

CMOS image sensors are now capable of capturing HDR images with a dynamic range of up to 110 dB, at resolutions of up to 2 megapixels and in form factors as small as 1/5", in real time and at low cost, in contrast to far more expensive CCD sensors.
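
Sensor dynamic range in decibels is 20·log₁₀ of the ratio between the largest and smallest usable signal, so the 110 dB figure can be translated into more familiar photographic terms (a quick Python check):

    import math

    # Sensor dynamic range is usually quoted in decibels:
    #     DR_dB = 20 * log10(full_well_signal / noise_floor)
    db = 110.0
    contrast = 10 ** (db / 20.0)  # ~316,000 : 1
    stops = math.log2(contrast)   # ~18.3 photographic stops
    print(f"{db} dB = {contrast:,.0f}:1 = {stops:.1f} stops")
    # A typical 8-bit, gamma-encoded image covers far less than this,
    # which is why such sensors are described as HDR-capable.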

Exposure examples

Three exposures of the same image.

Here the dynamic range of the image is demonstrated by adjusting the "exposure" when tone-mapping the HDR image into an LDR one for display. The above sequence uses an image rendered with Radiance using Paul Debevec's light probe of the Uffizi gallery. The rendering software produces a high dynamic range image. When making the JPEG images, one selects a part of that range for display. This is similar to how a conventional camera captures only a portion of the dynamic range of a real physical scene.

The middle exposure is the desired exposure and is likely how this scene would normally be presented. The exposure to the left is 4 EV darker, showing some detail in the bright clouds in the sky. The exposure to the right is 3 EV lighter, showing some detail in the darker parts of the scene. This shows why compositing is desirable; a composite image can retain the interesting details from all three exposure settings.
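
A minimal sketch of this kind of exposure selection (Python with NumPy, assuming linear HDR values and a simple gamma-2.2 display encoding; the sample values and EV offsets are illustrative):

    import numpy as np

    def expose(hdr, ev=0.0, gamma=2.2):
        """Pick one 'exposure' out of an HDR image for LDR display:
        scale linear radiance by 2**ev, clip, gamma-encode to 8 bits.
        The EV offsets mirror the -4 / 0 / +3 EV sequence above."""
        scaled = hdr * (2.0 ** ev)
        encoded = np.clip(scaled, 0.0, 1.0) ** (1.0 / gamma)
        return (encoded * 255.0 + 0.5).astype(np.uint8)

    hdr = np.array([0.002, 0.5, 10.0])  # deep shadow, midtone, bright sky
    print(expose(hdr, ev=-4.0))         # darker: sky detail survives
    print(expose(hdr, ev=0.0))          # the "desired" middle exposure
    print(expose(hdr, ev=+3.0))         # lighter: shadow detail appears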

References

  1. "Compositing Multiple Pictures of the Same Scene", by Steve Mann, in IS&T's 46th Annual Conference, Cambridge, Massachusetts, May 9-14, 1993
  2. "On Being ‘Undigital’ With Digital Cameras: Extending Dynamic Range By Combining Differently Exposed Pictures"., S. Mann and R. W. Picard
  3. "Flickr: HDR". Retrieved on 2007-01-29.
  4. J. Paul Getty Museum. Gustave Le Gray, Photographer. July 9 – September 29, 2002. Retrieved September 14, 2008.
  5. "High Dynamic Range Image Encodings" by Greg Ward, Anyhere Software; "Perception-motivated High Dynamic Range Video Encoding" by Max Planck Institute for Computer Science
  6. Debevec, Paul (1997). "Recovering High Dynamic Range Radiance Maps from Photographs".
  7. Debevec, Paul (1998). "Rendering Synthetic Objects into Real Scenes: Bridging Traditional and Image-Based Graphics with Global Illumination and High Dynamic Range Photography".
