Thomas Huang

Thomas Shi-Tao Huang (traditional Chinese: 黃煦濤; simplified Chinese: 黄煦涛; pinyin: Huáng Xùtāo; born in Shanghai) is a professor at the University of Illinois at Urbana-Champaign (UIUC). Huang is one of the leading figures in computer vision, pattern recognition, and human-computer interaction.

Biography

Huang studied electronics at the National Taiwan University and received his bachelor's degree in 1956. He then moved to the United States for graduate study and obtained his Sc.D. degree from the Massachusetts Institute of Technology (MIT) in 1963.[1]

Huang is the William L. Everitt Distinguished Professor in the UIUC Department of Electrical & Computer Engineering and the Coordinated Science Lab (CSL). He is also a full-time faculty member at the Beckman Institute, where he participates in the Image Formation and Processing and Artificial Intelligence laboratories.[1]

Research

Multimodal human-computer interaction, especially the use of speech- and vision-based techniques in developing more natural and effective interfaces as alternatives or complements to conventional interfaces such as the keyboard and the mouse. Research projects include the integration of speech recognition and visual gesture analysis in controlling the display in virtual environments, and the use of visual lip reading to enhance the accuracy of audio speech recognition.

3-D modeling, analysis, and synthesis (animation) of the human face, hands, and body. The original motivation for this research is very low bitrate 3-D model-based video coding, especially for video phone and teleconferencing scenarios. The idea is that if a 3-D model of the user at the transmitting end is constructed at the receiving end, then only the movement information needs to be extracted at the transmitting end and sent to the receiving end, where it is used to drive the 3-D model and regenerate the video sequence. The tools developed for these scenarios are also applicable to many other problems, such as virtual space conferencing with avatars and electronic games.

Multimedia (images, video, audio, text) databases, including content-based image retrieval. Of special interest are the use of relevance feedback in adapting the database system to user intentions (when browsing or searching), and the construction of a table of contents and a semantic index for video using multimedia information (image sequence, audio, and closed captions if available).

Although the above problems are application motivated, the main goal is to develop general concepts, methodologies, theories, and algorithms that are widely applicable to multimodal and multimedia signal processing. Huang's research is supported by the NSF, DOD, the UIUC Research Board, and a number of industrial firms.

Published work

Representative publications by Thomas Huang include:

Honors and Outstanding Achievements

Huang has received numerous honors and awards in his career, including:[2]

References

  1. "Beckman Institute Directory: Thomas S. Huang". Beckman Institute for Advanced Science & Technology. Retrieved May 28, 2010.
  2. "Thomas S. Huang". ECE Illinois, Department of Electrical and Computer Engineering. Retrieved May 28, 2010.
  3. "IEEE Jack S. Kilby Signal Processing Medal Recipients" (PDF). IEEE. Retrieved February 27, 2011.
  4. "IEEE Jack S. Kilby Signal Processing Medal Recipients - 2001 - Thomas S. Huang and Arun N. Netravali". IEEE. Retrieved February 27, 2011.