Margaret Oakley Dayhoff

From Wikipedia, the free encyclopedia

Dr. Margaret Belle (Oakley) Dayhoff (March 11, 19251983) was an American biochemist and a pioneer in the field of bioinformatics. She was the first woman to hold office in the Biophysical Society, first as Secretary and eventually President. She originated one of the first substitution matrices, Point accepted mutations or (PAM).

[edit] Early life

Dayhoff was born an only child in Philadelphia, but moved to New York City as a child. Her academic promise was evident from the outset; she was valedictorian (class of 1942) at Bayside High School, Bayside, New York and from there received a scholarship to Washington Square College of New York University, graduating magna cum laude in mathematics in 1945.

[edit] Research

From there, Dayhoff undertook a Ph.D. in quantum chemistry, under George Kimball, in the Columbia University Department of Chemistry. In her graduate thesis, Dayhoff had pioneered the use of computer capabilities -- i.e. mass-data processing-- to theoretical chemistry; specifically, she applied punch card machines to calculate the resonance energies of several polycyclic organic molecules.

After completing her Ph.D, Dayhoff studied electrochemistry at the Rockefeller Institute from 1948 to 1951. In 1952, she moved to Maryland with her family and later received a research fellowship from the University of Maryland (1957-1959), working on a model of chemical bonding with Ellis Lippincott. She taught physiology and biophysics for 13 years, while becoming affiliated with the National Biomedical Research Foundation, a Fellow of the American Association for the Advancement of Science, a councillor of the International Society for the Study of the Origins of Life (1980) and acting on the editorial boards of DNA, Journal of Molecular Evolution and Computers in Biology and Medicine.

By the 1960s, Frederic Sanger’s discovery of the complete amino acid structure of insulin had spurred tremendous interest in deducing the function of biomolecules from their biochemical structure. But this interest was also sparked by the theory that small divergences in sequence homologies (sequences with a high likelihood of common ancestry) in informational macromolecules could indicate the process and rate of evolutional change on the molecular level. The notion that such molecular analysis could help scientists decode evolutionary patterns in organisms was formalized in the published papers of Emile Zuckerkandl and Linus Pauling during the early 1960s. Dayhoff worked side by side with Lippincott and Carl Sagan on thermodynamic models of cosmo-chemical systems, including prebiological planetary atmospheres.

Dayhoff went on to pioneer the development of programmable computer methods for use in comparing protein sequences and deriving their evolutionary histories (in other words, discerning homologies) from their sequence alignments. Though this was before the days of massive outputs of sequence information by automated and other methods, Margaret Dayhoff was perspicacious enough to anticipate the potential pertinence of computers to the current theories of Zuckerkandl & Pauling and the method which Sanger had engineered.

With Richard Eck, she published the first reconstruction of a phylogeny (evolutionary tree) by computers from molecular sequences, using a maximum parsimony method. She also formulated the first probability model of protein evolution, the PAM model, in 1966.

She initiated the collection of protein sequences in the Atlas of Protein Sequence and Structure, a book of all known protein sequences that she published in 1965. It has since been republished in several editions. This led to the Protein Information Resource database of protein sequences, which was developed by her group. It and the parallel effort by Walter Goad which led to the Genbank databse of nucleic acid sequences are the twin origins of the modern databases of molecular sequences. The Atlas was organized by gene families, and she is regarded as a pioneer in their recognition. Her approach to proteins was always determinedly evolutionary.

David Lipman has called Dayhoff the mother and father of bioinformatics. Lipman, who is director of the National Center for Biotechnology Information is also the scientist who spearheaded the collaborative project that produced BLAST. His ongoing work in developing better computational methods for molecular biology attests to his inheritance of Dayhoff’s legacy.

[edit] External links