Talk:Structural alignment

From Wikipedia, the free encyclopedia

Good article Structural alignment has been listed as one of the Natural sciences good articles under the good article criteria. If you can improve it further, please do. If it no longer meets these criteria, you can delist it, or ask for a reassessment.
September 30, 2006 Good article nominee Listed
Molecular and Cellular Biology WikiProject This article is within the scope of the Molecular and Cellular Biology WikiProject. To participate, visit the WikiProject for more information. The WikiProject's current monthly collaboration is focused on improving Restriction enzyme.
Good article GA This article has been rated as GA-Class on the assessment scale.
Low This article is on a subject of Low-importance within molecular and cellular biology.

Article Grading: The article has been rated for quality and/or importance but has no comments yet. If appropriate, please review the article and then leave comments here to identify the strengths and weaknesses of the article and what work it will need.

Contents

[edit] From PNA/Biology

This entry is not only bad, but probably full of mistakes. Plese feel free to voice your opinion, create new topics where appropriate and generally change at will the contents of this page.

The correct refence to cite SARF2: Alexandrov, N.N. SARFing the PDB. Protein Engineering (1996), 9:727-732. There is a mistake in the authorship of SARF2 program.

You aren't wrong about it being full of mistakes!
  1. Some approaches use quaternerions to reduce the dimensionality of the space -- Actually, quaternions increase the dimensionality of rotation space.
  2. The Dali algorithm (named after Salvador Dali) uses network isomorphism between the contact networks of two proteins to perform alignment. -- DALI is an abbreviation of Distance matrix Alignment.
Stewart Adcock 23:21, 20 Feb 2004 (UTC)

I think you should be brave and fix this page!

Don't worry, I definitely will -- when I have time. I even have a report that I could wikify. Stewart Adcock 17:15, 16 Mar 2004 (UTC)
Cool! --Dan 15:34, 1 Apr 2004 (UTC)

Did a complete rework, hope it is a bit clearer... One could expand on the algorithmic side of things, as well as on the use of the method. Will try, as soon as I find time. Dr. Strangelove 20:00, 19 Jul 2004 (UTC)

---

The following comments made by me, Andrew Dalke <dalke@dalkescientific.com> on the Biopython mailing list. I don't like learning a bajillion wiki formatting conventions so I'm leaving this as-is on the talk page.

If it's two conformations of the same structure and the goal is to minimize overall RMSD through a single global alignment matrix then the usual reference I know is Kabsch

 [Kabsch, 1976] Kabsch, W. (1976). A solution for the best rotation to relate two sets of vectors. Acta. Crystal, 32A:922-923.
 [Kabsch, 1978] Kabsch, W. (1978). A discussion of the solution for the best rotation to related two sets of vectors. Acta. Crystal, 34A:827-828.

(The first had an ambiguity that could cause a sign error; fixed in the second.)

Several structure program implement that including:

 O - http://xray.bmc.uu.se/usf/factory_4.html
 VMD - http://www.ks.uiuc.edu/Research/vmd/vmd-1.7.1/ug/node183.html
 PyMol - http://www.pymolwiki.org/index.php/Kabsch

This algorithm is not mentioned on the Wiki page and is 11 years older than the oldest mentioned reference. It isn't NP-hard as alluded to in the Wiki, since it's solving a much simpler problem.

The date for ProFit is strange on the Wiki. I knew about ProFit before the claimed 1996. The web site says http://www.bioinf.org.uk/software/profit/faq.html

No paper has been published describing ProFit itself since it is simply a convenient program (I hope) to let you use a standard fitting algorithm consequently, it is a little difficult to reference. The exact wording is up to you and dependent on the context, but I suggest something similar to: Fitting was performed using the McLachlan algorithm (McLachlan, A.D., 1982 ``Rapid Comparison of Protein Structres, Acta Cryst A38, 871-873) as implemented in the program ProFit (Martin, A.C.R., http://www.bioinf.org.uk/software/profit/).

The McLachlan algorithm is also not mentioned in the Wiki.

I found a usenet announcement about ProFit at http://groups.google.com/group/bionet.software/msg/f219c5163bbadbdc?dmode=source&hl=en

 From: mar...@bsm.bioc.ucl.ac.uk (Andrew Martin)
 Subject: ANNOUNCE: Protein Fitting Software
 Date: 1995/07/20
 Message-ID: <1995Jul20.101545.37119@ucl.ac.uk>#1/1
 X-Deja-AN: 106640715
 organization: University College London
 newsgroups: bionet.software
 
 I've finally (after about 2 years) got around to fixing the last (?)
 remaining bugs in my ProFit protein least-squares fitting software and
 decided to make it generally available.

so the date should be 1995 or 1993 or earlier. I suspect the 1996 date came from the "first written" date for the documentation, at http://www.bioinf.org.uk/software/profit/doc/

what happened to the image? Str-alignment.png

according to the log the link was removed because the image was deleted from wikimedia

any way to link it back?

[edit] Software List

the list of available programs is gone.... think it was a good idea. —Preceding unsigned comment added by 146.203.21.23 (talk • contribs) 12:16, 3 July 2006

It was moved to Sequence alignment software, currently linked in the See Also section. It'll get a more prominent link when the list is in better shape. If you have any new additions, or a perspective on which packages are the most commonly used, that would be great! Opabinia regalis 00:11, 4 July 2006 (UTC)
'Sequence alignment software' perhaps is not the most accurate place for structure based alignment programs.—Preceding unsigned comment added by 146.203.21.237 (talk • contribs) 14:46, 7 July 2006
Structural alignment software now redirects there. I think the advantages of having everything in one place outweigh the minor disadvantage of following a redirect, but I'll put the question on the sequence alignment peer review if anyone else wants to chime in. Opabinia regalis 01:15, 8 July 2006 (UTC)

[edit] GA review

1. Well written? Pass, upon changes below.
2. Factually accurate? Pass
3. Broad in coverage? Pass
4. Neutral point of view? Pass
5. Article stability? Pass
6. Images? Pass
6. Appropriate referencing? Pass, upon changes below.

Additional comments: Some changes needed.

  • However, at O(n10 / ε6) for a globular protein of n residues, the algorithm is still too expensive for practical use.
I don't think the algorithm itself is expensive, don't you mean expensive to implement in terms of CPU time?
  • Representation of structures
No references in this section, could add one to cover the statement about improving data by discarding noise.
  • This representation is expensive because the features in the square matrix are symmetrical...
Again, expensive isn't what you mean. TimVickers 14:06, 30 September 2006 (UTC)

Thanks for your image and useful copyedits. I added the relevant ref and clarified your examples to "computationally expensive" and "memory-intensive" respectively; I hadn't thought about that usage being confusing. Opabinia regalis 15:54, 30 September 2006 (UTC)

[edit] Pls remove TM-Align

In the Methods section, only DALI, SSAP and CE should be included (but not TM-Align). DALI, SSAP and CE are considered "classical methods" in this area, and the papers on them have received a large number of citations (100+ for each). We should not describe the new and not-so-popular-yet methods like TM-Align, because there also exist several other structural alignment methods. (Please see http://en.wikipedia.org/wiki/Structural_alignment_software). If we include TM-Align, we also should include all of these methods. However, I would like to recommend to add some popular methods like: VAST (recognized by PDB), FATCAT (recognized by PDB), and SSM (recognized by SCOP). Zeyaraun 10:48, 18 December 2006 (UTC)

Thanks, for some reason this article fills up with good-natured but promotional text if unwatched. I've shortened the TM-align section (it was rather wordy and diffuse anyway) out of desire not to give new and 'exciting' methods undue weight; however, as a Zhang/Skolnick project I think it's notable enough to merit a blurb. The new arrangement also provides an easy way to add other new developments without bloating the article.
On the other methods, feel free to add sections on them if you're interested. VAST in particular, as it's quite old and maybe not used so much anymore, but a notable piece of the development of these techniques. Opabinia regalis 01:46, 20 December 2006 (UTC)