Writer invariant

From Wikipedia, the free encyclopedia

A writer invariant, also called an authorial invariant or author's invariant, is a property of a text that is invariant with respect to its author: it is similar in all texts by a given author and different in texts by different authors. It can be used to detect plagiarism or to identify the real author of an anonymously published text. In handwritten text recognition, the term also refers to an author's characteristic pattern of writing a letter.[1]

While it is generally recognised that writer invariants exist,[2] there is no agreement on which properties of a text should be used.[2][3] Among the first to be used was the distribution of word lengths;[2] other proposed invariants include average sentence length,[2][3] average word length,[2][3] frequency of noun, verb or adjective usage,[3] vocabulary richness,[2] and the frequency of function words in general[2][3] or of specific function words.[3]
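As an illustration only, several of the candidate invariants listed above can be computed with a few lines of Python. This is a minimal sketch, not the method of any cited source: the function name `candidate_invariants`, the regular-expression tokenizer, the splitting of sentences on `.`, `!` and `?`, and the use of the type-token ratio as a stand-in for "vocabulary richness" are all assumptions made for the example.

```python
import re
from collections import Counter


def candidate_invariants(text):
    """Compute a few simple stylometric measures for one text.

    Assumptions (not taken from the cited sources): words are runs of
    ASCII letters and apostrophes, sentences end at '.', '!' or '?',
    and vocabulary richness is approximated by the type-token ratio.
    """
    words = re.findall(r"[a-z']+", text.lower())
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        raise ValueError("text must contain at least one word")

    total = len(words)
    length_counts = Counter(len(w) for w in words)
    return {
        # Share of words having each length, keyed by length.
        "word_length_distribution": {
            n: c / total for n, c in sorted(length_counts.items())
        },
        "average_word_length": sum(len(w) for w in words) / total,
        # Average number of words per sentence.
        "average_sentence_length": total / len(sentences),
        # Distinct words divided by total words (type-token ratio).
        "vocabulary_richness": len(set(words)) / total,
    }


if __name__ == "__main__":
    sample = "The cat sat. The dog ran off!"
    for name, value in candidate_invariants(sample).items():
        print(name, value)
```

Whether any of these measures is stable enough to act as an invariant is an empirical question; in practice each would have to be computed over many texts by the same and by different authors and the results compared.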

Of these, average sentence length can be very similar in works by different authors[2][3] or can vary significantly even within a single work;[3] average word length likewise turns out to be very similar in works by different authors.[3] Analysis of function words shows promise because authors use them unconsciously.[2][3][4]
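A function-word analysis of the kind described above might be sketched as follows. The example is hypothetical: the short `FUNCTION_WORDS` list, the function names, and the use of a summed absolute difference to compare two profiles are assumptions chosen for brevity, not details taken from the cited studies.

```python
import re
from collections import Counter

# A small illustrative list; real studies would use a longer,
# language-specific list of function words (assumption).
FUNCTION_WORDS = ("the", "of", "and", "to", "in", "a", "that", "but", "with", "on")


def function_word_profile(text, function_words=FUNCTION_WORDS):
    """Return the relative frequency of each function word in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        raise ValueError("text must contain at least one word")
    counts = Counter(words)
    return [counts[w] / len(words) for w in function_words]


def profile_distance(p, q):
    """Sum of absolute differences between two profiles (smaller = more alike)."""
    return sum(abs(a - b) for a, b in zip(p, q))


if __name__ == "__main__":
    known = function_word_profile("The cat and the dog sat on the mat in the sun.")
    disputed = function_word_profile("A bird sang to the moon, but the moon was silent.")
    print(profile_distance(known, disputed))
```

Under this sketch, a disputed text would be compared against profiles built from texts of known authorship, with a smaller distance suggesting a closer stylistic match. A small distance on its own would not establish authorship.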

References

  1. Ali Nosary, Laurent Heutte, Thierry Paquet and Yves Lecourtier. "A Step Towards the Use of Writer's Properties for Text Recognition". Laboratoire Perception, Systèmes, Information (PSI), Université de Rouen. Retrieved on 2007-09-06.
  2. "Quantitative analysis of literary styles. (General)." (August 2002). The American Statistician.
  3. Fomenko, A. T.; V. P. Fomenko and T. G. Fomenko (2005). "The authorial invariant in Russian literary texts. Its application: who was the real author of the "Quiet Don"?", History: Fiction or Science?, 425-444. ISBN 2913621066.
  4. Buckland, Warren. "Forensic Semiotics". The Semiotic Review of Books 10 (3).