From Wikipedia, the free encyclopedia
I am currently working on a vandalism detection algorithm based on Bayesian statistics. To train this algorithm, I need, as you have probably guessed, a metric assload of diffs of both vandalism and non-vandalism. Please post them below so I can use them as training data. I estimate that I will need at least one thousand diffs covering varied types of vandalism and non-vandalism.
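To give contributors a rough idea of how labeled diffs could be used, here is a minimal sketch of one Bayesian approach: a naive Bayes classifier with add-one smoothing over the words in a diff. This is only an illustration, not necessarily the algorithm being built here. The class name, the tokenizer, the label strings, and the toy training strings are all assumptions made up for the example, and it treats each diff as just the text it added.

```python
import math
import re
from collections import Counter


class NaiveBayesDiffClassifier:
    """Hypothetical naive Bayes sketch; names and labels are illustrative."""

    def __init__(self):
        # Per-label word counts and per-label number of training diffs.
        self.word_counts = {"vandalism": Counter(), "not_vandalism": Counter()}
        self.doc_counts = Counter()

    @staticmethod
    def tokenize(text):
        # Very crude tokenizer (an assumption): lowercase alphabetic words only.
        return re.findall(r"[a-z']+", text.lower())

    def train(self, added_text, label):
        # Record one labeled diff, represented by the text it added.
        self.doc_counts[label] += 1
        self.word_counts[label].update(self.tokenize(added_text))

    def log_scores(self, added_text):
        # Unnormalized log posterior for each label.
        vocab = set().union(*self.word_counts.values())
        total_docs = sum(self.doc_counts.values())
        scores = {}
        for label, counts in self.word_counts.items():
            # Smoothed prior: share of training diffs carrying this label.
            score = math.log(
                (self.doc_counts[label] + 1) / (total_docs + len(self.word_counts))
            )
            total_words = sum(counts.values())
            for token in self.tokenize(added_text):
                # Add-one smoothing, with one extra slot for unseen words.
                score += math.log(
                    (counts[token] + 1) / (total_words + len(vocab) + 1)
                )
            scores[label] = score
        return scores

    def classify(self, added_text):
        # Pick the label with the highest log posterior.
        scores = self.log_scores(added_text)
        return max(scores, key=scores.get)
```

Each diff posted below would become one `train(added_text, label)` call once its added text has been extracted. With only a handful of examples the word counts are too sparse to be useful, which is why a large and varied set of diffs is needed.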
Vandalism
http://en.wikipedia.org/w/index.php?title=Samra&diff=94235323&oldid=94056919 — VANDALISM
Not Vandalism