Wikipedia:WikiProject Films/New articles

From Wikipedia, the free encyclopedia

See New articles by topic

Recently created Film-related articles can be found here. Please nominate the best new articles at Template talk:Did you know so Wikipedia can highlight them on the main page.

Contents

[edit] Current bot results

The following articles have been identified by AlexNewArtBot as potentially being within the scope of the project, based on the Cinema User:AlexNewArtBot/Cinema. It is likely that many are false positives; please examine the User:AlexNewArtBot/CinemaLog for more information. Please scan through the backlog in the archives for more articles.



Archives of this page                                                                                                                                                              

[edit] Archives

[edit] Adding feeds to the new article bot

The following is Alex Bakharev's rules for adding Film-related feeds to the new article bot. Contact him for more information.

[edit] Step 1 - select a name for the new feed

The name should not be used for any other feeds. It should not contain spaces and other non-letter symbols. It should be disambigous. E.G. UK is not good: is it Ukraine or United Kingdom. It should be reasonably short and you should be able to spell it uniformly through out a few step. E.g. Pneumonoultramicroscopicsilicovolcanoconiosis is not a good name.

[edit] Step 2 - announce the new feed

Put the template {{Subst:User:AlexNewArtBot/NewFeed|FeedName|Portal Name}} at the bottom of the #Currently supported section of this page. Here the FeedName stand for the name you have selected and the Portal Name is the name of a Portal page with the feed (Portal:### or Wikipedia:WikiProject ###). The feed would create redlinks for the Rules, Search Results and Log of the new feed. The Portal Name will be linked automatically, so there is no need to put it in square brackets.

[edit] Step 3 - compile the rules

Well this is the most tricky part: You have to provide some rules for the bot. Each rule has some numerical value (might be negative). All the values from the rules applicable to the article are added together to get a score. If a rule matches the lead of the article the points for the rule are doubled. If the final score is above the threshold the article is in.

Both threshold and rules a written in the rules page. One line per rule (and one line per threshold).

Threshold is specified as

@@number@@ where the number is the threshold (duh). If ommited the default threshold is 10 points. E.g.
@@14@@

means 14 points threshold

Rules has format:

Points /Pattern that we should have/ , /Inhibitor1/ , /Inhibitor2/ , /Inhibitor3/ ...

The Points is the number of points for the rule. If omitted than by default every rule costs 10 points. Do not forget that if the lead is matched points are doubled. Pattern that we should have is a Regular expression as in Perl that should be matched in the text of an article so to rule fired. The inhibitors are the patterns that "inhibit" the rule. Making it inactive even if the Pattern that we should have is matched. E.g when creating the rules for the Russia related articles I want to include Saint Petersburg - the second largest city in Russia, On the other hand many American articles mention Saint-Petersburg, Florida and other American cities. Thus, I could want to decrease the value of the rule and inhibit it completely if Florida mentioned:

7 /Petersburg/ , /Florida/

In general the names of a country or of its capital are often mentioned in unrelated articles (e.g. somebody travelled there, etc.) But lead rarely mentions unrelated articles, thus usually we would want to have the name of the country to be below the threshold but above the half-threshold.

Categories are usually friends of the bot (if only all the new article writers used them!) so they deserve the cost above the threshold.

Note that \W (uppercase only) is needed to mark a word boundary. Without this by default a rule can match any part of a word.

The following symbols

{}[]()^$.|*+?\

must be preceded by \ to be taken literally. Otherwise they have special functions: * - wildcard, x? - optionality of the preceding symbol or bracketed string, (xy) - scope marking (e.g. for the purpose of | or ?), (x|y) or [xy] - alternatives, etc.

For other inspirations look in the rules for similar newsfeeds.

There are a few magical words in the rule file:

  • $USER substitutes into the user name
  • $SIZE>value / $SIZE < value : if the pattern true then it is matched.

[edit] Step 4 - inform the bot about the new job

Put into the bottom User:AlexNewArtBot/Master a new line with the name of the newsfeed (the same as the name of the rules file after the /).

If the bot is suppose to feed new articles into a board, we usually do not want the bot to post the articles already published on the board. We give the bot the name of such a board by putting =>Board name after the newsfeed name

The next time the bot works it would work on your feed.