User:HBC Archive Indexerbot

From Wikipedia, the free encyclopedia

Contents

This bot is approved. HighInBC (Need help? Ask me) 00:48, 9 December 2006 (UTC)

[edit] Introduction

I am an automated bot coded and operated by HighInBC and Krellis. I can read through piles of talk page archives and produce a handy index of topics with links. This will allow you to browse or search the entire archive for keywords or recurring discussions. Anyone can request that I service any page on the English Wikipedia, simply follow the easy instructions below!

[edit] Instructions

To use this service you need archives, a destination page, and a page for the request.

  1. First create a destination page (the one you want the index to be written to) and place the following text on this page: <!-- HBC Archive Indexerbot can blank this -->
    This will not be visible, as it is contained in HTML comment tags; this gives the bot permission to write to the page.
  2. On the root page of the archives you want indexed, usually the talk page itself, place the following code: {{User:HBC Archive Indexerbot/OptIn|target=<target>|mask=<mask>|leading_zeros=<zeros>|indexhere=<indexhere>|template=<template location>}}
    1. <target> is your destination page (created in step 1)
    2. <mask> is used to describe how to find your archives, see below for details. You may specify multiple masks by specifying the mask=<mask> parameter multiple times. Pages matching all of the masks will be combined together for the final index.
    3. <zeros> is the number of leading 0's in your archive title, "Archive 01" would be 1 leading zero, "Archive 1" is 0 leading zeros.
    4. <indexhere> should be yes or no. If yes, it will index the root talk page the request is on as well as the archive pages.
    5. <template location> is the name of a wiki page containing an archive index template. The template syntax is described below. This parameter is optional, and if left out or left blank, the default template will be applied.
    Note: Relative paths may be specified, and will be assumed to be relative to the page on which the OptIn code is placed. If you are unsure, full paths are always best.
  3. Wait; the next time the bot runs it should write your report.

[edit] Mask

Your mask is simply the path of your archives with the numbers replaced with <#>.

Example:

User talk:HighInBC/Archive <#> would cover User talk:HighInBC/Archive 1 through the last archive.

If your archive has leading zeros, then set the leading_zeros parameter to the amount of leading zeros you have.

If your archives are named using dates rather than numbers, or by topic, you can specify multiple mask parameters in the template, each of which will be read in turn. If there's no incrementing number, simply do not include the <#> in the mask, and it will be used as an individual page name instead.

[edit] Template

Templates are defined as a series of HTML comments followed by wikitext for each part of the page. See the default template for an example template definition. Each section is introduced with an HTML comment consisting of only the section name. All content from the next line until the next section heading is considered to be part of the section. Blank lines inside sections are ignored. The possible sections of a template are:

Section Name Description
LEAD Introductory text, to be placed at the top of the archive index page.
HEADER A header for the index, placed after the bot's description of how the index was created, but before any rows of data are output. Typically used to begin a table, create column headings, etc.
ROW The format of an actual row of data. There are a number of variables available in the row:
  • %%topic%% - The topic from the archives, as pulled from the archive heading.
  • %%replies%% - The estimated number of replies contained within that section of the archive.
  • %%link%% - A link directly to the appropriate section in the appropriate archive for this topic.
  • %%first%% - Timestamp of first comment in the topic (YYYY-MM-DD HH:MM:SS format). Available in epoch seconds as %%firstepoch%%.
  • %%last%% - Timestamp of last comment in the topic (YYYY-MM-DD HH:MM:SS format). Available in epoch seconds as %%lastepoch%%.
  • %%duration%% - Duration of the topic (difference between first and last, in normalized text format). Available in raw seconds as %%durationsecs%%.

This section is required (and this is the only section required in a template, all others are optional).

ALTROW The format of an alternating row. If provided, this section will be used instead of the "Row" section for every other row of data, starting with the second. It is generally used to alternate row colors within a table. The same variables described in "Row" above are available.
FOOTER A footer, output immediately after the last row of data. Generally used to close a table, if one was started in the header.
TAIL A final section below the footer, for any comments, bottom-of-page navigation, categories, etc.
END Indicates that the template definition is over, no more parsing will take place.

[edit] Schedule

The bot is currently scheduled to run twice per day, at 11:23 and 23:23 UTC. It may run at additional times beyond those, and may miss some scheduled runs while it is under continued development/bugfixing.

[edit] Caveats

  • Very long sets of archives such as Wikipedia:Administrators' noticeboard/IncidentArchive<#> will not archive due to timeout errors and maximum page size.
  • When using sortable table for results, the %%duration%% will sort alphabetically, which is probably unfavourable behavior. To work around this, use invisible %%durationsec%% variable like this: <span style="display: none;">%%durationsecs%%</span> %%duration%%

[edit] Links