Wikipedia talk:What is an article?

From Wikipedia, the free encyclopedia

Contents

[edit] Moved from Talk:Main Page

OK now that the new wording is in place on the Main Page that states; "Anyone, including you, can edit any article right now, without even having to log in.", I think it is now necessary for us to write down our definition of just what a Wikipedia article is (some people may be confused with the "edit any article" statement when the main page and several policy pages are clearly read-only by non-admins). Here is a rough sketch of what I feel should be in such a definition;

There are many pages in Wikipedia, far more than we actually consider to be articles. For calculating our site statistics and for regular conversation we have a more or less specific definition of what constitutes an article.
A Wikipedia article can be defined as a page in the database that either has encyclopedic or almanac-like information on it ("almanac-like" being; lists, timelines or charts).
This does not include any pages in any specified namespace such as:
All these specified namespaces also have a yellowish background color to distinguish them from pages in the article namespace which have white backgrounds.
However there are still some non-articles in the article namespace; most notably:
  • the Main Page;
  • hundreds of very short "stub" pages that cannot be considered real articles yet (a rough measure for purposes of automatic detection is that stubs do not contain any commas)
  • thousands of "redirect" pages which are used to re-route traffic going to one page to another page.
It should be noted that our naming conventions only cover what articles should be named. Therefore it is perfectly fine to use whatever capitalization, pluralization or transliteration you like for your user page and even pages in the other namespaces (except special, which can only be created by developers).
The Main Page and a few of the most important policy pages in the Wikipedia: namespace (such as Wikipedia:Policies and guidelines) are protected, because experience has shown that the main page is a major target for vandals and Wikipedia Policy has to be agreeded to by consensus before being changed. But every article and the vast majority of non-article pages can be edited by any user, including you right now!

Please go ahead and edit the above or leave suggestions below. As soon statement is agreed to I plan on placing it in Wikipedia:What is a Wikipedia article? or something similar and then sublink that to the word "article" in the intro message on the front page. --mav

Great, and I've made what I hope are improvements. (I also indented it for readability, but of course the final version won't have that.) I suggest putting it in Wikipedia:What is an article; the lack of a question mark or a repetition of "Wikipedia" will make linking easier. I've hedged a bit on the precise definition of "stub", because my impression is that the comma guideline is just an approximation for purposes of automatic detection (as I wrote), but only human judgement can tell if something is really a stub. However unlikely, a long and substantive article might consist entirely of short declarative sentences and no commas; now that I think of it, we probably have some list pages that fall into this category. OTOH, a page whose only content is a repetition of the title (the ultimate stub) will still have a comma in it if the title does. — Toby 21:50 Aug 12, 2002 (PDT)

Great! I really like your improvements, but I did tweak the reason why certain pages are protected; No Admin has to ask permission to make minor changes and copyedits to any protected page. Any major reorg or refactoring should be announced (a major reorg of the Main Page would need to be agreeded to before hand though). The only time permission and/or consensus is needed is if there will be any change in the meaning of any policy page. --mav

End moved talk


Methinks it would be useful to briefly explain what the namespaces are for, in stead of only mentioning them with an example. Or will it get too lengthy then? Jeronimo

Good idea - although that should be a very brief explaination since namespaces have their own FAQ page. --mav

Is is true that disambiguation pages are not automatically detected and thus are counted in the total number of articles given on the Main Page? — Toby 01:14 Aug 16, 2002 (PDT)

Last time I checked, yes. This is yet another feature request I have in my head -- trouble is that so many of the requests are already from me. Fixing this souldn't be hard though. --mav

OK, somebody has to say this: The fact that we are patting ourselves on the back for intentionally undercounting our articles is just plain silly. I just now went and looked under "short pages" at all 28 pages with exactly 100 bytes, and 13 of them contained a comma. Not a single one of them deserves to be called an article, but almost half are counted. Next I looked at all 33 pages with exactly 200 bytes, and 27 of those contained a comma. A few of them (not eighty percent!) might be considered articles under an extremely lenient definition of article, but does anyone outside of Wikipedia consider a single, brief paragraph to be an article? Are ANY of Brittanica's articles under 500 bytes?

I estimate our median article size as 1000 bytes, because that's the size of our 18943rd longest page according to long pages. (18943 would be the median of 37886 total articles.) To my mind, a conservative count of articles would place an 1000-byte minimum, rather than a 1000-byte median, which would trim our total article count in half. But no matter how we count articles, let us at least prominently post the median size of the articles which are included in the count. And please, please don't call the count "unimpeachable". (For refernence, my little tirade (including this sentence) is 1367 bytes long, i.e. rather longer than our median article.)

--Fritzlein 02:55 Aug 17, 2002 (PDT)

Print encyclopedia's have many very short articles that are little more than definitions. I do agree that our automatic article count is not as conservative as it should be though. It should, for example, search for and exclude from the count any page that has the word "disambiguation" in the text or title and do some other things to exclude pages that are probably better described as definitions of encyclopedia topics and not encyclopedia articles (that is, there needs to be better stub detection). --mav


This FAQ looks good to me now. Shall we link it from Wikipedia:FAQ and the Main Page now? --mav

(Of course, mav changes it after saying this.) Yeah, I think that it's fine, although it'd be nice to write the pages that it links to too. That shouldn't stop us from linking to it, however. — Toby 15:30 Aug 21, 2002 (PDT)

Coolness - now all we need is a small entry in the actual FAQ about this with a brief answer. I will go ahead link it from the Main Page. --mav

Is it just me or has the number of articles dropped? Lir 23:36 Nov 14, 2002 (UTC)

No. It has been at 90679 since Monday. --mav

I prefer the byte-based distinction of stub/article. I'm working on Japanese Wiki, and Japanese writing can go on in a great length without any commas, partly because we have a similar but different Japanese sign for that: "A"

I imagine this situation is not unique to Japanese language.

Tomos 00:35 Feb 3, 2003 (UTC)

[edit] ZONGULDAK regional info @ history in English

You can use text on singlix web site, zonguldak web pages...

I am the owner of these web pages...

INFO page text taken from www.voyagerbook.com but, recently i have updated it at singlix web site.

HISTORY is written by me, directly...

There are not any copyrights on these pages... (www.voyagerbook.com Zonguldak page is not orginal source of information about Zonguldak. All text is compilation/adaption from official turkish state/tourism documents.)

USE THE TEXT of Following web pages...

http://www.singlix.com/zonguldak/index.html http://www.singlix.com/zonguldak/info.html http://www.singlix.com/zonguldak/history.html

Regards... Erdogan Tan


[edit] Problem with Article Count / Article Description

The definition on an article, at Wikipedia:What is an article is reached by clicking onto the word article by the count of articles on the main page. It launches straight into an explanation of what an article is & is not.

However it is not until you get lower down the page that you discover that the published count is not the same as the definition given above: irrespective of the discussion over number of bytes, commas and links, the count does count stubs which are clearly marked in the text as stubs by msg:stub.

I'd earnestly suggest we should seek a means of getting a count which excludes stubs, even if the expedite way to do this is to parse a page as it is saved for the presence of the msg:stub (or a msg:stublist) which sets a flag that can be used by the counting software. Discuss. --Tagishsimon 00:08, 20 Apr 2004 (UTC)

[edit] Dictionary versus Encyclopedia

A lot of entries really do not belong to an encyclopedia but to a dictionary. Is the Wiktionary going to get combined with this project at some stage? Why aren't they already one thing? BozMo(talk)

No, it will likely never happen. In the beginnings of wikipedia, there were disagreements about whether we should include dictionary definitions or not. It was decided that Wikipedia is not a dictionary (see What Wikipedia is not), and wiktionary was created. ✏ Sverdrup 16:10, 26 Jun 2004 (UTC)

[edit] Templates and/or articles

There currently is a discussion going on at Miscellany for deletion about the alleged violation by the WikiProject Stub sorting of wikipedia's ban on ownership of articles. Key issue is whether WP:OWN applies to templates. From the text of WP:OWN, it seems that it only applies to articles. According to the list in this page, "an 'article' does not include any pages in any of the specified namespaces that are used for particular purposes, such as: * the Wikipedia namespace ... * the talk namespaces ... * the special namespace ... * the user namespace ... * the image namespace ... * the MediaWiki namespace." According to the definitions of wikipedia, are templates articles or not? Aecis praatpaal 23:50, 23 December 2005 (UTC)

[edit] count of articles

I've downloaded 20051213_pages_current.xml.bz2 from the downloads.wikipedia.org site. unpacking it gets me an XML file of what I thought would be all the current articles (where "current' is actually some moment on December 12th, 2005.)

Pawing through the file, I can find about 805,196 items. These items include what I would call "articles", but also include user pages, talk pages, and images. Maybe there's other stuff in there, too -- I haven't done that much aggregation just yet.

Do I have the wrong file if I expect all 800-thousand articles that the front page claims? Or is the definition here wrong, and the advertised count of articles really does include the Talk:, User:, and Image: namespaces? And some other stuff?

-- Mikeblas 06:58, 2 January 2006 (UTC)


I'm curious -- is there any way to get wikipedia to generate a page that compiles all original content for a particular page. There are some particular POV pages that would be nice to see the entirety of, especially if you could color card various spots based on their retraction times,etc .. Any ideas??

[edit] New namespaces?

Why can't Wikipedia simply move disambig and redirect to Disambiguation: and Redirect: namespaces? This would reduce clutter and make the site a little more streamlined. MOD 05:43, 9 April 2006 (UTC)


[edit] Structure for describing knowledge to give understanding

It is surprising to find that for a structure composed of articles there is so little discussion of what an article is, or ought to be :)
It seems to me that the aspirational goal of Wikipedia, as with all collections of discrete subjects of knowledge, is to describe these subjects in a way that enlightens the reader.
What is missing is structure.
Like myself, many people go and start articles, only to see them deleted or changed over time because someone doesn't agree. This means that at least two individuals have differing POV of the same subject.
This is not a new problem. ;)

I would suggest that Wikipedia may profit from creating an article template that includes:
Subject definition
Defining the subject of an article as a parent of an offspring item of knowledge

Most knowledge is derived, so the hierarchy needs to be included in the article to inform the reader where they are within the knowledge map

Subject scope of relationships within the field of knowledge hierarchy.
Introduction to the subject that includes basic information only
Sophisticated examination of major statements in the basic section
Expert statements on the subject stating:

  • least strict sourced [[interpretation] of the statement/s
  • most strict sourced interpretation of the statement/s
  • agreed middle ground sourced interpretation of the statement

Conclusion or summary that assists in the synthesis of the data presented
The usual administrative section of references, sources and links.
How does this sound?
I note that neither sophisticated nor field of knowledge have articles in Wikipedia! Yes, I know, its not a dictionary.... --Mrg3105 (talk) 03:42, 18 December 2007 (UTC)

Encyclopedic article has no article in Wikipedia. What better way to exemplify what an article should be than a well-formed meta-article?
The suggested template above would be a good start, although I personally don't see a need for a separate conclusion where a sufficient introduction already exists. A problem I see with many articles is that they are not adequately prepared for the uninformed reader—they consist exclusively of esoteric material (recent fictional works, science). Nahum Reduta (talk) 18:24, 19 January 2008 (UTC)