Wikipedia talk:What is an article?
From Wikipedia, the free encyclopedia
Contents |
[edit] Moved from Talk:Main Page
OK now that the new wording is in place on the Main Page that states; "Anyone, including you, can edit any article right now, without even having to log in.", I think it is now necessary for us to write down our definition of just what a Wikipedia article is (some people may be confused with the "edit any article" statement when the main page and several policy pages are clearly read-only by non-admins). Here is a rough sketch of what I feel should be in such a definition;
- There are many pages in Wikipedia, far more than we actually consider to be articles. For calculating our site statistics and for regular conversation we have a more or less specific definition of what constitutes an article.
- A Wikipedia article can be defined as a page in the database that either has encyclopedic or almanac-like information on it ("almanac-like" being; lists, timelines or charts).
- This does not include any pages in any specified namespace such as:
- the Wikipedia namespace (example, Wikipedia:Statistics);
- any of the talk namespaces (examples, Talk:Main Page or Wikipedia talk:Size of Wikipedia);
- special pages (example, Special:Recent Changes);
- user pages (example, User:Larry Sanger).
- All these specified namespaces also have a yellowish background color to distinguish them from pages in the article namespace which have white backgrounds.
- However there are still some non-articles in the article namespace; most notably:
- It should be noted that our naming conventions only cover what articles should be named. Therefore it is perfectly fine to use whatever capitalization, pluralization or transliteration you like for your user page and even pages in the other namespaces (except special, which can only be created by developers).
- The Main Page and a few of the most important policy pages in the Wikipedia: namespace (such as Wikipedia:Policies and guidelines) are protected, because experience has shown that the main page is a major target for vandals and Wikipedia Policy has to be agreeded to by consensus before being changed. But every article and the vast majority of non-article pages can be edited by any user, including you right now!
Please go ahead and edit the above or leave suggestions below. As soon statement is agreed to I plan on placing it in Wikipedia:What is a Wikipedia article? or something similar and then sublink that to the word "article" in the intro message on the front page. --mav
Great, and I've made what I hope are improvements. (I also indented it for readability, but of course the final version won't have that.) I suggest putting it in Wikipedia:What is an article; the lack of a question mark or a repetition of "Wikipedia" will make linking easier. I've hedged a bit on the precise definition of "stub", because my impression is that the comma guideline is just an approximation for purposes of automatic detection (as I wrote), but only human judgement can tell if something is really a stub. However unlikely, a long and substantive article might consist entirely of short declarative sentences and no commas; now that I think of it, we probably have some list pages that fall into this category. OTOH, a page whose only content is a repetition of the title (the ultimate stub) will still have a comma in it if the title does. — Toby 21:50 Aug 12, 2002 (PDT)
- Great! I really like your improvements, but I did tweak the reason why certain pages are protected; No Admin has to ask permission to make minor changes and copyedits to any protected page. Any major reorg or refactoring should be announced (a major reorg of the Main Page would need to be agreeded to before hand though). The only time permission and/or consensus is needed is if there will be any change in the meaning of any policy page. --mav
End moved talk
Methinks it would be useful to briefly explain what the namespaces are for, in stead of only mentioning them with an example. Or will it get too lengthy then? Jeronimo
- Good idea - although that should be a very brief explaination since namespaces have their own FAQ page. --mav
Is is true that disambiguation pages are not automatically detected and thus are counted in the total number of articles given on the Main Page? — Toby 01:14 Aug 16, 2002 (PDT)
- Last time I checked, yes. This is yet another feature request I have in my head -- trouble is that so many of the requests are already from me. Fixing this souldn't be hard though. --mav
OK, somebody has to say this: The fact that we are patting ourselves on the back for intentionally undercounting our articles is just plain silly. I just now went and looked under "short pages" at all 28 pages with exactly 100 bytes, and 13 of them contained a comma. Not a single one of them deserves to be called an article, but almost half are counted. Next I looked at all 33 pages with exactly 200 bytes, and 27 of those contained a comma. A few of them (not eighty percent!) might be considered articles under an extremely lenient definition of article, but does anyone outside of Wikipedia consider a single, brief paragraph to be an article? Are ANY of Brittanica's articles under 500 bytes?
I estimate our median article size as 1000 bytes, because that's the size of our 18943rd longest page according to long pages. (18943 would be the median of 37886 total articles.) To my mind, a conservative count of articles would place an 1000-byte minimum, rather than a 1000-byte median, which would trim our total article count in half. But no matter how we count articles, let us at least prominently post the median size of the articles which are included in the count. And please, please don't call the count "unimpeachable". (For refernence, my little tirade (including this sentence) is 1367 bytes long, i.e. rather longer than our median article.)
--Fritzlein 02:55 Aug 17, 2002 (PDT)
Print encyclopedia's have many very short articles that are little more than definitions. I do agree that our automatic article count is not as conservative as it should be though. It should, for example, search for and exclude from the count any page that has the word "disambiguation" in the text or title and do some other things to exclude pages that are probably better described as definitions of encyclopedia topics and not encyclopedia articles (that is, there needs to be better stub detection). --mav
This FAQ looks good to me now. Shall we link it from Wikipedia:FAQ and the Main Page now? --mav
(Of course, mav changes it after saying this.) Yeah, I think that it's fine, although it'd be nice to write the pages that it links to too. That shouldn't stop us from linking to it, however. — Toby 15:30 Aug 21, 2002 (PDT)
- Coolness - now all we need is a small entry in the actual FAQ about this with a brief answer. I will go ahead link it from the Main Page. --mav
Is it just me or has the number of articles dropped? Lir 23:36 Nov 14, 2002 (UTC)
- No. It has been at 90679 since Monday. --mav
I prefer the byte-based distinction of stub/article. I'm working on Japanese Wiki, and Japanese writing can go on in a great length without any commas, partly because we have a similar but different Japanese sign for that: "A"
I imagine this situation is not unique to Japanese language.
Tomos 00:35 Feb 3, 2003 (UTC)
[edit] ZONGULDAK regional info @ history in English
You can use text on singlix web site, zonguldak web pages...
I am the owner of these web pages...
INFO page text taken from www.voyagerbook.com but, recently i have updated it at singlix web site.
HISTORY is written by me, directly...
There are not any copyrights on these pages... (www.voyagerbook.com Zonguldak page is not orginal source of information about Zonguldak. All text is compilation/adaption from official turkish state/tourism documents.)
USE THE TEXT of Following web pages...
http://www.singlix.com/zonguldak/index.html http://www.singlix.com/zonguldak/info.html http://www.singlix.com/zonguldak/history.html
Regards... Erdogan Tan
[edit] Problem with Article Count / Article Description
The definition on an article, at Wikipedia:What is an article is reached by clicking onto the word article by the count of articles on the main page. It launches straight into an explanation of what an article is & is not.
However it is not until you get lower down the page that you discover that the published count is not the same as the definition given above: irrespective of the discussion over number of bytes, commas and links, the count does count stubs which are clearly marked in the text as stubs by msg:stub.
I'd earnestly suggest we should seek a means of getting a count which excludes stubs, even if the expedite way to do this is to parse a page as it is saved for the presence of the msg:stub (or a msg:stublist) which sets a flag that can be used by the counting software. Discuss. --Tagishsimon 00:08, 20 Apr 2004 (UTC)
[edit] Dictionary versus Encyclopedia
A lot of entries really do not belong to an encyclopedia but to a dictionary. Is the Wiktionary going to get combined with this project at some stage? Why aren't they already one thing? BozMo(talk)
- No, it will likely never happen. In the beginnings of wikipedia, there were disagreements about whether we should include dictionary definitions or not. It was decided that Wikipedia is not a dictionary (see What Wikipedia is not), and wiktionary was created. ✏ Sverdrup 16:10, 26 Jun 2004 (UTC)
[edit] Templates and/or articles
There currently is a discussion going on at Miscellany for deletion about the alleged violation by the WikiProject Stub sorting of wikipedia's ban on ownership of articles. Key issue is whether WP:OWN applies to templates. From the text of WP:OWN, it seems that it only applies to articles. According to the list in this page, "an 'article' does not include any pages in any of the specified namespaces that are used for particular purposes, such as: * the Wikipedia namespace ... * the talk namespaces ... * the special namespace ... * the user namespace ... * the image namespace ... * the MediaWiki namespace." According to the definitions of wikipedia, are templates articles or not? Aecis praatpaal 23:50, 23 December 2005 (UTC)
[edit] count of articles
I've downloaded 20051213_pages_current.xml.bz2
from the downloads.wikipedia.org site. unpacking it gets me an XML file of what I thought would be all the current articles (where "current' is actually some moment on December 12th, 2005.)
Pawing through the file, I can find about 805,196 items. These items include what I would call "articles", but also include user pages, talk pages, and images. Maybe there's other stuff in there, too -- I haven't done that much aggregation just yet.
Do I have the wrong file if I expect all 800-thousand articles that the front page claims? Or is the definition here wrong, and the advertised count of articles really does include the Talk:, User:, and Image: namespaces? And some other stuff?
-- Mikeblas 06:58, 2 January 2006 (UTC)
I'm curious -- is there any way to get wikipedia to generate a page that compiles all original content for a particular page. There are some particular POV pages that would be nice to see the entirety of, especially if you could color card various spots based on their retraction times,etc .. Any ideas??
[edit] New namespaces?
Why can't Wikipedia simply move disambig and redirect to Disambiguation: and Redirect: namespaces? This would reduce clutter and make the site a little more streamlined. MOD 05:43, 9 April 2006 (UTC)
[edit] random question
there is a clothing i think it's called "Wifekicker" or something like that, i need to see the article about it, sorry if thisisen't the right place for this question, but i found none other place but the discussion about articles, Well... goodbye. --201.230.228.151 08:40, 23 August 2006 (UTC)