Template:Discussion/datadump

From Wikipedia, the free encyclopedia

I need a list of all pages in the Wikipedia namespace - How is this done?

The Help Project is considering taking on the clean-up of the Wikipedia namespace. We'd like to build a site map, and in order to do this, we need a list of all pages in the Wikipedia namespace...

I looked at Special:Allpages, but that only lists the pages one screen at a time, and without brackets. Is there a way to get a list of all of the pages in one step, and with brackets? I'd settle for getting a list in one step, with or without brackets. --Go for it! 04:42, 4 February 2006 (UTC)

Did you try Special:Prefixindex? And didn't we already have not one but two sitemaps of the Wikipedia namespace, both out of date? --cesarb 04:49, 4 February 2006 (UTC)
And won't this map be out-of-date even before it is started? :-) Seriously, I think a site-wide map of Wikipedia would be a great idea. I've been wanting something like that for a long time. The closest I've got so far is using Category:Wikipedia to drill up and down through categories, but that relies on everything being categorised. Oops. Looks like it's been reorganised. I meant Category:Wikipedia_administration, though that looks different. I was probably thinking of another category... Carcharoth 10:55, 4 February 2006 (UTC)
It's not necessary to use Special:Prefixindex. Special:Allpages is designed for such purposes, and has the list available. Superm401 - Talk 04:15, 5 February 2006 (UTC)
Could you use the database dumps? They are a little out of date but have the info you need. --Salix alba (talk) 15:28, 5 February 2006 (UTC)
When the next dump is complete I can make the list; that will probably be in a few days. Martin 23:58, 6 February 2006 (UTC)
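
As a rough sketch of the dump-based approach suggested above: assuming the dump has been loaded into a database that has MediaWiki's `page` table (with the `page_namespace`, `page_title` and `page_is_redirect` columns), and that the Wikipedia (project) namespace is number 4, a single query can return the whole list in one step. The in-memory SQLite table and its rows below are made-up stand-ins so the example runs on its own; a real dump would more likely sit in MySQL, and the helper names are hypothetical.

```python
import sqlite3

WIKIPEDIA_NAMESPACE = 4  # MediaWiki's number for the project ("Wikipedia:") namespace


def list_namespace_titles(conn, namespace=WIKIPEDIA_NAMESPACE):
    """Return every page title in one namespace, sorted by title."""
    rows = conn.execute(
        "SELECT page_title FROM page WHERE page_namespace = ? ORDER BY page_title",
        (namespace,),
    )
    return [title for (title,) in rows]


def make_sample_db():
    """Build a tiny stand-in for the dump's page table (all rows are made up)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE page ("
        "page_namespace INTEGER, page_title TEXT, page_is_redirect INTEGER)"
    )
    conn.executemany(
        "INSERT INTO page VALUES (?, ?, ?)",
        [
            (4, "Village_pump", 0),
            (4, "Help_desk", 0),
            (4, "VP", 1),        # a redirect inside the Wikipedia namespace
            (0, "Elephant", 0),  # an article, so it is not in namespace 4
        ],
    )
    return conn


if __name__ == "__main__":
    # Print the list with brackets, one page per line.
    for title in list_namespace_titles(make_sample_db()):
        print(f"[[Wikipedia:{title}]]")
```

Note that this query still returns redirects; adding `AND page_is_redirect = 0` to the `WHERE` clause would leave them out.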

The site map is so that we can see what we've got to work with. The goal, hopefully an achievable one, is the clean-up of the Wikipedia namespace. Currently it is a morass of misnamed, misplaced, disorganized, and repetitious pages. --Go for it!

Is there a way to get Special:Allpages to list all the pages in a specific namespace all on one page? I would like a continuous list, without the need to cut and paste the thing one page at a time. Some of the answers above seem to imply that this is possible, but I went there and cannot figure out how to do it. Please help. --Go for it!

I have the January 25 db dump loaded on my computer, so I should be able to help. I'm running the query now, but it's taking a while. It must be a really long list of pages. --Aude (talk | contribs) 00:12, 8 February 2006 (UTC)
The number of pages in the Wikipedia namespace is really huge: 102,572. That count, though, includes every individual WP:RFC, WP:RFA, WP:POTD, WP:VFD, etc. page, so we can narrow down the number of pages in the list and sort it by popularity (hits). --Aude (talk | contribs) 00:33, 8 February 2006 (UTC)
I'm going through the list now, weeding out the duplicate pages (e.g. WP:VFD, WP:POTD, ...), as well as redirect pages, and formatting the list. Right now, I have it down to 14,000 pages, which is still too many for any site map. --Aude (talk | contribs) 04:32, 8 February 2006 (UTC)
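
The weeding-out described above might look something like the sketch below. It assumes that the "duplicate" pages are the per-item subpages of a few high-volume processes (titles of the form `Parent/Subpage`), that a redirect flag and a hit count are available for each page, and that the final list should be bracketed and sorted by hits. The `Page` type, the parent names in `BULK_PARENTS`, and the sample rows are all hypothetical.

```python
from typing import NamedTuple


class Page(NamedTuple):
    title: str         # title without the "Wikipedia:" prefix, underscores for spaces
    is_redirect: bool
    hits: int


# Hypothetical parent pages whose many subpages should be left off the site map.
BULK_PARENTS = ("Requests_for_adminship", "Votes_for_deletion")

# Made-up sample rows, for illustration only.
SAMPLE_PAGES = [
    Page("Help_desk", False, 50),
    Page("VFD", True, 90),
    Page("Votes_for_deletion", False, 70),
    Page("Votes_for_deletion/Foo", False, 5),
    Page("Requests_for_adminship/Bar", False, 8),
]


def site_map_candidates(pages, bulk_parents=BULK_PARENTS):
    """Drop redirects and subpages of bulk parents, then sort by hits, highest first."""
    prefixes = tuple(parent + "/" for parent in bulk_parents)
    kept = [
        page for page in pages
        if not page.is_redirect and not page.title.startswith(prefixes)
    ]
    return sorted(kept, key=lambda page: page.hits, reverse=True)


def as_wikilink(page):
    """Format a page as a bracketed link, with underscores shown as spaces."""
    return "[[Wikipedia:" + page.title.replace("_", " ") + "]]"


if __name__ == "__main__":
    for page in site_map_candidates(SAMPLE_PAGES):
        print(as_wikilink(page))
```

The parent pages themselves are kept (only titles starting with `Parent/` are dropped), so each process still appears once in the resulting list.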