User talk:HBC Archive Indexerbot/to do

From Wikipedia, the free encyclopedia

  • Try to figure out how to split up huge indices to avoid timeouts
    • Multiple pages, transcluded onto one?
    • Probably need to have multiple actual pages - otherwise it's just unmanagable. Will probably need new target syntax. split=alpha, segments=A-C,D-G,H-L,etc?
  • Address scalability of cache (filename hashing or something)
  • Make cache more efficient by caching compiled objects (%index hash bits)
  • Allow indexing of talk page if opt-in is on a different page (like in a transcluded header)
  • Handle month/year archives (maybe an option to follow links on the main page to find the archives?)
    • Partially possible by specifying individual single-page masks