DataparkSearch

From Wikipedia, the free encyclopedia

DataparkSearch
Developed by Maxim Zakharov
Initial release November 27, 2003
Latest release 4.49 / February 12, 2008
Written in C
OS FreeBSD, Linux, Solaris
Genre search engine open source
License GNU General Public License
Website http://www.dataparksearch.org/

DataparkSearch Engine is an open source search engine written in C. It is distributed under the GNU General Public License and designed to organize search within a website, group of websites, intranet or local system.

[edit] Key features

  • Support for http, https, ftp, nntp and news URL schemes.
  • htdb virtual URL scheme for indexing SQL databases.
  • Built-in support for text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif MIME types.
  • Support for external parsers for other datatypes.
  • Ability to index multilingual websites using content negotiation.
  • Searching all of the word forms using ispell affixes and dictionaries.
  • Fuzzy searching based on acronyms and abbreviations.
  • Stop-words, synonyms and acronyms lists.
  • Boolean query language support.
  • A subset of The Verity Query Language (VQL) is supported.
  • The Neo Popularity Rank based on neural network model.
  • Results sorting by relevance, popularity rank, last modified time and by importance (a multiplication of relevance and popularity rank).
  • Support and automatic detection for wide range of character sets and languages.
  • Accent insensitive search.
  • Phrase segmenting for Chinese, Japanese, Korean and Thai languages.
  • mod_dpsearch - search module for Apache web server.
  • Support for Internationalized Domain Names.
  • The Summary Extraction Algorithm.

[edit] External links