DataparkSearch
From Wikipedia, the free encyclopedia
DataparkSearch | |
---|---|
Developed by | Maxim Zakharov |
Initial release | November 27, 2003 |
Latest release | 4.49 / February 12, 2008 |
Written in | C |
OS | FreeBSD, Linux, Solaris |
Genre | search engine open source |
License | GNU General Public License |
Website | http://www.dataparksearch.org/ |
DataparkSearch Engine is an open source search engine written in C. It is distributed under the GNU General Public License and designed to organize search within a website, group of websites, intranet or local system.
[edit] Key features
To meet Wikipedia's quality standards, this article or section may require cleanup because it is in a list format that may be better presented using prose. You can help by converting this section to prose, if appropriate. Editing help is available. (January 2008) |
- Support for http, https, ftp, nntp and news URL schemes.
- htdb virtual URL scheme for indexing SQL databases.
- Built-in support for text/html, text/xml, text/plain, audio/mpeg (MP3) and image/gif MIME types.
- Support for external parsers for other datatypes.
- Ability to index multilingual websites using content negotiation.
- Searching all of the word forms using ispell affixes and dictionaries.
- Fuzzy searching based on acronyms and abbreviations.
- Stop-words, synonyms and acronyms lists.
- Boolean query language support.
- A subset of The Verity Query Language (VQL) is supported.
- The Neo Popularity Rank based on neural network model.
- Results sorting by relevance, popularity rank, last modified time and by importance (a multiplication of relevance and popularity rank).
- Support and automatic detection for wide range of character sets and languages.
- Accent insensitive search.
- Phrase segmenting for Chinese, Japanese, Korean and Thai languages.
- mod_dpsearch - search module for Apache web server.
- Support for Internationalized Domain Names.
- The Summary Extraction Algorithm.