DataparkSearch
From Wikipedia, the free encyclopedia
DataparkSearch | |
Image:Dpsearch2.gif | |
Developer: | Maxim Zakharov |
---|---|
Latest release: | 4.43 / October 30, 2006 |
OS: | FreeBSD, GNU/Linux, Sun Solaris |
Use: | search engine open source |
License: | GNU General Public License |
Website: | www.dataparksearch.org |
DataparkSearch Engine is an open source search engine written in C. It is distributed under the GNU General Public License and designed to organize search within a website, group of websites, intranet or local system.
DataparkSearch can index text/plain, text/html and text/xml data natively, and many others datatypes using external parsers.
This engine is ready to index multilingual websites: a wide range of character sets and languages are supported and can be detected automatically, it uses content negotiation technology to fetch versions of same page in different languages, it can perform accent insensitive search and segment phrases in Chinese, Japanese, Korean and Thai. It's possible to use synonyms, acronyms and ispell-based fuzzing to extend search results.
The search engine uses home-grown neural network based technology to assist ranking of webpages. This method is called The Neo Popularity Rank. Results can be sorted by relevance, popularity rank, last modified time and by importance (a multiplication of relevance and popularity rank).
DataparkSearch Engine can construct automaticaly the summary for every page indexed and use this summary to adjust ranking of webpages (so called the Summary Extraction Aalgorithm, SEA).
[edit] External links
- Official page of the project
- A demo site using DataparkSearch, you may test this searching in Chinese, Japanese, Korean and Thai languages. Also some pages in other languages were indexed.
- Newslookup.com, a news search site using DataparkSearch.
- Sravniti.com, a goods search site using DataparkSearch.