YaCy

Original author(s): Michael Christen
Developer(s): YaCy Community
Stable release: 1.8 / 16 September 2014
Operating system: Cross-platform
Type: Overlay network, Search engine
License: GPLv2+
Website: www.yacy.net/en

YaCy (pronounced "ya see") is a free distributed search engine built on the principles of peer-to-peer (P2P) networks.[1][2] Its core is a computer program written in Java that, as of September 2006, ran on several hundred computers, the so-called YaCy-peers. Each YaCy-peer independently crawls the Internet, analyzes and indexes the web pages it finds, and stores the indexing results in a common database (the so-called index), which is shared with other YaCy-peers using P2P principles.

In contrast to semi-distributed search engines, the YaCy-network has a decentralised architecture: all YaCy-peers are equal and no central server exists. A peer can run either in crawling mode or as a local proxy server that indexes the web pages visited by the person running YaCy on their computer. Several mechanisms are provided to protect the user's privacy.

The search functions are accessed through a locally running web server, which provides a search box for entering search terms and returns results in a format similar to that of other popular search engines.

System components

The YaCy search engine is based on four elements:[3]

Crawler
A search robot that moves from web page to web page and analyzes their content.
Indexer
Creates a Reverse Word Index (RWI), in which each word has a list of relevant URLs and ranking information. Words are stored in the form of word hashes.
Search and Administration interface
A web interface provided by a local HTTP servlet with a servlet engine.
Data Storage
Stores the Reverse Word Index database using a distributed hash table.
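
The indexer and data storage elements can be illustrated with a minimal Python sketch. It is not YaCy's actual implementation (YaCy is written in Java and uses its own hashing and ranking schemes). The SHA-1 word hash, the term count used as a stand-in for ranking information, the ring-style peer selection, and the names ReverseWordIndex, word_hash and responsible_peer are all assumptions made for illustration.

```python
import hashlib
import re
from bisect import bisect_left
from collections import defaultdict


def word_hash(word):
    """Hypothetical word hash: hex SHA-1 of the lowercased word.
    YaCy's real hashing scheme is different; this is only a stand-in."""
    return hashlib.sha1(word.lower().encode("utf-8")).hexdigest()


class ReverseWordIndex:
    """Maps word hash -> {url: term count}.
    The term count is a simplified stand-in for ranking information."""

    def __init__(self):
        self.postings = defaultdict(dict)

    def add_document(self, url, text):
        # Split the text into words and record one posting per occurrence.
        for word in re.findall(r"\w+", text.lower()):
            entry = self.postings[word_hash(word)]
            entry[url] = entry.get(url, 0) + 1

    def search(self, word):
        # Look up the word by its hash; the plain word is never a key.
        entry = self.postings.get(word_hash(word), {})
        # Highest count first, then URL for a stable order.
        return sorted(entry, key=lambda u: (-entry[u], u))


def responsible_peer(key_hash, peer_names):
    """Distributed-hash-table style placement (an assumed scheme):
    peers are placed on a ring by the hash of their name, and a key
    belongs to the first peer whose hash is >= the key hash,
    wrapping around at the end of the ring."""
    ring = sorted((word_hash(name), name) for name in peer_names)
    positions = [h for h, _ in ring]
    i = bisect_left(positions, key_hash) % len(ring)
    return ring[i][1]


index = ReverseWordIndex()
index.add_document("http://example.org/a", "peer to peer search search")
index.add_document("http://example.org/b", "distributed search")
print(index.search("search"))
# -> ['http://example.org/a', 'http://example.org/b']
print(responsible_peer(word_hash("search"), ["peer-1", "peer-2", "peer-3"]))
```

Keying the index by word hash rather than by the word itself means the same hash can serve both as the lookup key and as the position that decides which peer stores the entry, which is the property a distributed hash table relies on.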

Advantages

PDF slides from ApacheCon 2012: A Web Search Appliance with Solr and YaCy

Disadvantages

See also

References

  1. "YaCy takes on Google with open source search engine". The Register. 2011-11-29. Retrieved 2012-04-16.
  2. "YaCy: It's About Freedom, Not Beating Google". PC World. 2011-12-03. Retrieved 2012-04-16.
  3. "YaCy Technology Architecture". YaCy.net. Retrieved 2012-02-14.
  4. "Search Engine Technology". Retrieved 2014-01-28.
  5. "YaCy crawler cannot parse URI's with IPv6 address in it inside square brackets". YaCy-Bugtracker. MantisBT Team. Retrieved 2014-04-07.

External links