NutchSearchEngine
Last edit May 7, 2005
Project Name:
Nutch
Official Web Page:
http://nutch.org/
current wiki:
http://wiki.apache.org/nutch/
Old Wiki:
http://www.nutch.org/cgi-bin/twiki/view/Main/Nutch
Nutch is an
OpenSource
search engine application. It consists of a fetcher, indexer, parser and searcher - each of these functional area are provided as plugins.
Fetchers:
HTML
FTP
file
Indexer:
Basic
Parsers:
HTML
MS Word
text
PDF
MP3
Searcher:
Basic
Site