Open Source IndexersInfomotions Musings; May 29, 2001 by Eric Lease
Describes the history and features of eight open-source search engines, freeWAIS-sf
(aging code and hard to install, but good for searching email and public domain
etexts); Harvest (powerful gathering features for
frequently-changing data stores, good with structured documents); ht://Dig
(tricky to configure, no phrase searching, automatic stemming and match word
highlighting); Isearch (weak documentation and
support, easy to install, dated interface, Z39.50 support); MPS
Information Server (zippy indexing of both text and structured data, Z39.50
support, Perl API, limited documentation); SWISH-E
(simple to install engine, CGIs in Perl and PHP still beta, good for HTML
pages, recognizes new META tags, sorts results by field; WebGlimpse
(easy to install and configure, requires commercial version for customized
output); Yaz/Zebra (mainly Z39.50, no Perl API, mainl
y a toolkit to index and respond to distributed client queries). Article also
points out that chaotic information is less than helpful and encourages organization,
structure and vocabulary control.
-- Combining Browsing and SearchingUsenix presentation, January 10,
Contents of a presentation at the Usenix Technical Conference in early 1997,
points out some major issues of context and location in searching that have
not really been addressed by other tools.
Search Tools Consulting's principal analyst, Avi Rappoport, may be available to help you with selection, indexing and search log analysis, as well as relevance evaluation, user experience testing, and functional search engine work. Please contact us for more information.