I have just started using a great application, Sphider, that indexes websites and provides a search facility (basically a do-it-yourself Google).
This has done a great job of providing a custom search for a site that I am involved with.
Link to sphider site:
http://www.sphider.eu/docs.php
It has options to allow the indexing of pdfs, docs, ppts and xls files but it requires various binaries to be available:
pdftotext
catdoc
catppt
xlstocsv
Are any of these already installed, or are there alternatives that may work?
I am most interested in getting the pdfs indexed, and possibly .doc files.
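For anyone wanting to check their own host, here is a minimal sketch of how the binaries could be looked up. It assumes Python 3 is available on the server and that scripts can be run there, which may not be true on every hosting plan. The function name `find_binaries` is just a made-up helper for illustration; the binary names are the ones listed above.

```python
import shutil

# Binary names as listed above for converting documents to text.
REQUIRED = ["pdftotext", "catdoc", "catppt", "xlstocsv"]


def find_binaries(names):
    """Map each name to its full path on PATH, or None if it is not found."""
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    # Print one line per binary so missing ones are easy to spot.
    for name, path in find_binaries(REQUIRED).items():
        print(f"{name}: {path or 'not found'}")
```

This only looks at the PATH of the user running the script, so a binary installed somewhere else (or visible only to the web server user) could still be reported as not found.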
Thanks