Instructions for preserving your parse_doc.pl file

  1. download xlHtml from www.xlHtml.org and install it
    (this example assumes it is installed under /usr/local; if not, update the path in the file from step 3)
  2. make sure the following two lines are included in mime.types
    application/msexcel   xls
    application/vnd.ms-excel    xls

  3. copy the new parse_xls.pl to /usr/doc/packages/htdig/contrib
    (or wherever your parse_doc.pl is currently installed)
    (the path to xlHtml in this file might need to be updated)
  4. edit your htdig configuration file (usually /opt/www/htdig/conf/htdig.conf)
    and add the following to the external_parsers attribute
    application/msexcel /usr/doc/packages/htdig/contrib/parse_xls.pl \
    application/vnd.ms-excel /usr/doc/packages/htdig/contrib/parse_xls.pl
    (remember to check the path to parse_xls.pl from step 3)
  5. run rundig, then search with ht://Dig for a word contained in the Excel file
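
The mime.types check from step 2 can be scripted. The following is a minimal sketch, not part of the original package: the function name check_excel_mime_types is hypothetical, and the location of mime.types depends on your installation, so it is passed as an argument.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with htdig or xlHtml): report whether a
# mime.types file maps both Excel MIME types from step 2 to the xls extension.
# The file location varies by installation, so it is taken as the first argument.
check_excel_mime_types() {
    mime_file="$1"
    missing=0
    for mime_type in application/msexcel application/vnd.ms-excel; do
        # Match the type at the start of a line, followed by whitespace and
        # an extension list that contains xls as a whole word.
        if ! grep -Eq "^${mime_type}[[:space:]]+(.*[[:space:]])?xls([[:space:]]|\$)" "$mime_file"; then
            echo "missing: $mime_type xls"
            missing=1
        fi
    done
    # Exit status 0 means both mappings were found, 1 means at least one is missing.
    return "$missing"
}
```

Calling check_excel_mime_types with the path to your mime.types prints one "missing:" line for each mapping that still needs to be added and returns a non-zero status if any is absent.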

Troubleshooting

Standard Instructions