Click to See Complete Forum and Search --> : site search utility


llama
01-27-2005, 05:04 AM
Hi,

I'm looking for a PHP search utility to put on one of my sites. It needs to be able to spider pdfs and word docs as well as the usual site pages.

Has anyone got any recommendations?

Ideally the front-end part would be XHTML, or at least adaptable so I could make it XHTML.

Thanks a lot,

Stephen

ShrineDesigns
01-27-2005, 06:05 AM
the pdf functions (http://www.php.net/manual/en/ref.pdf.php) in php only writes pdf files. i have never tried to create a script to do this, looking at the data in a pdf file...
what it looks like is the obj starts and endobj ends a page, image, and/or set of instructions, and the instructions in between << and >> set how the object is rendered or its function, stream and endstream must be the actual data of the object, i think it is gunzip data, i thought i saw a few gunzip characters (indentification marks), startxref looks like the structure for the pdf bookmarks
...pretty complex for something so simple lol

doc files would be pretty easy, i know people have devised way of stripping the garbage of them

llama
01-27-2005, 06:32 AM
I wasn't intending on writing something from scratch. I would use something like iSearch, zoom, phpdig etc. Just wondered if anyone had used any of these, or had other recommendations?

I would happily consider paying for a commercial option if the functionality merits it.

cheers

Stephen

ShrineDesigns
01-27-2005, 04:40 PM
but that takes all the fun out of it lol

george1234
01-27-2005, 10:53 PM
I've used Zoom, and it does exactly what you want. You'll have to buy either the Standard or Professional versions for PDF and DOC support though. But its definitely the most polished solution - works out of the box, heaps of features.

The PHP script is clean and commented, so you can make hacks to it if need be. It also outputs in XHTML 1.0, AFAIK.

Check their website: http://www.wrensoft.com/zoom/

George.