Click to See Complete Forum and Search --> : converting a website to text online?


your_twin
02-19-2007, 07:23 AM
Is there a way to convert a website into text? Into a string variable that can be searched for specific words and characters?

Charles
02-19-2007, 07:39 AM
It's easily done but I wouldn't convert the HTML to text so much as I would run the thing through some HTML Parser. What are you up to exactly and in full?

your_twin
02-19-2007, 08:28 AM
I am trying to make an internal site search engine that will search through the text of all the different pages that I create and to extract the first few characters of pertinent articles and display them, with links to the actual article or page, in a table.

Stephen Philbin
02-19-2007, 09:30 AM
You don't have your site content stored in a database? It'd be much easier to do it the usual database way and let it do its job.

ray326
02-19-2007, 03:56 PM
I'd recommend you look at Lucene.