Click to See Complete Forum and Search --> : web dev / search engine question


GleepGlop22
03-17-2004, 12:35 PM
im doing a job where i have to train an AI by giving it URLs on certain subjects so it can learn to recognize them. so i have to find URLs that the AI can learn from, the more the better.
Is there a way to recognize what websites have a lot of internal links with distinct URLs (eg. bla.com, bla.com/bla)... is there a way to use a search engine for this? i know there are a lot of search engine commands that i dont know, so...
thanks!

PeOfEo
03-17-2004, 12:43 PM
when you add the site you could just search the meta keywords and description for the phrase. If you are scripting it yourself I suggest using some regexp w/ a binary search maybe. You can also search for that part of the url btw, and find all of the sites that have the same .com. Just treat it like any other string.