Click to See Complete Forum and Search --> : Pattern Matching Help!


cusimar9
06-15-2005, 11:56 AM
I'm trying to write a script to find all url's on a web page

I'm using the MSXML2.ServerXMLHTTP object to find the text by pattern matching

This works fine for most cases:

objRegExp.Pattern = "href=(""|')http://(.*?)(""|')"

But it doesn't find web pages in frames, where its just the name of a html file ie homepage.html

I need to say that the above pattern needs to be matched OR its a .html file.

Anyone have any idea?

phpnovice
06-15-2005, 12:13 PM
It doesn't find the frame reference because you're only matching on href=.
Frame references are src=.

cusimar9
06-27-2005, 09:36 AM
Thanks!

phpnovice
06-27-2005, 01:08 PM
Cheers.