cusimar9
06-15-2005, 11:56 AM
I'm trying to write a script to find all url's on a web page
I'm using the MSXML2.ServerXMLHTTP object to find the text by pattern matching
This works fine for most cases:
objRegExp.Pattern = "href=(""|')http://(.*?)(""|')"
But it doesn't find web pages in frames, where its just the name of a html file ie homepage.html
I need to say that the above pattern needs to be matched OR its a .html file.
Anyone have any idea?
I'm using the MSXML2.ServerXMLHTTP object to find the text by pattern matching
This works fine for most cases:
objRegExp.Pattern = "href=(""|')http://(.*?)(""|')"
But it doesn't find web pages in frames, where its just the name of a html file ie homepage.html
I need to say that the above pattern needs to be matched OR its a .html file.
Anyone have any idea?