dcsimg
www.webdeveloper.com
Results 1 to 4 of 4

Thread: PHP script to crawl URLs retrieved from xml files

  1. #1
    Join Date
    May 2006
    Posts
    57

    PHP script to crawl URLs retrieved from xml files

    Hi, I have collected a number of xml files from RSS feeds. I will retrieve the useful contents (e.g., title, URL link, ...) from these xml files and store them into the MySQL database. However, the content / description in the xml files are just the summaries, which are not what I want. What I want for the content is the full text of the entries. Therefore, I am thinking of using a PHP script to crawl the URLs retrieved from the xml files, followed by extracting the full text. What should be the logic/algorithm of writing this php script ?

    Thank you very much.

  2. #2
    Join Date
    Jun 2004
    Location
    4846′36″ N 910′48″ E
    Posts
    3,747
    what version of PHP do you have?

  3. #3
    Join Date
    May 2006
    Posts
    57
    both php4 and php5. I can switch from 1 version to another

  4. #4
    Join Date
    Jun 2004
    Location
    4846′36″ N 910′48″ E
    Posts
    3,747
    did you have a look at http://php.net/simplexml ? (requires PHP5)

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
HTML5 Development Center



Recent Articles