CarolW
01-29-2006, 10:41 AM
I have two related questions.
1) I really don't understand the robots.txt file. I put one in recently to prevent my images from being indexed (I think). I just parrotted what I could find, without understanding it.
2) Currently I have a site I'm putting together for a friend sitting in a subdirectory on my site. We're working on transferring my friend's domain registration; when it's transferred, I'll then put her site on her own domain. I will need a robots.txt file for her, too; it will probably be similar to mine.
Here's my current robots.txt file:
User-agent: Googlebot-Image
Disallow: /
User-agent: *
Disallow: /coflip/
Disallow: /cohorse/
Disallow /newsite/
Disallow: /images/
In the disallow list, "/newsite/" is the one we're transferring. As long as I leave /newsite/ files on my site, I should surely keep that statement in my robots.txt file. Presumably I can delete that statement when I delete the /newsite/ directory and all its contents from my own site. Right?
Let's assume I use this same robots.txt file (changed, though so it will work properly) for /newsite/ when its own domain is ready and I can FTP its files to it.
How can I change this to allow the W3C link-checker access? What it's not being able to access is images.
Too bad that in my continuing studies, I really have no idea what's going on - what these statements actually mean, so if somebody can point me to a resource I might have a chance of comprehending, or perhaps even try to explain it yourself, I'd be very grateful!
Thanks in advance!
Sun, 29 Jan 2006 08:37:56
1) I really don't understand the robots.txt file. I put one in recently to prevent my images from being indexed (I think). I just parrotted what I could find, without understanding it.
2) Currently I have a site I'm putting together for a friend sitting in a subdirectory on my site. We're working on transferring my friend's domain registration; when it's transferred, I'll then put her site on her own domain. I will need a robots.txt file for her, too; it will probably be similar to mine.
Here's my current robots.txt file:
User-agent: Googlebot-Image
Disallow: /
User-agent: *
Disallow: /coflip/
Disallow: /cohorse/
Disallow /newsite/
Disallow: /images/
In the disallow list, "/newsite/" is the one we're transferring. As long as I leave /newsite/ files on my site, I should surely keep that statement in my robots.txt file. Presumably I can delete that statement when I delete the /newsite/ directory and all its contents from my own site. Right?
Let's assume I use this same robots.txt file (changed, though so it will work properly) for /newsite/ when its own domain is ready and I can FTP its files to it.
How can I change this to allow the W3C link-checker access? What it's not being able to access is images.
Too bad that in my continuing studies, I really have no idea what's going on - what these statements actually mean, so if somebody can point me to a resource I might have a chance of comprehending, or perhaps even try to explain it yourself, I'd be very grateful!
Thanks in advance!
Sun, 29 Jan 2006 08:37:56