Forum     

Go Back   Digit Technology Discussion Forum > Portables, Peripherals and Electronics > QnA (read only)
Register FAQ Calendar Mark Forums Read

QnA (read only) Mods please help transfer the contents of this forum to proper sections. :)


 
 
LinkBack Thread Tools Search this Thread Display Modes
Old 02-03-2005, 06:38 PM   #1 (permalink)
In The Zone
 
tuXian's Avatar
 
Join Date: Nov 2004
Location: Hyderabad
Posts: 364
Default .:: Is robots.txt necessary? ::.


I dont want any restriction on indexing so dont require robots.txt file.

But is it any harm or good if you put one? coz a blank file will stop giving the 404 message.

What do u all say? Pros and Cons plz let me know

Thanks
tuXian is offline  
Advertisements. Register and be a member of the community to get rid of them.
Advertisement

Old 03-03-2005, 06:30 PM   #2 (permalink)
Wise Old Owl
 
enoonmai's Avatar
 
Join Date: Oct 2004
Location: Parked diagonally in a parallel universe
Posts: 1,304
Default

If you don't want any restriction on crawling, just put in

User-agent: *
Disallow:

and leave it at that. This way, if you want to disallow your cgi-bin or some other private folder later, you can easily add it. Let the file exist though. No harm done if you leave it, and if you leave it in, chances are that it will get cross-indexed faster.
__________________
Face it, kid! Provoking a reaction isn't the same thing as saying something significant - Calvin
A64 3000+@2.4G/Asus A8V-DLX/1G DDR400/BBA X800 XT PE/320G HGST SATA2
Playing FEAR XP/LSW2
enoonmai is offline  
Old 03-03-2005, 08:16 PM   #3 (permalink)
In The Zone
 
tuXian's Avatar
 
Join Date: Nov 2004
Location: Hyderabad
Posts: 364
Default

Quote:
No harm done if you leave it, and if you leave it in, chances are that it will get cross-indexed faster.
By this you mean to say that If I dont put the file on the server at all then it will cross index faster?

Am I right? Plz reply
__________________
You know it's love when you memorize her IP to skip DNS overhead.
tuXian is offline  
Old 03-03-2005, 08:26 PM   #4 (permalink)
Wise Old Owl
 
enoonmai's Avatar
 
Join Date: Oct 2004
Location: Parked diagonally in a parallel universe
Posts: 1,304
Default

No, if you dont put the file, then *some* spiderbots will usually register a 404 in your logs if they "specifically" request for the file. If you usually dont have a robots.txt file they assume unilimited access, but if a robot does try to crawl the site and explicitly requests a robots.txt, then you're faced with the obvious 404 error. Even a blank file will do, but like I said, its best if you just put in those two lines and forget about it. It will not slow down or speed up your cross-indexing but its compliant with search engines and won't go around generating errors at least.
__________________
Face it, kid! Provoking a reaction isn't the same thing as saying something significant - Calvin
A64 3000+@2.4G/Asus A8V-DLX/1G DDR400/BBA X800 XT PE/320G HGST SATA2
Playing FEAR XP/LSW2
enoonmai is offline  
Old 03-03-2005, 08:35 PM   #5 (permalink)
In The Zone
 
tuXian's Avatar
 
Join Date: Nov 2004
Location: Hyderabad
Posts: 364
Default

thanks a zillion

BTW another question:

My website uses a shared IP. How can I know the other sites on the server using the same IP?
tuXian is offline  
Old 03-03-2005, 09:47 PM   #6 (permalink)
Version 2.0
 
Deep's Avatar
 
Join Date: Jan 2004
Location: Mumbai
Posts: 977
Default

i would sugggest keeping robots.txt file just for search engines and allowing all directories for scannig..

in anyways search engins are not gonna scann dir with password protection (.htaccess)

so if you keep some directory names to disallow then it may open doors for hackers..

coz people always try to look into the closed doors

so they might try to play around with the directories u dont want SE's to access....


about ur 2nd question..

using sites like

www.whois.sc/(SiteName.com here)
http://whois.webhosting.info/(SiteName.com here)

reaplace (SiteName.com here) with the site name you want to search for..

those sites will show no. of sites hosted on the same IP and even list of those sites...

cheers
Deep
__________________
- Deep Ganatra -
www.whoisdeep.com
www.twitter.com/DeepXP/
Deep is offline  
Old 03-03-2005, 10:14 PM   #7 (permalink)
In The Zone
 
tuXian's Avatar
 
Join Date: Nov 2004
Location: Hyderabad
Posts: 364
Default

thanks
__________________
You know it's love when you memorize her IP to skip DNS overhead.
tuXian is offline  
 

Bookmarks

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


 
Latest Threads
- by ico
- by Piyush
- by icebags
- by clinton
- by Charan

Advertisement




All times are GMT +5.5. The time now is 12:07 AM.


Powered by vBulletin® Version 3.8.7
Copyright ©2000 - 2012, vBulletin Solutions, Inc.

Search Engine Optimization by vBSEO 3.3.2