Someone said to his friend, "I've got a secret, but you don't tell anyone, okay?, then he told the secret to his friend. The question is, whether a secret will be secret if the secret is told to someone else?
I think it is a erroneous understanding of what the definition of a secret. Now try entering the key words "(inurl:" robot.txt "| inurl:" robots.txt ") intext: disallow filetype: txt" on google search engine. Or simpler, go to "www.facebook.com / robots.txt", you will see the lists of files and directories that were banned from crawling by search engines.
As the proverb says, "the more often we are forbidden, the more often we violate the ban", It has become a common nature of human.
In my opinion, the application of the use of "Disallow" on Facebook is bad, that would be indirectly give the file structure and directory structure information to the public (and it applies to all websites that enacting such this method).
Perhaps the intention was originally to prevent certain files from being indexed by search engines, however indirectly have told other people about something that should not be known by others.
To block the search engines to crawl certain files, I think still there are other alternatives.
Keyword: Robots.txt Secrets | Robot.txt method | google robot algorithm | how google robot work | how google robot to execute
Tumpal Tambunan
I think it is a erroneous understanding of what the definition of a secret. Now try entering the key words "(inurl:" robot.txt "| inurl:" robots.txt ") intext: disallow filetype: txt" on google search engine. Or simpler, go to "www.facebook.com / robots.txt", you will see the lists of files and directories that were banned from crawling by search engines.
# Notice: if you would like to crawl Facebook you can # contact us here: http://www.facebook.com/apps/site_scraping_tos.php # to apply for white listing. Our general terms are available # at http://www.facebook.com/apps/site_scraping_tos_terms.php User-agent: baiduspider Disallow: /ac.php Disallow: /ae.php Disallow: /album.php Disallow: /ap.php Disallow: /autologin.php Disallow: /checkpoint/ Disallow: /feeds/ Disallow: /l.php Disallow: /o.php Disallow: /p.php Disallow: /photo.php Disallow: /photo_comments.php Disallow: /photo_search.php Disallow: /photos.php User-agent: Googlebot Disallow: /ac.php Disallow: /ae.php Disallow: /album.php Disallow: /ap.php Disallow: /autologin.php Disallow: /checkpoint/ Disallow: /feeds/ Disallow: /l.php Disallow: /o.php Disallow: /p.php Disallow: /photo.php Disallow: /photo_comments.php Disallow: /photo_search.php Disallow: /photos.php User-agent: msnbot Disallow: /ac.php Disallow: /ae.php Disallow: /album.php Disallow: /ap.php Disallow: /autologin.php Disallow: /checkpoint/ Disallow: /feeds/ Disallow: /l.php Disallow: /o.php Disallow: /p.php Disallow: /photo.php Disallow: /photo_comments.php Disallow: /photo_search.php Disallow: /photos.php User-agent: naverbot Disallow: /ac.php Disallow: /ae.php Disallow: /album.php Disallow: /ap.php Disallow: /autologin.php Disallow: /checkpoint/ Disallow: /feeds/ Disallow: /l.php Disallow: /o.php Disallow: /p.php Disallow: /photo.php Disallow: /photo_comments.php Disallow: /photo_search.php Disallow: /photos.php User-agent: seznambot Disallow: /ac.php Disallow: /ae.php Disallow: /album.php Disallow: /ap.php Disallow: /autologin.php Disallow: /checkpoint/ Disallow: /feeds/ Disallow: /l.php Disallow: /o.php Disallow: /p.php Disallow: /photo.php Disallow: /photo_comments.php Disallow: /photo_search.php Disallow: /photos.php User-agent: Slurp Disallow: /ac.php Disallow: /ae.php Disallow: /album.php Disallow: /ap.php Disallow: /autologin.php Disallow: /checkpoint/ Disallow: /feeds/ Disallow: /l.php Disallow: /o.php Disallow: /p.php Disallow: /photo.php Disallow: /photo_comments.php Disallow: /photo_search.php Disallow: /photos.php User-agent: teoma Disallow: /ac.php Disallow: /ae.php Disallow: /album.php Disallow: /ap.php Disallow: /autologin.php Disallow: /checkpoint/ Disallow: /feeds/ Disallow: /l.php Disallow: /o.php Disallow: /p.php Disallow: /photo.php Disallow: /photo_comments.php Disallow: /photo_search.php Disallow: /photos.php User-agent: Yandex Disallow: /ac.php Disallow: /ae.php Disallow: /album.php Disallow: /ap.php Disallow: /autologin.php Disallow: /checkpoint/ Disallow: /feeds/ Disallow: /l.php Disallow: /o.php Disallow: /p.php Disallow: /photo.php Disallow: /photo_comments.php Disallow: /photo_search.php Disallow: /photos.php User-agent: * Disallow: / # E-mail sitemaps@lists.facebook.com if you are authorized to access these and
are getting denied.
Sitemap: http://www.facebook.com/sitemap.php
As the proverb says, "the more often we are forbidden, the more often we violate the ban", It has become a common nature of human.
In my opinion, the application of the use of "Disallow" on Facebook is bad, that would be indirectly give the file structure and directory structure information to the public (and it applies to all websites that enacting such this method).
Perhaps the intention was originally to prevent certain files from being indexed by search engines, however indirectly have told other people about something that should not be known by others.
To block the search engines to crawl certain files, I think still there are other alternatives.
Keyword: Robots.txt Secrets | Robot.txt method | google robot algorithm | how google robot work | how google robot to execute
|
You may also like: