search for: robotstxt

Displaying 6 results from an estimated 6 matches for "robotstxt".

2013 Sep 18
2
Request for the Admin
...that the Administrator / Owner can >> do (e.g contacting Google and finding out what the problem / reason >> is), I'm sure it will be helpful for all. > > Yeah, right. > > > Groeten > Geert Stappers Perhaps an explicit robots.txt could be of help. Per http://www.robotstxt.org/robotstxt.html an empty file or "User-agent: * Disallow:" A HTTP return 404 _should_ be sufficient (but we know how well some things that _should_ work may not always). -- -Gene
2009 Aug 02
4
Web
I have the following issues on a website, would like to know how would you resolve these issue? 1- CSS is not used efficient. 2- Search engine need to be optimized. 3- Java Scripts are placed between HTML tags. 4- Redirecting homepage through JS code, using client side 5- Web page delay, a lot of objects. 6- Disable listings directories from apache (how) 7- web not compatible with Firefox Thanks
2010 Jan 16
3
httpd and robots.txt
...nd possibly those that have high traffic modify their robots.txt files differently that others ??? please share if you can or care to please? for years we have just did a * (allow all) and disallow on things like /cgi-bin as examples of places to visit for those out or in the know... http://www.robotstxt.org/ http://en.wikipedia.org/wiki/Robots_exclusion_standard http://www.google.com/robots.txt and others... quite frankly, there are many orgs out there that dont follow this anyways, right? anyone? tia - rh
2010 Apr 13
2
import file formatted RFC-822
Dear R-list users: I would like to import a database of web robots, http://www.robotstxt.org/db/all.txt, it?s formatted RFC-822, ?how can I do it? The RFC 822 specification defines a standard format for electronic messages, which consists of a set of header fields and an optional body. The headers contain information about the message, such as to whom it is being sent, from whom it is...
2013 Sep 17
2
Request for the Admin
To the Syslinux mailing list administrator(s): There seem to be some issue when searching the Syslinux mailing list archives. To replicate (example): 1_ Open a Google web search page 2_ Search for "site:http://www.syslinux.org/archives/" 3_ Click "Search tools" 4_ Limit the search to "Past month". Result: no match! This has been happening for about 10 months or
2010 Oct 14
1
[LLVMdev] llvm.org robots.txt prevents crawling by Google code search?
On Wed, Oct 13, 2010 at 11:10 PM, Anton Korobeynikov < anton at korobeynikov.info> wrote: > > indexing the llvm.org svn archive. This means that when you search for > an > > LLVM-related symbol in code search, you get one of the many (possibly > > out-of-date) mirrors, rather than the up-to-date llvm.org version. This > is > > sad. > This is intentional. The