Exactly. I don't know where Jan read such stuff, he isn't telling. Indexing teaser links is IMHO ethically borderline because you think "yeah, I found it!" only to be confronted with a membership application. But it certainly is not illegal, as password hacking would certainly be in most countries.
--
Regards, Joerg
http://www.analogconsultants.com/
"gmail" domain blocked because of excessive spam.
Use another domain or send PM.
On a sunny day (Sat, 6 Mar 2010 13:18:16 -0800 (PST)) it happened " snipped-for-privacy@fonz.dk" wrote in :
would *you*
Interesting, should try that sometime. Do you think calling it Osama would close many doors? Maybe open ones in an other part of the world? Think I am going to try 'santa' too with wget. LOL And Obama. Mr Kim?
On a sunny day (Sat, 06 Mar 2010 17:46:44 -0500) it happened Bitrex wrote in :
Unless I made grove mistakes in my robots.txt, google never gave a shit, It does look at it, that is true. And also it is a hasle to have to maintain a robots.txt, new file types are added all the time, and sometimes you do not want large files with extention .something scanned, like huge .pfd, or tgz, but the small ones are OK, just to reduce bandwidth. I think there is no possibility for file size limit in robots.txt, it is a very limited instrument. I mainly depend on my firewall, 3691 lines, now, makes for more then 1500 bad guys, or IP ranges of bad guys. There are on average 2 to 3 attacks on the site per day, and 1 or 2 per month now on the nameserver, that part is protected by scripts. And of course the usual mail junk attempts. I have closed many ports, so the ssh attacks and tha tsort of stuff no longer cause problems, that was very intense too. Lst year there were huge DNS attacks, that is when I wrote scripts to counter that, mainly political motivated stuff I think, as I posted to us.politics too during the election. Maintaining a website without becoming part of a botnet takes some doing.
On a sunny day (Sat, 6 Mar 2010 15:01:15 -0800 (PST)) it happened mpm wrote in :
The way I have read it, is they scan other pages for references to some words (probably use a list of likely candiadates). For example in my site on one page you can read: 'log in here with user name '...' and password ',,,''. Google has written software to evaluate that, and then enters the pages. The reason for those password protected pages is that it is only for humans, to reduce traffic load by keeping google & other spiders out. Also as part of the net traffic goes via googles own networks, they can, if it is plaintext, simply scan for logins, and make a list.
If you mean NOAA (National Oceanic and Atmospheric Administration), you're talking horse crap.
--
"Electricity is of two kinds, positive and negative. The difference
is, I presume, that one comes a little more expensive, but is more
durable; the other is a cheaper thing, but the moths get into it."
(Stephen Leacock)
It's NOAA. But where do you get all those urban legends from? Those pamphlets at the cash register that have the front page news that an alien orbiter just landed?
--
Regards, Joerg
http://www.analogconsultants.com/
"gmail" domain blocked because of excessive spam.
Use another domain or send PM.
--
"Electricity is of two kinds, positive and negative. The difference
is, I presume, that one comes a little more expensive, but is more
durable; the other is a cheaper thing, but the moths get into it."
(Stephen Leacock)
Actually a website is never password protected. The server will always serve you a webpage! It is the content that may be password protected. The software (php, asp, active java pages, etc) the website is build with can decide based on the browser, IP address, etc to ask you for password or not to show the content.
Just read Google's manual on this and the manuals on how robots.txt is to be interpreted... It is really pretty simple.
--
Failure does not prove something is impossible, failure simply
indicates you are not using the right tools...
nico@nctdevpuntnl (punt=.)
--------------------------------------------------------------
ElectronDepot website is not affiliated with any of the manufacturers or service providers discussed here.
All logos and trade names are the property of their respective owners.