Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Interestingly, Craigslist is being consistent when it comes to not allowing scrapers, including Google.

http://www.craigslist.org/robots.txt

edit: oops! they're not actually blocking google from the apartment listings. thanks smackfu.



Aren't they only disallowing particular pages there? Like apartments are at *.craigslist.org/apa/number.html which doesn't seem blocked by that robots.txt.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: