Roundup of open-source crawler software: http://blog./uid-22414998-id-3774291.html

Scrapy is a fast, high-level screen scraping and web crawling framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

The powerful Scrapy crawler framework (Python): http:///
Python crawling framework: Scrapy's architecture: http://www./scrapy-architecture.html
Large-scale crawling with Scrapy: http://www./blog/archives/500
Scrapy beginner tutorial: http://www.cnblogs.com/txw1958/archive/2012/07/16/scrapy-tutorial.html
A Scrapy example: https://github.com/scrapy/dirbot
A simple implementation of a distributed focused-crawling cluster: https://github.com/agathewiky/spider-roach