Scrapy
1.0

First steps

  • Scrapy at a glance
  • Installation guide
  • Scrapy Tutorial
  • Examples

Basic concepts

  • Command line tools
  • Spiders
  • Selectors
  • Items
  • Item Loaders
  • Scrapy shell
  • Item Pipeline
  • Feed exports
  • Requests and Responses
  • Link Extractors
  • Settings
  • Exceptions

Built-in services

  • Logging
  • Stats Collection
  • Sending e-mail
  • Telnet Console
  • Web Service

Solving specific problems

  • Frequently Asked Questions (FAQ)
  • Debugging Spiders
  • Spiders Contracts
  • Common Practices
  • Broad Crawls
  • Using Firefox for scraping
  • Using Firebug for scraping
  • Debugging memory leaks
  • Downloading and processing files and images
  • Ubuntu packages
  • Deploying Spiders
  • AutoThrottle extension
  • Benchmarking
  • Jobs: pausing and resuming crawls

Extending Scrapy

  • Architecture overview
  • Downloader Middleware
  • Spider Middleware
  • Extensions
  • Core API
  • Signals
  • Item Exporters

All the rest

  • Release notes
  • Contributing to Scrapy
  • Versioning and API Stability

© Copyright 2008-2016, written by Scrapy developers, translated by Summer&Friends. Revision 0ff57e11.

Built with Sphinx using a theme provided by Read the Docs.