Copyright © 2010 O’Reilly Media
Printed in the United States of America.
Nutshell Handbook, the Nutshell Handbook logo, and the O’Reilly logo are registered trademarks of O’Reilly Media, Inc. !!FILL THIS IN!! and related trade dress are trademarks of O’Reilly Media, Inc.
Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and O’Reilly Media, Inc. was aware of a trademark claim, the designations have been printed in caps or initial caps.
While every precaution has been taken in the preparation of this book, the publisher and authors assume no responsibility for errors or omissions, or for damages resulting from the use of the information contained herein.
- I. Setup
  - 1. Theory
  - 2. Data
  - 3. Agile Tools
    - Scalability = Simplicity
    - Agile Data Processing
    - Setting up a Virtual Environment for Python
    - Serializing Data with Avro
    - Collecting Data
    - Data Processing with Pig
    - Publishing Data with MongoDB
    - Searching Data with ElasticSearch
    - Reflecting on our Workflow
    - Lightweight Web Applications
    - Presenting our Data
  - 4. To the Cloud!
  - 5. Cloud Patterns
- II. Climbing the Stack
  - 6. The Data Value Stack
  - 7. Collecting and Displaying Records
    - Putting it all together
    - Collect and Serialize our Inbox
    - Process and Publish our Emails
    - Presenting Emails in a Browser
    - Listing Emails
    - Searching our Email
  - 8. Visualizing Data with Charts
  - 9. Exploring Data with Reports
  - 10. Making Predictions
  - 11. Driving Actions