A few weeks ago, I attended the O’Reilly Strata conference, where Werner Vogels, CTO of Amazon, gave a keynote. He spoke about big data – how to collect, store, organize, analyze and share data. In his talk (at minute 13:04) he covered how enterprises are using Mechanical Turk to organize their data, and specifically how you can leverage the hundreds of thousands of Workers on Mechanical Turk to:
- Verify your data
- Correct data
- Enrich data by adding meta information to your data set
- Filter data through content moderation
He then shared a common case study of how a company managing millions of business listings incorporated Mechanical Turk into their data management workflow to optimize the accuracy of their data and improve their customer experience. In the scenario he described, the provider of business listings built a data processing engine that automatically sent data exceptions to Mechanical Turk. These exceptions were reviewed and corrected by Workers; the Workers' answers were automatically validated, placed back in the data flow and then published on the company's website.
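To make that workflow concrete, here is a minimal Python sketch of the exception loop. It is an illustration only, not the system from the keynote: the `Listing` type, the phone-number rule in `is_exception`, and the agreement threshold in `validate_answers` are all hypothetical, and `collect_answers` is a stand-in for whatever code would actually publish a task to Mechanical Turk and gather Worker responses. The talk says answers were "automatically validated" without saying how; simple agreement between Workers is assumed here.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Listing:
    """Hypothetical business listing with a single field to check."""
    listing_id: str
    phone: str


def is_exception(listing):
    """Hypothetical rule: flag listings whose phone lacks exactly 10 digits."""
    digits = [c for c in listing.phone if c.isdigit()]
    return len(digits) != 10


def validate_answers(answers, min_agreement=2):
    """Accept the most common Worker answer only if enough Workers agree.

    Agreement-based validation is an assumption made for this sketch.
    """
    if not answers:
        return None
    value, count = Counter(answers).most_common(1)[0]
    return value if count >= min_agreement else None


def process(listings, collect_answers):
    """Route exceptions to Workers and merge validated corrections back.

    collect_answers is a placeholder callable that takes a Listing and
    returns a list of Worker answers (strings).
    """
    result = []
    for listing in listings:
        if is_exception(listing):
            corrected = validate_answers(collect_answers(listing))
            if corrected is not None:
                listing = Listing(listing.listing_id, corrected)
        result.append(listing)
    return result
```

Listings that pass the automatic check never leave the pipeline; only the exceptions are sent out, and a correction is merged back only when at least `min_agreement` Workers give the same answer. Otherwise the original record is left untouched.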
You can see all of the keynotes with transcripts on the O’Reilly summary page – click on the purple icon to read the transcript.
Thanks to the Mechanical Turk partners who provided the transcriptions for the conference! Here’s the transcript from Werner’s presentation.