Resource:
ScraperWiki
- Links: Website | On Twitter | Facebook
- Tags: Public Information, Web Scraping
ScraperWiki is a site and platform for building data scrapers that can transform unstructured data such as plain text files or tables on web pages (or even in PDFs) into structured data that can be queried with an API in JSON or XML format. Scrapers can be written in Python, Ruby, or PHP, and can be edited by anyone who has registered for the site (the “wiki” in the name). Non-programmers can request a data set and let the community help by putting together a working scraper. Source: Journalism Accelerator
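To give a feel for what such a scraper looks like, here is a minimal sketch in Python: it fetches a web page, pulls rows out of an HTML table, and collects them as structured records. The URL, table layout, and field names are made up for illustration, and ScraperWiki's own datastore helpers are not shown; a real scraper would be adapted to the page it targets.

    # Minimal scraper sketch (not ScraperWiki's own helper API):
    # fetch a page, read rows out of an HTML table, and build
    # structured records that could then be saved to a datastore.
    import requests
    from bs4 import BeautifulSoup

    URL = "http://example.com/licensing-applications"  # hypothetical page

    def scrape():
        html = requests.get(URL, timeout=30).text
        soup = BeautifulSoup(html, "html.parser")
        records = []
        table = soup.find("table")
        if table is None:
            return records
        for row in table.find_all("tr")[1:]:  # skip the header row
            cells = [td.get_text(strip=True) for td in row.find_all("td")]
            if len(cells) >= 3:
                records.append({
                    "applicant": cells[0],       # illustrative column names
                    "address": cells[1],
                    "date_received": cells[2],
                })
        return records

    if __name__ == "__main__":
        for record in scrape():
            print(record)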
The Journalism Accelerator is not responsible for the content we post here, as excerpts from the source, or links on those sites. The JA does not endorse these sites or their products outright but we sure are intrigued with what they’re up to.
Topics: Resources, Technology
Weigh In:
1 comment so far.
The ScraperWiki API now has an option to produce RSS feeds as an output format (i.e. instead of JSON, CSV or HTML tables).
For example, Anna made a scraper that gets alcohol licensing applications for Islington in London. She wanted an RSS feed to keep track of new applications using Google Reader. Read more about it on their blog: http://blog.scraperwiki.com/2011/09/21/make-rss-with-an-sql-query/
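For a rough idea of how that works, the sketch below asks the ScraperWiki API for RSS output by aliasing datastore columns to the feed fields (title, link, description, date) in an SQL query. The scraper name, column names, endpoint path, and parameter names shown here are assumptions for illustration; the blog post linked above describes the real details.

    # Illustrative sketch only: request RSS output from the ScraperWiki API
    # by aliasing table columns to feed fields in an SQL query.
    # The scraper name, column names, endpoint path, and parameter names
    # below are assumptions, not a documented reference.
    import urllib.parse
    import requests

    SQL = (
        "SELECT premises AS title, detail_url AS link, "
        "description AS description, date_received AS date "
        "FROM swdata ORDER BY date_received DESC"
    )

    params = {
        "name": "islington_licensing",  # hypothetical scraper name
        "format": "rss2",               # RSS output instead of JSON/CSV/HTML
        "query": SQL,
    }
    url = ("https://api.scraperwiki.com/api/1.0/datastore/sqlite?"
           + urllib.parse.urlencode(params))

    # Print the RSS XML, which a feed reader such as Google Reader could poll.
    print(requests.get(url, timeout=30).text)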