I'm a member of the copyleft FFFFF.AT Lab and study the science of the internets at Rocketboom.
I teach the Internet Famous Class at Parsons, where your grade depends on your online popularity.
As seen on
NBC,
TIME,
CNN,
Gawker,
BuzzFeed
, ArtNews
posts tagged 'scraping'
Update 08/15: code is now on GitHub
Update 01/17: new version online that fixes a few issues; download
Been using e-z blog Tumblr lately: jamiew.tumblr.com
Pleased with the ease of use & reblogging functionality, but unbelievably disappointed by the lack of RSS for the dashboard!
Voilá a ruby script to login to yr Tumblr account, scrape the last 50 posts or so, and output as RSS.
usage:
I’m avoiding setting this up as service; just wanted to put the code out there.
Rocketboom-affiliated women are in 4 of the top 20 entries for Wired’s Sexiest Geek 2007 contest — go vote, help us bring home the gold!




stats monitor

Bootstrap your career in data hacking! With Ruby and WWW::Mechanize you can get started collecting data on the web with just a few lines of code.
download Jdubs’ mechanize scrapers 1.0 — simple scraping examples for MySpace, YouTube, and torrent index BTjunkie.
Techniques for exploring a web page, Ruby & gem installation, and explanations of the simple extractors below.