*   \   &   ©   |   /   * *   \   &   ©   |   /   * *   \   &   ©   |   /   * *   \   &   ©   |   /   * *   \   &   ©   |   /   *

posts tagged 'scraping'

RSS feed of your Tumblr dashboard

Update 08/15: code is now on GitHub

Been using the blogging service Tumblr lately: jamiew.tumblr.com
Very happy with the simplicity (and reblogging!) but disappointed by the lack of RSS for the dashboard!

Voilá a quick Ruby script to login to your Tumblr account, scrape the last 50 posts or so, and output as RSS.

>> tumblr-dashboard-rss.rb

usage:

  • set the email & password variables
  • put in a cgi-bin on your server, or
  • use an RSS reader like Vienna or NetNewsWire that can parse RSS from a local file

I’m avoiding setting this up as service; just wanted to put the code out there.

Rocketboom owning Sexy Geeks 2007

Rocketboom-affiliated women are in 4 of the top 20 entries for Wired’s Sexiest Geek 2007 contest — go vote, help us bring home the gold!



stats monitor


How to scrape MySpace, YouTube, BitJunkie

scrape, scraping, screenscraping, web scraping, scrape myspace, scrape youtube, scrape torrents, scraping code, scraper code, scraper example, ruby, mechanize, ruby mechanize, ruby scraper, mechanize scraper

Bootstrap your career in data hacking! With Ruby and WWW::Mechanize you can get started collecting data on the web with just a few lines of code.

download Jdubs’ mechanize scrapers 1.0 — simple scraping examples for MySpace, YouTube, and torrent index BTjunkie.

Techniques for exploring a web page, Ruby & gem installation, and explanations of the simple extractors below.