Bend Bulletin RSS feed

Quick public service announcement: I’ve hacked together an RSS feed for the Bend Bulletin. It’s a first-pass, I’m scraping their Local, Business and Sports pages and building a summary feed only. If I have time, I may go one step further and pull each article on those pages, and provide a full-text feed.

Either way, here’s the RSS feed link [ed. note: no longer valid]. Enjoy!

Blog bot roundup

The variety is amazing: here’s a list of various agents, spiders and bots that I’ve culled from my chuggnutt.com logfiles over the last 30 days that have to do with RSS and/or blogs (specifically blogs, not just general purpose spiders like Google’s). These are only the ones I know for sure are blog or RSS related; others in my logs might be also, but aren’t obvious about it.

Geek types, note that these strings (with wildcards mostly) can be used as-is when identifying HTTP_USER_AGENT.

  • Bloglines: The web-based feed reader/aggregator
  • kinjabot: The (currently) beta bot for the Kinja weblog directory/guide
  • Feedreader: Windows-based feed reader/aggregator
  • PubSub.com RSS reader: Another searchable, web-based aggregator
  • FeedDemon: Windows-based feed reader/aggregator
  • fastbuzz.com: Fastbuzz News is another web-based aggregator that scans news and blogs
  • ORblogs.com-bot and ORblogs-bot: The crawlers for ORBlogs which compile metadata and RSS for the aggregating site
  • SharpReader: Windows-based feed reader/aggregator
  • Technoratibot: Technorati‘s crawler
  • UniversalFeedParser: Mark Pilgrim‘s liberal feed parser which is used in a variety of RSS software
  • Feedster Crawler: Feedster’s RSS spider
  • BlogBot: I think this is Blogdex‘s crawler, but I’m not totally sure
  • BlogPulse: Yet another blog/RSS crawler and indexer
  • Slower, Friendlier Spiders (BlogShares V1.35): The spider for BlogShares, the fantasy share market for blogs
  • NITLE Blog Spider: The National Institute for Technology and Liberal Education‘s spider for their blog census
  • LocalfeedsPageCrawler
  • NusEyeFeedCrawler

Oregon Weblogs

Just wanted to give a few plugs and props to one of the better weblog-related sites out there, Oregon Blogs.

I thought about trying to describe what it does, but the best I could come up with is that it’s equal parts RSS aggregator, group blog, and weblog directory rolled into one; really, the way to find out about ORBlogs is to just visit it. It’s one of the more solid, useful and innovative blogging apps I’ve seen (even if it is written in ASP! :) ), and continues to surprise me with new features; for instance, clicking on the “info” link for a blog reveals a detail-rich page devoted to that site, including a snapshot of what the site looks like, state and city-scale maps showing where the blog lives, metatag data gleaned from the HTML source (much of it clickable), and the most recent blog posts.

Definitely a great site. Worth looking at even if you’re not from Oregon.

SharpReader Gone

SharpReader is outta here. Last night’s crash wasn’t a fluke; after I downloaded the latest version and restored my feeds, I went back and added two Amazon feeds and sure enough, it crashed again. Turns out the URLs for Amazon’s feeds are too long for SharpReader to handle.

The worst part is, after last night’s second crash, SharpReader wouldn’t even start back up at all—not last night, not today, nothing. So, it’s gone and I’m done with it. Won’t be going back.

Right now I’m playing with FeedDemon. Seems pretty nice so far.

SharpReader Crashed

Grrr… SharpReader, the news reader I use to read RSS feeds, just crashed on me, and lost all my feeds—data and URLs. After I’d added four of the new Amazon feeds. Shit.

Oh, well. Fortunately, I had a recent backup of the OPML for my feeds, so I was able to get them back quickly.

Amazon RSS

Another piece of news everyone pointed to yesterday: Amazon is now offering RSS feeds. A list of all their feeds can be found at the Amazon.com Syndicated Content page. Looks like they’re offering feeds for each top-level category in their hierarchy. The next logical step, of course, would be to offer a personalized RSS feed of your recommendations…

RSS as Poor Man’s Copyright

These ideas have been rolling around my brain for a while, fermenting, percolating, but bear with me if they might still be a little incoherent. It’s really the first time I’ve put words to them.

Perhaps you’ve heard of the “poor man’s copyright.” The idea behind it is to provide yourself copyright protection without actually registering your work, typically by mailing yourself a copy of your work with the idea that the postmark on the envelope will be enough to prove the copyright. I rather like the idea behind this concept, although in reality there is no legal provision for the poor man’s copyright and holds no legal weight whatsoever. In practice it would be easy to fake a copyright in this way.

But this idea of being able to prove and protect the copyright on your creative work (short of registering it) is a powerful one, so it’s natural to transfer the poor man’s copyright concept to the computer. The problem is, it’s even easier to fake a datestamp on an electronic file than on an envelope full of materials, so just relying on Word files on your computer is out.

You could borrow the idea of the PMC more literally and email your work to yourself—or better yet, to someone else. That would provide a better claim to credibility than files on a disk, but it’s far from foolproof—dates can be altered and forged on emails too. But now we’re moving in the right direction. And that’s where RSS comes in.

Post your work into an RSS feed that has a decent number of subscribers, the more the better. Their aggregators regularly ping and download your RSS feed, and your work is suddenly distributed among dozens—hundreds—thousands of computers and users, each instance of your work (ideally) stamped with the date and time it was downloaded (important note here: an item in an RSS feed can claim any arbitrary creation date, so that’s why it’s important to disinguish the download date at the aggregator level). There would be a standard deviation of several hours to several days, perhaps, of these datestamps. But what would you have? A distributed, decentralized, and dated web of your copyrighted work, collectively becoming a digital postmark on the proverbial envelope.

Fakeable? Sure, if you had access to a small number of controlled computers. But the larger the audience, especially a well-distributed one, the less able you would be to pull this off. That’s the beauty of this system: for a large enough set, the likelihood of faking or gaming the system approaches zero. There’s no single point of failure or vulnerability.

Other drawbacks? Well, you’d have to have a fairly large audience downloading your RSS feed regularly. That’s a bit of a trick. RSS aggregators would have to be sure to accurately record the download date of the feed. Also, anytime you wanted to back up a claim, you’d somehow have to mobilize enough of this audience to check their aggregator archives and confirm your claim in a timely manner and communicate this assertion to the other party securely and independently. Details, details. :-)

Would RSS PMC be any more legal and provide real protection over regular old PMC? In practice, I doubt it. Again, it’s the idea that’s powerful here and takes us to the next step. You’d have a peer-reviewed network where the group could at anytime confirm or deny the validity of what you claim. An online archived record distributed among thousands of computers of everything you created and loaded into your RSS feed.

It’s a double-edged sword, too. If you tried to plagiarize someone else’s work and claim it as your own, you’d have the community calling foul and moving against you. And the community has a long memory.

Suddenly, this sounds a lot like an online reputation system, doesn’t it? Once you get started thinking about this stuff, the ideas just start rolling out. That’s the beauty of this RSS thing—the possibilities and potential it creates.