|
For a while, I've been passively looking for a RSS parser in Python. Lo and behold, when surfing the blogosphere this morning, I came across a recommendation by Aaron Swartz: Orchard. In fact, Orchard looks like something even better than just a RSS processor -- it's a general toolkit for handling node-containing documents (a class that encompasses basically all the documents I care about: METS, RSS, IMS-CP, OPML, OpenOffice.org XML, etc). Looking forward to trying it. Here's the obligatory paragraph blurb on Orchard:
Orchard is a data manipulation framework centered around "nodes". Nodes are quite similar to "objects" in most Object-Oriented languages but emphasize passive data models over methods and behaviors, or object models. Nodes in Orchard languages act just like native container classes -- hashes in Perl, objects in Python.
There are a lot of blogs right now that I know about and read occassionally that aren't on my blogroll yet. I've been wanting to have an integrated solution -- one that holds together references to blogrolls that I accumulate while browsing, materials I've dumped into Ecco, a tool that would let me read off other people's blogrolls, one that also is a RSS crawler and aggregator -- all rolled into one. Perhaps a solution exists already but I just haven't had the time to find it. (I imagine there is, with all the pieces out there to do the job now: RSS aggregators, blogroll indicators and collectors, RSD, trackback, etc.)
Oh LazyWeb, may you make such a solution manifest to us all.
Posted by Raymond Yee on 5/28/03; 8:04:41 AM
from the Web Technology dept.
Discuss (1 response)
|
|