Part of my workflow with Internet Archive is to favorite things I go back to a lot. But the fav page there is nigh-unusable, it lists in order from most recent fave to oldest, including duplicates (Huh?), and even sorting by title doesn’t put related things together. So I made a tool, and generated
Internet Archive Favorites which I’ll update every so often.
My first attempt was simply scraping an RSS feed, but they only publish the last 50 faves! Bogus! Even if I cached them, I’d still have to check it often and reorganize things. Then I learned they have a developer interface, usable with an
ia script or right from Python, which is more useful. It’s slow without caching, but after first run it’s very fast, mostly 1 API call.
Read the docs at the top of the script, look at the example config file (almost a Markdown outline, but I do some clever/stupid things in it). As usual license is BSD, an ye harm none, do what thou wilt shall be the whole of the law.
Now all I have to do is write a cfg file:
The stuff I've found that I like on [Internet Archive](https://archive.org), loosely sorted. ## Retrocomputing + Basic_Computer_Games_Microcomputer_Edition_1978_Creative_Computing + More_BASIC_Computer_Games + Basic_Computer_Adventures_1986_MS_Press + Best_of_Creative_Computing_Vol_1_1978_Creative_Computing_Press + creativecomputing …
% archive-fav-extract.py -q mdhughes
And it makes a nice html file, tells me about any errors, and I paste the output file into a wordpress page.