Hello fedi

I would like to find a way I can have a cold copy of articles that I view, something like a self-hosted archive.is … but it does it automatically for every site I visit.

Please help me find out how or where!

@fluffy

1) Yes, you can do this up to a point. However, manual capture works better than automatic.

2) One approach is Pale Moon plus ScrapBook X. This approach is simple, but it only works for some sites.

3) For near-perfect captures, learn to use WARC toolsets. I've tested openwayback and pywb and suggest starting with those two. This isn't plug and play but I've gotten it to work pretty well.

4) Links:

github.com/webrecorder/pywb
github.com/iipc/openwayback/wi
loc.gov/preservation/digital/f

Follow

@Gamercat @fluffy

I think that openwayback per se is designed more for interaction with full web browsers but there are parts of these toolsets that you can run in CLI. Lynx could be used to trigger captures by way of CLI scripts.

· · Web · 1 · 1 · 1
Sign in to participate in the conversation
Minetest Tooter

Mastodon server for creative rational­ists. In­tend­ed for light or tech­nical discussion as op­pos­ed to strong debate.

Discouraged: Identity poli­tics, religion, profanity, national events, X-rated dis­cus­sion or materials. We might set up other ser­vers for those things.

Encour­aged: Crea­tive Com­mons works, Mine­test and other FOSS games, FOSS, writers, artists, reci­pes, rhymes, cat photos, G-rated web­comics.