An Awesome List for getting started with web archiving
Updated Apr 9, 2025
Wayback Machine API interface & a command-line tool
WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.
Squidwarc is a high-fidelity, user-scriptable archival crawler that uses Chrome or Chromium in headed or headless mode
A list of software, literature, and other content related to Memento
Parse and create Web ARChive (WARC) files with Node.js
A dockerized, queued, high-fidelity web archiver based on Squidwarc
Various Jupyter notebooks about Common Crawl data
Quick Cache and Archive search buttons
metawarc: a command-line tool for extracting metadata from files stored in WARC (Web ARChive) archives
A social media open post web archiving tool
Awesome list dedicated to digital and data preservation tools, sources, services and so on.
Digital Preservation of HTTP in documentary heritage.
Decentralized web archiving
Seeder - Czech webarchive curating tool and public site
A JavaScript script for fighting link rot and content drift using link decoration and web archives.
File-Based Reference Filing System.
A tool for detecting viruses and NSFW material in WARC files
Parser for WARC (aka WebArchive) files