Install Docker on your system (if not already installed).

Other Options

docker + electron Desktop App (macOS/Linux/Windows)
See below for usage examples using the CLI, Web UI, or filesystem/SQL/Python to manage your archive.

Pacman / pkg / nix (Arch/FreeBSD/NixOS/more)
These packages are contributed by external volunteers and may lag behind the official pip channel.
Arch: yay -S archivebox (contributed by …)
FreeBSD: curl -sSL '' | sh (uses pkg + pip3 under the hood)
Nix: nix-env --install archivebox (contributed by …)
More: contribute another distribution!

See the pip-archivebox repo for more details about this distribution.
See below for more usage examples using the CLI, Web UI, or filesystem/SQL/Python to manage your archive.

# completely optional, CLI can always be used without running a server
# archivebox

Support/consulting is available for individuals, NGOs, academia, governments, journalism, law, and more. It pays for hosting and funds new ArchiveBox open-source development. All our work is open-source and primarily geared towards non-profits.
Contact us if your non-profit institution/org wants to use ArchiveBox professionally: setup & support, team permissioning, hashing, audit logging, backups, custom archiving, etc.

ArchiveBox uses normal filesystem folders to organize archives (no complicated proprietary formats), and offers a CLI + web UI.

ArchiveBox is used by many professionals and hobbyists who save content off the web, for example:
Individuals: backing up browser bookmarks/history, saving FB/Insta/etc.
Journalists: crawling and collecting research, preserving quoted material, fact-checking and review.
Lawyers: evidence collection, hashing & integrity verifying, search, tagging, & review.
Researchers: collecting AI training sets, feeding analysis / web crawling pipelines.

The goal is to sleep soundly knowing the part of the internet you care about will be automatically preserved in durable, easily accessible formats for decades after it goes down.

Free & open source, doesn't require signing up online, stores all data locally.
Powerful, intuitive command line interface with modular optional dependencies.
Comprehensive documentation, active development, and rich community.
Extracts a wide variety of content out-of-the-box: media (yt-dlp), articles (readability), code (git), etc.
Supports scheduled/realtime importing from many types of sources.
Uses standard, durable, long-term formats like HTML, JSON, PDF, PNG, MP4, TXT, and WARC.
Usable as a oneshot CLI, self-hosted web UI, Python API (BETA), REST API (ALPHA), or desktop app (ALPHA).
Also saves all pages to an external archive by default for redundancy (can be disabled for local-only mode).
Advanced users: support for archiving content requiring login/paywall/cookies (see wiki security caveats!).
Planned: support for running JS during archiving to adblock, autoscroll, modal-hide, thread-expand.
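
Because archives are kept in normal filesystem folders, the kind of hashing and integrity verification mentioned above can be approximated with ordinary tools. The following is a minimal sketch using only the Python standard library. It is not ArchiveBox's own hashing feature or API, and the function names (hash_file, build_manifest, changed_files) and the idea of pointing it at an archive folder are assumptions made for illustration.

```python
import hashlib
from pathlib import Path


def hash_file(path: Path, chunk_size: int = 65536) -> str:
    """Return the SHA-256 hex digest of one file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(archive_dir: Path) -> dict[str, str]:
    """Map each file path (relative to archive_dir) to its SHA-256 digest."""
    return {
        p.relative_to(archive_dir).as_posix(): hash_file(p)
        for p in sorted(archive_dir.rglob("*"))
        if p.is_file()
    }


def changed_files(old: dict[str, str], new: dict[str, str]) -> list[str]:
    """List paths that were added, removed, or modified between two manifests."""
    return sorted(k for k in old.keys() | new.keys() if old.get(k) != new.get(k))
```

A manifest built once with build_manifest(Path(...)) on a hypothetical archive folder can be stored and later compared against a fresh one with changed_files to spot files that were altered, added, or lost.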