What does HackerNews think of waybackpack?

Download the entire Wayback Machine archive for a given URL.

Language: Python

Thank you! But the only thing that really deserves credit is Jeremy Singer-Vine's https://github.com/jsvine/waybackpack library. It pretty much made this a very straightforward task.

> Is there some straightforward way to list all of archive.org's snapshots (of a particular site) without a javascript-enabled browser?

I use https://github.com/jsvine/waybackpack.

  $ waybackpack --list https://diziet.dreamwidth.org/11840.html
  ...
  https://web.archive.org/web/20220727234836/https://diziet.dreamwidth.org/11840.html
  https://web.archive.org/web/20220728045504/https://diziet.dreamwidth.org/11840.html
  https://web.archive.org/web/20220728084126/https://diziet.dreamwidth.org/11840.html
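
If you want to post-process that listing, each line splits cleanly into a capture time and the original URL. A minimal Python sketch, assuming every line has the same shape as the ones shown above (a 14-digit timestamp after `/web/`); `parse_snapshot_url` is a hypothetical helper name, not part of waybackpack:

```python
import re
from datetime import datetime

# Shape assumed from the listing above:
# https://web.archive.org/web/<14-digit timestamp>/<original URL>
SNAPSHOT_RE = re.compile(r"^https://web\.archive\.org/web/(\d{14})/(.+)$")


def parse_snapshot_url(snapshot_url):
    """Return (capture_time, original_url) for a Wayback snapshot URL.

    The returned datetime is naive; no timezone is attached here.
    """
    match = SNAPSHOT_RE.match(snapshot_url.strip())
    if match is None:
        raise ValueError(f"not a recognised snapshot URL: {snapshot_url!r}")
    timestamp, original_url = match.groups()
    return datetime.strptime(timestamp, "%Y%m%d%H%M%S"), original_url
```

Lines that don't match (such as the `...` in the output above) raise `ValueError`, so filter or catch as needed.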

Which paid services are you referring to? It is likely that these services aren't distributing the projects they are based on. If so, they are in compliance with the licenses of the open source projects, which probably don't require attribution unless you distribute them.

This project started in 2015 btw. Another similar project called waybackpack started in 2016. There are probably more projects. IMO wayback-machine-downloader is the better project though.

https://github.com/jsvine/waybackpack

The Wayback CDX Server API these projects are based on is quite simple to use btw. It's just some JSON responses to decode.

https://archive.org/help/wayback_api.php https://github.com/internetarchive/wayback/blob/master/wayba...
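
To give a rough idea of what "just some JSON responses to decode" looks like, here is a minimal Python sketch using only the standard library. It assumes the CDX endpoint at `https://web.archive.org/cdx/search/cdx` with `output=json`, where the first row of the response is a header naming the columns. The function names are made up for illustration, and this is not how waybackpack or wayback-machine-downloader are necessarily implemented:

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint for the Wayback CDX Server API.
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_query_url(page_url, limit=None):
    """Build a CDX query URL asking for JSON output."""
    params = {"url": page_url, "output": "json"}
    if limit is not None:
        params["limit"] = str(limit)
    return CDX_ENDPOINT + "?" + urllib.parse.urlencode(params)


def rows_to_snapshot_urls(rows):
    """Turn a decoded CDX JSON response into snapshot URLs.

    With output=json the first row is a header naming the columns,
    so the column positions are looked up rather than hard-coded.
    """
    if not rows:
        return []
    header, *records = rows
    ts = header.index("timestamp")
    orig = header.index("original")
    return [f"https://web.archive.org/web/{r[ts]}/{r[orig]}" for r in records]


def list_snapshots(page_url, limit=None):
    """Fetch and decode the snapshot list for page_url (needs network access)."""
    query_url = build_query_url(page_url, limit)
    with urllib.request.urlopen(query_url, timeout=30) as response:
        body = response.read().decode("utf-8")
    return rows_to_snapshot_urls(json.loads(body) if body.strip() else [])
```

Something like `list_snapshots("https://diziet.dreamwidth.org/11840.html", limit=10)` should then return URLs of the same form `waybackpack --list` prints, with no JavaScript-enabled browser involved.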