>>76542547> I don't have petabytes of spare storage lying around to mirror the whole Wayback Machine, no.And that's exactly the problem, isn't it? No other entities were created that have anything like the job to replicate this archive.
The other petabyte stoages are pretty much all reserved to botnet marketing data and modern KGB/Gestapo/Stasi services, right?
Likewise, sub-petabyte storages are very disconnected, yours included. Would you share your essential data if the IA went down/do you share it now? Do you have enough metadata associated that it could be fit together with other data into some derivative topical collection like you said would be nice?
>I created a local mirror of content I felt was essential to me, yes.That's more than I expected! Nice - I guess.
> It may very well be impossible.It currently is, yes.
> Projects like IA could do with being a bit more discriminating with what they archive.IA already is. For most sites, they basically just quickly grabbed the front page and a few links. For others, they did a more complete crawl.
What else were they supposed to do?