The Internet Archive reportedly has over 50 petabytes of data archived. That seriously, mind-bogglingly, utterly not very small at all.
@liw Makes me wonder what the write-to-read ratio of that content is. I suspect it's far above 1.
345 billion web pages saved. That compares with 39 million books in the US Library of Congress (largest collection in the world) and about 130 million books ever published (growing at 300k/yr trad, 1.5m/yr nontrad publishing).
TIA's 2016 report suggests ~90 kB/pg, or ~2% of a book (~5 MB PDF text).
TIA have 5 billion books worth of data.
It's just like my workplace!
Takes a while to rebalance storage, even with pretty snappy networking.
@maswan I'm not at all envious. Not even a little bit.
@liw hehe backups hehe
@Jmtd I don't think I have.
@liw it's a git-annex powered, crowd sourced effort to back up Internet archive. Joey was involved in some way
Nasqueron is a budding community of creative people, writers, developers and thinkers. We focus on free culture, ethics and to be a positive change. We share values like respect, justice and equity.