
The Internet Archive's ten petabyte celebration

The Internet Archive - the Internet's library and home of the Wayback Machine - celebrated the addition of the 10,000,000,000,000,000th byte to its massive collections at its San Francisco home.
By Violet Blue, Contributor on
1 of 12 Violet Blue/ZDNet

The Internet Archive has been housed in a former Christian Science church since 2009. On the night of its Ten Petabyte Party, power went out in the entire neighborhood. PG&E restored it only at the end of the party, so all speeches and presentations were held with improvised lighting and power.

2 of 12 Violet Blue/ZDNet

A petabyte is a thousand terabytes, or a million gigabytes. The Internet Archive uses custom-made petabox servers to store its data (a petabox comprises ten racks, with each rack holding thirty-eight three-terabyte hard drives).
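
For readers who want to check the numbers, here is a minimal sketch of the arithmetic, assuming decimal (SI) units and taking the rack and drive counts exactly as quoted in the caption. The constant names are made up for illustration, and the result is raw drive capacity only.

```python
# Storage arithmetic in decimal (SI) units, using the figures from the caption.
TB = 10**12        # bytes in one terabyte
PB = 1000 * TB     # bytes in one petabyte (a thousand terabytes)

# Petabox layout as described in the caption (illustrative constant names).
RACKS_PER_PETABOX = 10
DRIVES_PER_RACK = 38
TB_PER_DRIVE = 3

# Raw capacity of one petabox, before any redundancy or formatting overhead.
petabox_tb = RACKS_PER_PETABOX * DRIVES_PER_RACK * TB_PER_DRIVE

# The milestone being celebrated: ten petabytes, expressed in bytes.
ten_petabytes = 10 * PB

print(f"Raw capacity of one petabox: {petabox_tb} TB ({petabox_tb / 1000:.2f} PB)")
print(f"Ten petabytes in bytes: {ten_petabytes:,}")
```

By these figures, one petabox holds a little over a petabyte of raw disk, and ten petabytes works out to the 10,000,000,000,000,000 bytes mentioned in the introduction.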

3 of 12 Violet Blue/ZDNet

The atmosphere was joyous in the huge, dark former church, and the pews were packed with supporters, fans and volunteers.

4 of 12 Violet Blue/ZDNet

The Archive's book scanning room is outside the main building and around the corner, in the former Christian Science reading room.

5 of 12 Violet Blue/ZDNet

The Archive's repurposed reading room makes lovely, modernized use of the antique fixtures.

6 of 12 Violet Blue/ZDNet

The Archive wants to create the world's largest library and uses a scanning system called Scribe. The San Francisco scanning room is one of many worldwide that contribute to the Archive, all of which combine to scan over 1,000 books a day (47 books an hour, or one book every 90 seconds).
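
The scanning figures are rounded, and the caption does not say how many hours a day the scanning rooms run. As a rough sketch, assuming the rates are averaged over a full 24-hour day (an assumption, not something the Archive states), the conversions between them look like this. The function names are hypothetical.

```python
# Convert between the scanning rates quoted in the caption.
# Assumption: rates are averaged over a full 24-hour day.
SECONDS_PER_DAY = 24 * 60 * 60


def books_per_hour(books_per_day: float) -> float:
    """Average hourly rate for a given daily total."""
    return books_per_day / 24


def seconds_per_book(books_per_day: float) -> float:
    """Average time between finished books for a given daily total."""
    return SECONDS_PER_DAY / books_per_day


def books_per_day_from_hourly(hourly_rate: float) -> float:
    """Daily total implied by an hourly rate."""
    return hourly_rate * 24


print(f"1,000 books/day = {books_per_hour(1000):.1f} books/hour")
print(f"1,000 books/day = one book every {seconds_per_book(1000):.1f} seconds")
print(f"47 books/hour   = {books_per_day_from_hourly(47):.0f} books/day")
```

Under that assumption, 47 books an hour comes to a little over 1,000 books a day, and 1,000 books a day is roughly one book every minute and a half.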

7 of 12 Violet Blue/ZDNet

The Internet Archive's goal is to make and preserve one copy of every published work it is able to attract or acquire - books, movies, records, everything.

8 of 12 Violet Blue/ZDNet

The Archive's Scribe scanning system is available as a service, and is non-destructive. The software that powers it is available on SourceForge. FYI, the Scribe software hasn't been updated in a while and was engineered specifically for the custom hardware IA uses in its scanning room.

9 of 12 Violet Blue/ZDNet

Fun fact: the Internet Archive is known for archiving copies of the world's websites with its Wayback Machine. But did you know it is a bigger source of public domain e-books than Google? It has over one million torrents, too.

10 of 12 Violet Blue/ZDNet

The scanning room at the end of the party. The Archive's scanning services offer open and free online access, permanent storage and lifetime file management. It was heartwarming to see how much the volunteers love their work, and how they geeked out with big smiles whenever someone asked a question.

11 of 12 Violet Blue/ZDNet

The downstairs office at the Internet Archive. The Archive was established in 1996, is a non-profit, believes in free and open access to knowledge for all, and is dedicated to preserving the internet. I liked seeing an Iron Man mask at one Archive employee's desk.

12 of 12 Violet Blue/ZDNet

The Archive is making 80 terabytes of archived web crawl data available for research. Keep up with more of the Archive's history-making archival activities by watching the Archive's blog or following the Internet Archive on Twitter.
