{"id":656,"date":"2004-11-29T17:50:07","date_gmt":"2004-11-29T21:50:07","guid":{"rendered":"http:\/\/www.theoreti.ca\/?p=656"},"modified":"2004-11-29T17:50:07","modified_gmt":"2004-11-29T21:50:07","slug":"how-the-wayback-machine-works","status":"publish","type":"post","link":"https:\/\/theoreti.ca\/?p=656","title":{"rendered":"How the Wayback Machine Works"},"content":{"rendered":"<p>The Internet Archive is an amazing database of old web sites. James Chartrand pointed me to an interview with the director of the Internet Archive, Brewster Kahle, from January 21, 2002, titled <a title=\"webservices.xml.com: How the Wayback Machine Works\" href=\"http:\/\/webservices.xml.com\/pub\/a\/ws\/2002\/01\/18\/brewster.html?page=1\">How the Wayback Machine Works<\/a>. The interview is by Richard Koman and is still interesting, especially for those of us interested in text spiders and archives. I was intrigued by Kahle&#8217;s claim that the IA is the largest database in the world: &#8220;It&#8217;s larger than Walmart&#8217;s, American Express&#8217;, the IRS. It&#8217;s the largest database ever built.&#8221;<\/p>\n<p>See my previous post on <a title=\"Internet Archive\" href=\"http:\/\/www.theoreti.ca\/wp-content\/uploads\/notes\/000464.html\">Ghost Sites and the Internet Archive<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Internet Archive is an amazing database of old web sites. James Chartrand pointed me to an interview with the director of the Internet Archive, Brewster Kahle, from January 21, 2002, titled How the Wayback Machine Works. 
The interview is by Richard Koman and is still interesting, especially for those of us interested in text spiders &hellip; <a href=\"https:\/\/theoreti.ca\/?p=656\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">How the Wayback Machine Works<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5],"tags":[],"class_list":["post-656","post","type-post","status-publish","format-standard","hentry","category-history-of-computing-and-multimedia"],"_links":{"self":[{"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/posts\/656","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/theoreti.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=656"}],"version-history":[{"count":0,"href":"https:\/\/theoreti.ca\/index.php?rest_route=\/wp\/v2\/posts\/656\/revisions"}],"wp:attachment":[{"href":"https:\/\/theoreti.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=656"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/theoreti.ca\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=656"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/theoreti.ca\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=656"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}