« HuffPo's Shaw Renounces Pacifism | Main | New Problems For Greg Mitchell Of E&P »

Saturday, August 26, 2006

Comments

Feed You can follow this conversation by subscribing to the comment feed for this post.

Dan, the original text was captured by the Internet Archive (h/t/ Ed Driscoll), and I have now now linked to that.

It just gets worse. An LGF commenter noticed that Niagra Falls was shut off in 1969, not 1967. Which makes him 21, I guess.

CY,

Someone is in serious CYA mode because the Wayback archive results now suggest that the edits took place between November 2003 and January 2004. I read the comment by Lewis on your site and looked at the Wayback FAQ to see what I could find about the date stamp in every URL at Wayback.

Wayback states:

How did I end up on the live version of a site? or I clicked on X date, but now I am on Y date, how is that possible?

Not every date for every site archived is 100% complete. When you are surfing an incomplete archived site the Wayback Machine will grab the closest available date to the one you are in for the links that are missing. In the event that we do not have the link archived at all, the Wayback Machine will look for the link on the live web and grab it if available. Pay attention to the date code embedded in the archived url. This is the list of numbers in the middle; it translates as yyyymmddhhmmss. For example in this url http://web.archive.org/web/20000229123340/http://www.yahoo.com/ the date the site was crawled was Feb 29, 2000 at 12:33 and 40 seconds.

As noted by Lewis over at CY, the closest date to the 2004 versions resolves to a URL that indicates mediainfo was archived today.

More importantly, if you go to the Wayback history page:http://web.archive.org/web/*/http://www.mediainfo.com/editorandpublisher/headlines/article_display.jsp?vnu_content_id=1891030

you'll note two '2004' edits, one saying 'Jan 17, 2004', and the other saying 'Nov 28, 2004'. When you click on each of those, you'll note that the pages actually returned show those MMDDYYHHMMSS stamps mere minutes apart:

Jan 17, 2004 -> 2006-08-26 23:07:07
Nov 28, 2004 -> 2006-08-26 22:56:52

The allegedly older of these two pages was crawled by the Wayback Machine 12 minutes and change AFTER the one that was supposed to be 10 months newer!

Not only did they try to cover their tracks, but they did so in a very clumsy manner. I can see Wayback just not crawling this page for two years, even though they normally will check back every few months. But when the 'newer' one was crawled before the 'older' one, it is proof of backdating.

I don't think so, Monster. I think Wayback is pulling the content from the actual site. Look at their FAQ regarding what they do if they have crawled a site and noted a change on a particular date, but do not archive the page that has changed.

I think what you're seeing in the URL is the timestamp of when you surfed the page.

At least, that's how I understand the mechanism to work according to the FAQ. Wayback either pulls the most recent archive they have or a live version.

The comments to this entry are closed.

Donations Appreciated

Infolinks

Blog Ads


Syndigo

AdSense

Search

Wikio Top Fifty

  • Wikio - Top Blogs - Politics

Memeorandum

Find the best blogs at Blogs.com.

Blog Roll

November 2009

Sun Mon Tue Wed Thu Fri Sat
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30          

Technorati


Blog powered by TypePad

2006 Weblog Awards