Link Rot

Go to the full lesson

Exercise

Link Rot

  1. The list of the broken links that Gertler and Bullock found in the American Political Science Review (see above-cited article) is part of the data accompanying the article that they shared through Harvard Dataverse here in the file “brokenReproducibilityURLs.tab”. Download the file, open it in Excel, and find the first five links marked as “Did not find resource”. What sorts of websites are these? Can you still find the linked content using the Internet Archive’s Way Back Machine?
  • show solution
    1. You’ll have noted that the broken links lead to all sorts of different websites: personal, institutional, even government websites. In some cases (such as Erik Voeten’s Princeton page and the Polish Center for Public Opinion Research CBOS), the Way Back Machine is not able to find any archived copy. You may still be able to track down this information by looking at the current sites – or you may not. For other cases, you can access the webpage, such as Richard Tucker’s page at Vanderbilt, but the actual data aren’t archived there. For the Correlates of War project, you are able to download the full data from the Internet Archive. You could also go to the project’s current site, but would you be able to find the version of the data corresponding to the 2000 article there? The internet offers you powerful tools to retrieve old information, but even with those tools, the content behind some links – containing key parts of the scholarly record – may be lost forever. You should have strategies for tracking down such lost data, but more importantly, you should make sure that the same fate does not befall your data and links. The best way to do this is to deposit your data in a repository.