When, How, and Where to Share Data

Go to the full lesson

Exercise

Link Rot

  1. The list of broken links that Gertler and Bullock note (in their 2017 article in PS) that they found in the American Political Science Review (see above-cited article) is part of the data accompanying their PS article; they shared the data through Harvard Dataverse here in the file “brokenReproducibilityURLs.tab”. Download the file, open it in Excel, and find the first five links marked as “Did not find resource”. What sorts of websites are these? Can you still find the linked content using the Internet Archive’s Way Back Machine?
  • show solution
    1. You’ll have noted that the broken links lead to all sorts of different websites: personal, institutional, even government websites. In some cases (such as Erik Voeten’s Princeton page and the Polish Center for Public Opinion Research CBOS), the Way Back Machine is not able to find any archived copy. You may still be able to track down this information by looking at the current sites for such individuals and institutions – or you may not. In other cases, you can access the webpage, such as Richard Tucker’s page at Vanderbilt, but the actual data aren’t archived there. For the Correlates of War project, you are able to download the full data from the Internet Archive. You could also go to the project’s current site, but would you be able to find the version of the data mentioned in the 2000 APSR article there? The internet offers you powerful tools to retrieve old information, but even with those tools, the content behind some links – containing key parts of the scholarly record – may be lost forever. You should have strategies for tracking down such lost data, but more importantly, you should make sure that the same fate does not befall your data and links. The best way to do this is to deposit your data in a repository.