Preserving Your Work

saving what you can

If You Have Time

Fifteen years ago, a file-sharing system for scientific data based on the BitTorrent protocol failed to find wide adoption because of the (quite reasonable) association in many institutions' collective minds between BitTorrent and illegal downloading [Langille2010]. At the time of writing, though, many individuals and groups are turning to BitTorrent to share datasets that are at risk of disappearing because of the protocol's resilience.

If You Don't

Normalized Deviations

Exercises

Write a Last Known Good State Document

Each participant drafts a one-page "last known good state" document for their own project (or for Vaida's erosion data project if they prefer not to use their own):

  1. What does this project do, and what problem does it solve?
  2. What is currently working?
  3. What is broken or incomplete?
  4. Where does the data or code live, and how would someone get it?
  5. What are the most important things a successor would need to know?

Compare drafts in groups of three: what did others ask about that you had not thought to include?

Normalized Deviations List

Work individually for ten minutes, then debrief in groups:

  1. Write down three things in your project that everyone on the team knows are wrong but that you have all learned to live with (e.g., a dependency nobody wants to touch or a process that only works because one person knows the trick).

  2. For each item, explain whether you would document it honestly in the last known good state document, fix it before shutdown, or leave it undocumented?

Which items were hardest to decide about? What did you want to hide, and why? What would a successor most need to know?