7.x to CLAW Migration Sprint - Complete!

The Islandora community has just wrapped up a very successful sprint dedicated to migrating from 7.x to Islandora CLAW. We at the Islandora Foundation want to give a big thanks to everyone who put in time during this sprint, as well as the organizations that lent us their talent on the company dime. We also want to give a special shout-out to the Metadata Interest Group, whose members collectively put in a ton of time and tackled some intense questions for those who want to use a migration to Islandora CLAW as a chance to do metadata cleanup. Over the course of two weeks, we managed to accomplish a lot. As of right now you can:
  1. Migrate over objects based on content type
  2. Migrate ALL the datastreams (except AUDIT, which is a special case)
  3. Extract metadata from any XML datastream and make it a Drupal field
  4. Model authorities such as people, organizations, and subjects
  5. Convert MODS to CSV using Cara Key's (LSU) XML2CSV tool
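To give a feel for what items 3 and 5 involve, here is a minimal sketch in Python that pulls a few values out of a MODS record and flattens them into a CSV row. It is not the XML2CSV tool or the migration tool itself. The column names, the XPath mapping, the "|" separator for repeated values, and the sample record are all made up for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Standard MODS v3 namespace.
MODS_NS = {"mods": "http://www.loc.gov/mods/v3"}

# Hypothetical mapping of CSV column name to an XPath relative to <mods>.
# A real migration would choose its own columns and paths.
FIELD_XPATHS = {
    "title": "mods:titleInfo/mods:title",
    "creator": "mods:name/mods:namePart",
    "subject": "mods:subject/mods:topic",
}


def mods_to_row(mods_xml: str) -> dict:
    """Flatten one MODS record into a dict of column -> joined values."""
    root = ET.fromstring(mods_xml)
    row = {}
    for column, xpath in FIELD_XPATHS.items():
        values = [
            el.text.strip()
            for el in root.findall(xpath, MODS_NS)
            if el.text and el.text.strip()
        ]
        # Repeated elements are joined with "|" (an assumption, not a standard).
        row[column] = "|".join(values)
    return row


def rows_to_csv(rows) -> str:
    """Serialize flattened rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(FIELD_XPATHS), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample record, for illustration only.
SAMPLE_MODS = """<mods xmlns="http://www.loc.gov/mods/v3">
  <titleInfo><title>Harbour at dusk</title></titleInfo>
  <name><namePart>Example, Pat</namePart></name>
  <subject><topic>Harbors</topic></subject>
  <subject><topic>Photographs</topic></subject>
</mods>"""

if __name__ == "__main__":
    print(rows_to_csv([mods_to_row(SAMPLE_MODS)]))
```

A flat file like this can be easier to review and clean up in a spreadsheet before the values are mapped to Drupal fields.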
There's still some work left to do, though. On the horizon for the near term, be on the lookout for:
  1. Migrating the AUDIT datastream
  2. Modeling more/different types of authorities
  3. Examples of extracting authorities from FOXML
  4. A workflow for those who want to use OpenRefine to reconcile linked data authorities during the migration process
Moving forward, this is an excellent chance for people to try out the tools we're developing and point them at their existing repositories. Our migration tool, originally developed by Jared Whiklo (University of Manitoba), is available on GitHub. And if you want to give modeling authorities a go, check out our new controlled_access_terms module, which was made by Seth Shaw (University of Nevada Las Vegas). If you have feedback, issues, or questions, please feel free to create an issue or post a message on the mailing list. Here's a full list of all the people and organizations who helped make this once-considered-impossible feat a reality:
  • Benjamin Rosner - Barnard College, CU
  • Pat Dunlavey - Born-Digital
  • Andrija Sagic - Library "Milutin Bojic"
  • Ann McShane - Library Company of Philadelphia
  • Cara Key - Louisiana State University
  • Jason Peak - Louisiana State University
  • Jonathan Green - LYRASIS
  • Rachel Leach - Mount Holyoke College
  • Mark Jordan - Simon Fraser University
  • Adam Soroka - Smithsonian Institution
  • Rachel Tillay - Tulane University
  • Pete Clarke - University College Dublin
  • Jared Whiklo - University of Manitoba
  • Mike Bolam - University of Pittsburgh
  • Seth Shaw - University of Nevada Las Vegas
  • Paul Pound - University of Prince Edward Island
  • Rosie Le Faive - University of Prince Edward Island
  • Nat Kanthan - University of Toronto Scarborough
  • Marcus Barnes - University of Toronto Scarborough
  • Carolyn Moritz - Vassar College
Thanks to everyone involved! And if you missed out on this sprint, don't fret. We'll be holding another Islandora CLAW community sprint later this year after Islandora 7.x-1.12 is released.