Red Island Repository Institute 2011

The Red Island Repository Institute returns!

Cost: $2800.00 (additional non-registrant $1500.00)

This year's Institute will be held between September 17th-23rd. We are planning to have 2 streams this year: Digital Humanities and Science Data. Come and learn about cutting-edge repository technologies with like minded folk on beautiful Prince Edward Island. 

For this year's event, we have exclusively booked Dalvay by the Sea, a resort converted from the Alexander Macdonald mansion (built 1895). We'll be 100m from the ocean, and just 20 minutes from Charlottetown. There are both cottages and rooms available. Registration covers the cost of your room, from Saturday night to the following Friday morning.

Click here for directions.

If you have any questions about this year's conference, or would like to reserve your registration please contact islandora@upei.ca. You can also reach us at (902) 566-0343.

Agenda

Instructors: Bradley McLean, Chris Wilper, Steve Bayliss, Mark Leggott, Bess Sadler, Tom Cramer, Matt Zumwalt, Dean Irvine, Mariana Paredes-Olea, Donald Moses, Kirsta Stapelfeldt (and other members of the IslandoraTeam)

Bradley McLean

Experienced technology team builder, leader, architect, and manager with a strong inclination towards multi-vendor and open source solutions. Successful leadership of large scale consumer website development, knowledge indexing and retrieval development, construction of a data center, and ongoing IT services for 75 users. Interested in entrepreneurial opportunities that include a new technology component.
 

Chris Wilper

Chris is the technical lead for the Fedora repository project and coordinates community participation in the open source development process.

Mark Leggott
I am currently University Librarian at the University of Prince Edward Island, which is Canada's best small university. And it's on an island - how can you beat that? I am also the President and CEO of DiscoveryGarden which is a private company providing a full slate of commercial services for the Islandora/Drupal/Fedora framework and the Founding Director of the Knowledge for All Project, whose goal is to create a public domain database of all scholarly literature.

Bess Sadler
 

Tom Cramer
Tom Cramer is the Chief Technology Strategist and Associate Director of Digital Library Systems and Services for the Stanford University Libraries. In this role, he oversees the technical development and delivery of the full complement of Stanford’s digital library activities, including the digitization, description, discovery, delivery, preservation and management of digital resources that support teaching, learning and research.

Tom is one of the founders of the Hydra Project and an active contributor to Blacklight, open source projects rooted in higher education that deal with digital asset management and discovery. He currently serves as the director of PASIG, the Preservation and Archiving Special Interest and as a steering group member for Open Repositories.  

Matt Zumwalt

Matt Zumwalt is the Founder and Director of MediaShelf, LLC.MediaShelf operates in the field of Digital Asset Managment, but we go much further than that. Matt is an internationally recognized expert on Fedora, a powerful, open source digital asset management system. He coordinates the "Fedora Create" developer community, and he has given public talks on the system in both the United States and Europe.

Dean Irvine
Dean Irvine is an Associate Professor in the Department of English at Dalhousie University. He is the editor of Archive for Our Times: Previously Uncollected and Unpublished Poems of Dorothy Livesay (1998), Heresies: The Complete Poems of Anne Wilkinson, 1924-1961 (2003), and The Canadian Modernists Meet (2005). His edition of F.R. Scott'’s complete poems and translations is forthcoming in the Canadian Modern Poets: Texts and Contexts series (Canadian Poetry Press). He is the author of Editing Modernity: Women and Little-Magazine Cultures in Canada (2008), and is currently completing a book manuscript entitled Variant Readings: Editing Canadian Literature. As general editor of the Canadian Literature Collection/Collection de littérature canadienne (University of Ottawa Press), a new series of critical editions with online apparatus, and director of the Editing Modernism in Canada project, he is engaged in a wide range of collaborative editorial projects in print and digital media.

Donald Moses
Donald Moses is the Collections and eResources Librarian at the Robertson Library, University of Prince Edward Island. His work at the library straddles the traditional and the digital and he manages many of the digitization projects at the Library. Much of that digitization work is transformed as it flows into Islandora – a digital asset management system that leverages Fedora Commons and Drupal.  He works with the Islandora team to test and extend the code they are developing to meet an evolving set of needs, including those of Digital Humanists.
 

Kirsta Stapelfeldt
Kirsta Stapelfeldt is the Repository Manager at the University of Prince Edward Island and manager of UPEI's Islandora project. Through the Robertson Library's Virtual Research Environment Service (VRE service), librarians and developers at UPEI are supporting over 140 repository projects. UPEI's repositories and research environments are developed using the highly-flexible Islandora open-source software stack. Islandora empowers researchers to collaborate with off-site colleagues, store and manipulate data sets, and publish research findings, and is used to house UPEI's local digitization projects. Islandora and the VRE service were founded by University Librarian Mark Leggott, and have inspired innovation in funding and management, transforming the role of librarians and the library at UPEI. Kirsta is a graduate of Dalhousie's MLIS program, with a background in project management, online information literacy, knowledge translation, communication, and university teaching.

Agenda

This year RiRi will provide instruction based on 2 core collections as examples: science data (herbarium) and a digital humanities collection (audio and/or manuscripts), with the creation of an Institutional Repository collection as an optional outcome in the hands-on sessions. We will also tie DuraCloud through the full agenda as a core component of the Fedora ecosystem.

  • Saturday Sept 17

    • Half Day (Afternoon), Intros and Fun Time (Lawn Bowling & Beach Strolling)

    • Opening Dinner (Lobster and Other Seaside Goodies) @ 7:30 (Bar opens at 5:30)

  • Sunday Sept 18

    • 8:30 - 9:00    Breakfast & Welcome (Mark)

    • 9:00 - 10:30    DuraSpace/Fedora Project Update (Bradley)

    • 10:30 - 11:00    Break

    • 11:00 - 12:30    Fedora Architecture & Overview (Chris)

    • 12:00 - 1:30    Lunch

    • 1:30 - 2:30    Fedora Application/Solution Bundles Landscape (Mark/Tom/Bradley-Others)

    • 2:30 - 3:00    Break

    • 3:00 - 4:30    Project Examples & Approaches (various)

    • 6:00 - 11:00    Dinner and Musical Entertainment

  • Monday Sept 19

    • 8:30 - 9:00    Breakfast & Welcome (Mark)

    • 9:00 - 10:30    Content Modelling and Disseminators (Chris/Steve)

    • 10:30-11:00     Break

    • 11:00 - 12:30    The Search Layer: GSearch, Mulgara, Solr, Lucene (Bess/Alan)

    • 12:00 - 2:00    Lunch

    • 2:00 - 2:45    Data Intensive eResearch (Mark)

    • 2:45 - 3:30    Digital Humanities Collections (Matt/Dean/Susan)

    • 3:30 - 4:00    Break

    • 4:00 - 5:00    DuraCloud: Integrating Cloud and Repository Services (Chris/Bradley)

    • 6:00              Dinner Starts

    • Evening        Free Time

  • Tuesday Sept 20

    • 8:30 - 12:00      Application Deep Dive #1: Hydra and Islandora (Hydra and Islandora Teams)

    • 12:00        Lunch

    • 1:30 +            Deep Dive Play Time

    • 6:00                 Dinner Starts

  • Wednesday Sept 21

    • 8:30 - 9:00    Breakfast & Welcome (Mark)

    • 9:00 - 9:45    Linked Data, Mulgara, RDF (Chris/Steve)

    • 9:45 - 10:30    Break-Outs

      • DH Breakout: Content Models (Dean/Matt/Donald)

      • SD Breakout: Content Models (Mark/Steve/Chris)

    • 10:30 - 11:00     Break

    • 11:00 - 12:00    

      • DH Breakout #1: Linked Data (Susan/Dean/Donald)

      • SD Breakout #2: Linked Data (Steve/Mark)

    • 12:00 - 1:00       Lunch

    • 1:00 - 2:30    Metadata, Forms and Crosswalks (Donald/Kirsta)

    • 2:30 - 3:00    Break

    • 3:00 - 4:30    Workflows - Data Transformations and Processes (Matt)

    • Evening        Free Time in Charlottetown
              Transportation will be Available to and from Downtown 

  • Thursday Sept 22

    • 8:30 - 9:00    Breakfast & Welcome (Mark)

    • 9:00 - 10:00    DuraCloud Hands On (Bradley/Chris)

    • 10:00 - 10:30    Break

    • 10:30 - 12:00    Trusted Repositories and Security: XACML & FeSL (Steve)

    • 12:00 - 1:00   Lunch

    • 1:00 - 2:30    Lightning Talks (Everyone)

      • List of Talks

    • 2:30 - 3:00    Break

    • 3:00 - 4:00    General Q & A (Everyone)

    • 4:00 - 4:30    Closing Remarks and Discussion (Mark)

    • 6:30 +        Farewell Dinner

2011 Events

 

Upcoming Events:

April

May

  • U of Toronto Conference – Durable Data, Disintegrating Documents & Digital Dumpsters: The Academic Library in 2020
  • Oracle PASIGRedwood
  • UNT Open Access Symposium – Islandora and Research Support at the University of Prince Edward Island
  • CLA (Canadian Library Association) – Research Data BootCamp Pre-Conference Event, trade show booth

June

Islandora Camp 2010 Agenda (July 22-23)

As an "unconference," you won't find a traditional agenda. But here are some of our planned speakers:

Thursday

Registration/Breakfast begins at 8.00 am

Followed at 9.00 am by 

Boris Mann, Entrepreneur & Digital Strategist

BImage removed.oris Mann has been blogging and using web tools for over 10 years. What began as an exploration of digital tools became a career building startups and web solutions.

Boris co-founded Bryght, the first commercial company focused on the Drupal open source content management system in 2004, and helped grow the Drupal community to a worldwide ecosystem of businesses and developers.

Boris has a passion for engaging communities, from distributed open source teams to the local innovators in Vancouver. He is one of the founding organizers of Northern Voice, the first Canadian blogging conference. He also is active in many developer communities, helping put on BarCamps, DemoCamps, and the Open Source CMS Summit on multiple continents. Boris is a Permanent Member of the Drupal Association. He holds a BSc in Computer Science from the University of Victoria, where he went after a coin flip made him choose between university and becoming a professional chef.
 

Umar Qasim (Dalhousie University)

Using Recommendation System to Reduce Information Overload in Institutional Repositories

Mark Roy (University of Saskatchewan)

Introducing the Content Model Editor

Dan Gourley

Developing the Drupal/Fedora Module

Jen Whitney (University of New Brunswick)

UNB and Drupal/Fedora

Jerry Pan

Nasa Earth Sciences Data

Moiz Bhukhiya

The Apiary Project 

Friday

Islandora Roadmap: Brainstorming and reports

Martin Boucher (Université de Montréal)

Synergies National Platform with Fedora and Drupal

Metadata Schemas

Metadata Schemas in Use/Of Interest to the Islandora Project

This page provides links to the metadata schemas we are using in our VRE projects, or provide examples models for our work.

  • Administrative/General
  • Library/Bibliographic
  • Images
  • Sound
  • Video
  • BioScience/Chemistry
    • CML (chemistry)
    • Critter (in-house schema)
    • EMBL (gene sequences)
    • EML (ecology)
    • MzXML (mass spec data)
    • NCBI (gene sequences)
    • SNOMED-CT (medical terminology)
  • Business
    • UBL (business processes)
    • XBRL (business)
  • Humanities

General Fedora-Drupal in First Islandora

General Fedora-Drupal

This is a general description of the UPEI Fedora/Drupal system and how we are using bth products, on their own and integrated. This is based on a description created by Paul Pound on March 28, 2008. *

General System and Components

  • Fedora 2.2.1 with fgsearch 1.1, Kowari (xx) and Postgres (XX).
  • Drupal 5.x, php5, and mysql (XX).
  • Lucene (XX), curl (XX)
  • api-a (api-a includes methods to mostly consume the objects) is locked down so authorization is required.
  • api-m (mangement api used for ingest etc.) needs authorization as well and can only be used from specific ip addresses. Both of these settings are pretty common but some users probably have api-a more open and rely on the xacml security policies.

We have drupal on one box, Fedora on another and the databases are on a third.

Search and Display

Currently we send rest type calls to Lucene via Curl. The results are provided in xml that we display using an xslt to transform the results to html.

  • Collection listing: we send an itql query to Fedora's resource index (in our case Kowari as mptStore does not understand itql) via curl/rest. We receive xml results and for the modules in test we are parsing the xml in php but will be switching to xsl. This will give us much more flexibility as we can store the xsl with the collection object.
  • Datastream listing: we send a soap call to Fedora that returns an array which is displayed using php code.
  • Metadata display: we are grabbing the QDC or DC datastream from the object via soap. Since the returned datastream is xml we are using an xsl to transform the stream to html.

Editing and Ingest

  • To edit the metadata we grab the QDC/DC xml stream and parse it to create a html form. In our standalone app we use an xsl, but our newer Drupal-based technique uses Drupal's form api: after parsing it in php we pass a php array to Drupal which builds the form.
  • For ingest each collection will have a collection_policy datastream (if it doesn't have one we can fallback to a standard that is part of the module). The collection_policy defines what content_models are allowed as part of this collection. A content_model in our case defines what type of file can be ingested and what to do to that file (such as create thumbnail etc).
  • All the management functions use soap to add, purge, modify, ingest etc.

Drupal

  • We are mapping drupal paths/menus to php functions.
  • We use the form api for the forms.
  • We use IMCE to upload the files. By using IMCE we can give file size quotas, and storage limits to drupal roles. We can also limit what type of files they are allowed to upload using IMCE. IMCE is the same widget tinyMCE uses to upload files.
  • We rely on clean urls and the drupal file system being public. Initially we tried to support normal urls and clean urls but it gets to hard to maintain when there are so many paths to something.
  • We have several permissions you can configure which include add fedora datastreams, edit fedora metadata, ingest new fedora objects, purge objects and datastreams, and view fedora collection.
  • Specific roles can have a range of permissions so you can define who can do what.
Tags

Importing Refworks into Islandora

The Islandora IR has been setup to use exported refworks citations as the file type for ingest.  Using refworks gives us a few advantages. 

  • We can export citations from many databases providers.  This gives us complete and accurate data to start with (most of the time).
  • Global editing of refworks records.  This is usefull for adding what we call the university name (username) and departments to the refworks record. 

Setting up a RefWorks account for this purpose:

  1. Create an account.
  2. Use the “Customization” feature to re-label User1 to “AuthorID” and User2 to “Department”.  The “AuthorID” will be the same as the username that the faculty members will use to login to the repository to manage their own documents. The “Department” will be the same as the “roles” used in Drupal/Fedora to provide permission control and  provide the ability for users to browse publications by academic department. Often the departments can be retrieved from LDAP and stored as roles in Drupal.  The specific department code should match the “LDAP” provided codes or the roles created in Drupal.


Method 1 – using library databases

  1. Search one or more of your library’s citation databases (e.g. PubMed, EBSCOHost, Proquest, etc.) to find the citations.
  2. . Use the database’s “export to RefWorks” built-in functionality if available. Some databases offer instead the ability to save the citations into a tagged format file which you will need to import into RefWorks using RefWorks’ “filters” (using References – Import).
  3. Once the citations are in RefWorks, you may find some citation data has been imported into the wrong field and needs to be fixed by hand. An example is conference proceedings, which often require moving information to different fields.
  4. Step 4. Edit the records (individually or using global edit depending on how you handled the importation) to add the AuthorID and Department info to each.  You can add multiple authors or departments using semi colons to seperate the values.


Method 2 – manually entering citations into RefWorks


In RefWorks, click References – Add New Reference, select the “Ref Type” to make sure you get the right template for the type of document, then fill in the blanks as appropriate. Make sure you include the AuthorID and Department data.  You can add multiple authors or departments using semi colons to seperate the values.


Final steps – exporting records from RefWorks to the Repository


Regardless of which method you use to get the records into RefWorks, the final step in Refworks is to pull together the new records to be imported into the repository into a single folder.  Use the RefWorks option “References – Export” to select that folder, choose “RefWorks XML” as the format, and export the set. Use your web browser’s “Save Page As” function to save the exported records to a text file for later importation into the repository.

Importing the Refworks file into Islandora

  1. Login to Islandora
  2. Click Digital Repository - located in the left hand column
  3. Click on the Institutional Repository Collection - on the content section of the page
  4. Click on Collection Description to expand the collection's fieldset
  5. Click on the blue ingest folder
  6. Leave Refworks selected and click Next
  7. Choose Browse and find the exported Refworks file you just saved.
  8. Click Next

Islandora will parse the Refworks file and create a citation object for each reference element in the file.  Users with the appropriate roles can then add the full text to the citations.

Islandora Receives Major Grant Funding

Islandora was the recipient of major research funds this month, coming from 2 funded projects under ACOA's AIF (Atlantic Innovation Fund) initiative. DiscoverySpace, led by Mark Leggott, is a 3-year, $2.5 million project to build on Islandora's unique approach to research collaboration and data stewardship with the Virtual Research Environment (VRE) framework. The project will work with the larger open source community to build a rich ecosystem of tools for all research domains. In addition to enhancing the core Islandora software, the project will release dozens of specialized Sprouts, providing rich collaborative environments for a wide range of disciplines. The project will also work toward the development of a service entity that will provide support, hosting and custom development for external partners.

Image removed.

Three innovative projects by UPEI researchers were awarded AIF funding on January 26. Shown here are recipients Mark Leggott, University Librarian; Dr. Don Reynolds, Dean of AVC, accepting on behalf of Dr. Chris Riley; and Dr. Anne Muckle, Clinical Bacteriologist. The Veterinary Laboratory Quality Assurance Program Expansion (VLA-QAP) is a project of the Atlantic Veterinary College (AVC) led by C. Ann Muckle, AVC’s Lab Director of Diagnostic Services. AVC will develop software and systems for web-based delivery of impartial, objective, verification program for veterinary labs to verify and assure diagnosis. The project will use a special version of the Islandora software to deliver their services and resources in a unique online environment. As with the broader Islandora software, applications developed for this project (such as the virtual microscopy system) will be released as open source.