Workflow for digitisation projects

From Digipedia

Jump to: navigation, search
How useful is this article to you?
1 (not useful) - 5 (very useful)
Current average rating: 0
Please login or create an account to rate this article

Contents

OVERVIEW


This briefing document describes how to track workflow within a digitisation project. It is the UKOLN QA Focus paper Documenting Digitisation Workflow and is licenced under a Creative Commons Attribution-Noncommercial-Share alike licence

Digitisation is a production process. Large numbers of analogue items, such as documents, images, audio and video recordings, are captured and transformed into the digital masters that a project will subsequently work with. Understanding the many variables and tasks in this process – for example the method of capturing digital images in a collection (scanning or digital photography) and the conversion processes performed (resizing, decreasing bit depth, convert file formats, etc.) – is vital if the results are to remain consistent and reliable.
By documenting the workflow of digitisation, a life history can be built-up for each digitised item. This information is an important way of recording decisions, tracking problems and helping to maintain consistency and give users confidence in the quality of your work.

Quality Assessment in the digitisation workflow: flowchart


What to Record


Workflow documentation should enable us to tell what the current status of an item is, and how it has reached that point. To do this the documentation needs to include important details about each stage in the digitisation process, and its outcome.

  • What action was performed at a specific stage? Identify the action performed. For example, resizing an image.
  • Why was the action performed? Establish the reason that a change was made. For example, a photograph was resized to meet pre-agreed image standards.
  • When was the action performed? Indicate the specific date the action was performed. This will enable project development to be tracked through the system.
  • How was the action performed? Ascertain the method used to perform the action. A description may include the application in use, the machine ID, or the operating system.
  • Who performed the action? Identify the individual responsible for the action. This enables actions to be tracked and identify similar problems in related data.

By recording the answers to these five questions at each stage of the digitisation process, the progress of each item can be tracked, providing a detailed breakdown of its history. This is particularly useful for tracking errors and locating similar problems in other items. The actual digitisation of an item is clearly the key point in the workflow, and therefore formal capture metadata (metadata about the actual digitisation of the item) is particularly important.


Where to Record the Information


Where possible, select an existing schema with a binding to XML:

  • TEI (Text Encoding Initiative) and EAD (Encoded Archival Description) for textual documents
  • NISO Z39.87 for digital still images.
  • SMIL (Synchronized Multimedia Integration Language), MPEG-7 or the Library of Congress’ METS A/V extension for Audio/Video.


Quality Assurance


To check your XML document for errors, QA techniques should be applied:

  • Validate XML against your schema or an XML parser
  • Check that free text entries follow local rules and style guidelines


Related Digipedia articles


Digitisation

Metadata

XML


Further Information


Encoded Archival Description

Dublin Core Metadata Initiative

MARC Standards

MPEG- 7 Standard

Synchronized Multimedia

TEI Consortium

Three SGML Metadata Formats: TEI, EAD, and CIMI


[[Role::newcomer]] [[Role::project manager]] [[Goal::managing]] [[Goal::project]] 
[[Goal::planning]] [[Level::basic]] [[Level::medium]]
Home page icon A to Z icon