Data Management Policy - An Interview with Paul Taylor

Dr. Paul Taylor works at the University of Melbourne and has just finished a 2 week secondment in the UK with the JISC-funded EIDCSR (Embedding Institutional Data Curation Services in Research) project based in Oxford. This is an approximate transcript of a quick 5 minute interview between Paul and Neil Grindley (JISC Information Environment Programme Manager)

NG
Hi Paul, thanks for sparing the time out of a very busy schedule … what role do you have in the EIDCSR project?

PT
Thanks Neil … I’m here to help them come up with a draft policy for the management of research data and records. It’s something we’ve had in place at the University of Melbourne since 2005 and we’ve just completed a revision of the policy to hopefully help make it a little more useful for researchers.

NG
Tell us a little bit more about how that policy has been developed at the University of Melbourne and the reactions to it from researchers and data managers.

PT
As I said, we’ve had policy in place since 2005 and early this year we were asked to work out how compliant we were with it, on the basis that if you have a policy and no-one pays any attention to it, its probably not much use keeping it there! Not surprisingly, we found out that most people weren’t compliant and also didn’t really know that the policy was there. We’re hoping that was the reason that they weren’t compliant rather than any sort of animosity against policies in general - but that’s still to be determined.

We reviewed the policy for two reasons: firstly to try and make it of more use to researchers (… there’s limits to that because when you are writing a policy to go across the institution, it has to contain really high level principles about the management of research data. If you get too specific you rule large populations out and then people pay even less attention to it than they did before). Secondly, its to get some attention and a bit of refocus on the data management area. There are a lot of things happening at the university at the moment in terms of the services that the university intends to provide for it’s researchers and some other changes in the Australian environment. We’re hoping to lock the high-level principles away in policy documentation and focus on keeping the guidance, information and support materials up to date and relevant for researchers.

NG
The sustainability of keeping that guidance and information for researchers up to date is a real issue. Capturing their feedback and working it back into future iterations of those materials (and ultimately the policy documentation) is a desirable outcome but also a big challenge isn’t it?

PT
Yes, it is.

NG
How do you think that the policy that you’ve developed in Melbourne transposes to the University of Oxford?

PT
That’s a good question … one of the things that we’ve learnt from the 2005 version of the policy is that its not enough to have the central policy on its own. There needs to be some kind of localisation of the policies and so with this new version of our policy we’ll be asking faculties to come up with their own enhancements so that it makes more sense to their researchers, and then probably get departments to do the same thing. I’d imagine the same sort of system could work at Oxford but it would be a little more complex with the number of people that would need to be involved in coming up with these localised versions of the policy. The hope is that there will be a trickle down effect from the high-level policies which have a practical influence on the way that researchers go about managing data.

In the meetings that I’ve had since I’ve been here, there have been some excellent examples of data managers and data management researchers (I guess you’d call them) who are working closely (one-on-one) with researchers who have come up with some excellent and novel solutions. I think the more that that can happen - a sort of resourcing at the coal face - then the more likelihood there is of high level principles trickling down to meet some of the very local one-on-one researcher-based developments. At that stage, perhaps there would be a general improvement in the management of research data across the institution.

One of the things I’ve heard a lot from people is the need for it to be a federated system. A lot of the departmental research groups have come up with their own systems for managing their own research data. Anything new that is provided centrally from the university has to try and complement those processes rather than take them over. That wouldn’t work well here (in Oxford) and it wouldn’t work in Melbourne. It would tend to antagonise people rather than improve the situation.

NG
Yes … that principle of embedding existing processes and workflows into broader policy initiatives is an important concept for institutions grappling with these kinds of issues at the moment. Thanks very much Paul.

PT
Thanks

University of Melbourne - Policy on the Management of Research Data and Records (2005)
http://www.unimelb.edu.au/records/research.html

Review of Policy on the Management of Research Data and Records (2009)
http://research.unimelb.edu.au/integrity/conduct/data/review

EIDCSR Project (Embedding Institutional Data Curation Services in Research)
http://eidcsr.oucs.ox.ac.uk/

#res3

iPres 2009 - Preservation Infrastructure Track

In San Francisco at iPres sitting in the preservation infrastructure track.

Stephen Abrams (CDL) is telling us about micro-curation services. Lots of clear categorisation of types of services that institutions might require. Currently talking about storage requirements. Provide for safety through redundancy, meaning through context, utility through service. Rattling through too fast to capture detail.

Q. How do CDL services compare with iRods?

A. i-Rods are all part of one controlled environment. CDL Micro-services can run as small discreet functions

Pam Armstrong and Johanna Smith from Library and Archives Canada.

They have a trusted Digital repository project that is running from 2008-2010. They are showing a value management framework. The first concern is ’significance’. They are looking at government records and are trying to determine which records are important even before they arrive at the archive. Talking about a filtering process. Trying to deal with web 2.0 issues and are working on some guidelines.

They have established a records management task force with a high level of government support. A directive on recordkeeping is linked to a management accountability framework. If departments are found to be wanting with their records management function, they are denied the right to delete records. Good stick. There are functional requirements for EDRMS based on ISO. There is a proposed shared service for EDRMS for government info in Canada. They have built open source software eRTA for records managers. They have been working on metadata core set. They are using MODS and MARC and the info is discoverable by public. They have got to their summary already … my o my - these talks are quick!

The lessons learnt include the usefulness of the mandatory instrument that has consequences (see above).

Q. do you accept all formats?

A.  No, they have acceptable formats. Can’t do all formats.

Q. How implemented is all of this?

A. The implementation is uneven. All the instances across govt are implemented inconsistently. They have got lots of work to do to bring the legacy information into line.

Robert Sharpe - Tessella

Representing PLANETS consortium. Title is “Are you Ready? Assessment of readiness of organisations for Digital Preservation”. (I’m interested in this talk. Wondering how this matches up with JISC-funded AIDA project). They did a survey. To establish whether people were ready to use Digital Preservation solutions. The target group for PLANETS is national libraries and archives. There are 96 of these in Europe. They also invited any other interested parties to contribute. They got 206 responses. 70% responses from Europe.  They were a diverse community representing a range of roles.

15% digital preservation

16% in general preservation

22% curation

16% IT

also directors researchers data managers etc …

93% aware of DP challenges.

17% had not considered solutions.

52% did not have preservation policies.

They were 3 times more likely to have a DP budget if they had a DP policy in place. The majority had budgets to do capital activities. DP not really embedded in the institutions that responded still. What needs to be preserved? Stuff in file systems = 77% … many other categories going down to a long tail. National Libraries feel they have almost no control of the formats they have to accept. National Archives however claim high levels of control.

80% of organisations say they have less than 100TB to store in 2009. They think that by 2019, 70% orgs will have more than 100TB and 42% will have more than 1Pb. 85% have a solution or are working on one. They are generally expecting ‘plug and play’ components. That’s the trend and what people are expecting.

What functionality is important? Single most important function was that the repository must maintain authenticity, reliability and integrity of records. 17 different functions cited. Least important function is ‘checks for duplicate items’.Very little agreement on which standards should be used! (surprise surprise!) Of 13 standards on Robert’s chart, PREMIS in the middle in terms of who is using it already.

Summary …

Excellent start on getting DP message out
More work needed on policies and budgets
Wide range of types of digital info from range of sources
Significant quantities of data to preserve
Component-based solutions required
Best practice not yet clear
Early adopters are busy and planning to do more

Q. We are doing a good job with early adopters but what about the wider community. The success factor will be general users engaging with Digital Preservation

A. Yes

Q. The standards you showed, the figures are high for people not even having heard of them!

A. Yes.

End of session

Grant Funding Opportunities

An update on funding opportunities …

This month, November 2008, we will be releasing a Call for projects for grant funding. Outline details are on the Grant Funding Roadmap. UK FE/HE institutions are eligible to bid, with some types of projects restricted to HEFCE- and HEFCW- funded institutions, due to funding streams.

We’re finalising the Call at the moment, but you won’t go far wrong if you start thinking about what you want to do in:
- implementing automated metadata and textmining
- starting up repositories for research data, research papers, learning materials
- networking and enhancing repositories
- preservation in relation to repositories
- short technical projects to improve repository services
- connections between services to support particular disciplines

Bidders will have until January to prepare proposals, and succesful projects will be expected to start by 1st April 2009.

For those of you most interested in supporting research, please note there will also be a Call for projects related to Virtual Research Environments.
If learning and teaching resources are of particular interest, in December there will also be a Call for the forthcoming HEA/JISC Open Educational Content programme.

Date for your diary: Monday 15th December will be a Briefing Day for anyone who would like to come and hear about the funding opportunities related to the Information Environment and Virtual Research Environment Calls. It will be in Central London, probably 10-4. Details will be released soon.

If you’re not based in UK FE/HE, you may be interested in the Funding Roadmap for Invitations to Tender. These are open to anyone, so if you think you have expertise relevant to the sort of issues reported on this blog, then tenders are very welcome.

We will announce the Call on this blog as soon as it is released.

Records Management 2.0

It’s a sorry thing to admit, but I have to say that reading a whole book has become something of a distant memory for me these days! So … nice to be able to put that right by racing through Steve Bailey’s new book from cover to cover. It’s called ‘Managing the Crowd: Rethinking records management for the Web 2.0 world’.

http://www.facetshop.co.uk/mm5/merchant.mvc?Screen=PROD&Store_Code=1&Product_Code=641-1&Category_Code

Records management … a rip-roaring read? Well, I grant you it might not be everyone’s idea of great night in, but for anyone whose waking hours involves thinking about this stuff - I’d recommend you take a look at it.

 Steve’s main contention in the book is that if records managers are not prepared to think outside of their comfort zone, they may be in danger of finding themselves no longer in a profession. Processes and procedures will have moved on and those hanging on to the electronic corollary of earlier paper-based working practices will begin to wonder why no-one in their institution is taking any notice of them anymore.

It ties in quite nicely with the soon-to-be-released JISC PoWR (Preservation of Web Resources) Handbook, a section of which addresses the need for a closer working relationship between Records Managers and those with responsibility for institutional websites, and is a timely alert to those institutions to discover what sort of resources they are in danger of losing if they don’t pay attention to web preservation issues.

JISC PoWR blog: http://jiscpowr.jiscinvolve.org/