The legal deposit of digital spatial data in the United Kingdom Ligue des Bibliothèques Europeénnes de Recherche, Groupe des Cartothécaires de LIBER


TRANSLATE ENGLISH to Français, Deutsch, Italiano, Português, Español!

The legal deposit of digital spatial data in the United Kingdom

Chris Fleet

© LIBER and author
Published from: LIBER Quarterly, the Journal of European Research Libraries, ISSN 1435-5205, Vol. 13(2003). With permission from K.G. Saur Verlag, Munich, Germany
E-mail: c.fleet@nls.uk


Whilst legally the Ordnance Survey agreements discussed here include Trinity College, Dublin, for practical purposes this paper relates to the United Kingdom.

Abstract
This paper critically reviews progress and problems in the uptake of digital cartographic data in UK legal deposit libraries over the last few years. The details and effect of the Voluntary Code, introduced from January 2000, are examined, along with the technical, practical and political reasons for its relatively limited effects. Second, the progress map libraries have made in setting up Ordnance Survey digital data systems are examined, along with the problems that have delayed work and the discussions about obtaining other cartographic data sources. The paper concludes by offering suggestions on possible methods of future progress.

Preserving digital information is becoming an increasingly urgent challenge for both libraries and publishers of books and journals, as the amount of digital information is growing quickly and preservation policies and techniques for this format remain unsettled. The need is pressing. While the costs of long-term archiving are high, the cost of doing nothing would be disastrous.

IFLA/IPA Steering Group, 27th June, 2002.


1. Introduction
Despite its importance for future generations, the last few years have seen only modest progress in obtaining digital data in UK legal deposit libraries (LDLs). Much has been discussed, and basic systems and agreements have been established, but progress has been limited by a number of political and technical problems. This paper attempts to set out what progress has been made and the various difficulties, from the perspective of the National Library of Scotland's involvement in the work, although I hope my LDL colleagues will not find too much to disagree with in what follows.

2. United Kingdom Code of Practice for the voluntary deposit of non-print publications
The United Kingdom continues to lag behind many European countries which, during the 1990s, enacted legislation for the deposit of digital data (Denmark, 1999; France, 1994; Latvia, 1997; Norway, 1990; Sweden; 1993. See Millea, 2001; Fleet, 1999). Although some of these agreements were voluntary (as in the Netherlands), and where they were compulsory often only covering fixed form or offline media, rather than dynamic online datasets, infrastructure and mechanisms were usually established to manage and archive new digital publications. From the mid-1990s the UK's Working Party on Legal Deposit, under the Chairmanship of Sir Anthony Kenny, made recommendations to the Department of National Heritage, subsequently the Department for Culture Media and Sport (DCMS). However, it was not until September 1999 that the Code of Practice for the voluntary deposit of non-print publications was finalised, which came into effect from 4 January 2000.

As the name implies, the Code of Practice is voluntary and there is no legal obligation on publishers to comply with it. It only covers microform and offline electronic products which are primarily text-based or which are intended as information rather than entertainment products, and in addition, there are further exclusions. Deposit is not required if publication substantially duplicates the content of a print publication from the same publisher already deposited (which applies to several large cartographic publishers), if the publication is published for private internal use, or if it falls into various not-for-deposit categories, such as computer software, games, or film and video. It only covers publications distributed in the United Kingdom, or those first published in the United Kingdom.

The decision as to whether to deposit one copy for the British Library or multiple copies for the other legal deposit libraries is left with the publisher. (Although the National Libraries of Wales and Scotland can and have requested additional copies of items with significant Welsh/Scottish content/relevance). The result has therefore been that the British Library has received the largest share of publications. For the first year (January to December 2000) the British Library received 365 monograph offline electronic publications (mainly CD-ROMs), and 235 serial titles, comprising 750 issues. The Agent for the Copyright Libraries received 342 items on CD-ROM or floppy disk (Joint Committee on Voluntary Deposit, 2001). By August 2002, the British Library reported that over 1,000 monographs and 850 serial titles had been received under the Code (Byford, 2002). Within NLS, there have been about 250 items received per year under the scheme, with a total of 539 items received by March 2002. With no national bibliography or centralised recording of electronic publications, it is difficult to know what proportions these represent of the total output, but the general consensus is that it is a relatively low one (Joint Committee on Voluntary Deposit, 2002).

Given the differences between publishers' and libraries' objectives, there have several other problems. The publishers have been keen to ensure exemptions and exceptions, particularly for high-value items, and to highlight the effects of deposit on their commercial objectives. Whilst the original code allowed printing of electronic publications only to the maximum permitted under fair-dealing legislation (eg. one chapter or one journal article and less than 5% of item), and specifically excluded electronic downloading and saving, the publishers were keen to tighten this up, or negotiate arrangements on a title-by-title basis. Although all publishers were encouraged to supply metadata about their publications, on specially designed forms (see Appendix 1) relatively few of these have been filled in, with the result that library staff have had to do more work in processing items. With some justification, the publishers have also been concerned about the libraries' long-term strategies for archiving their publications, and their proposals for a secure network between the libraries. In that the publishers are well-represented in the Joint Committee on Voluntary Deposit, with members from the Publishers' Association, the Association of Learned and Professional Society Publishers, the Periodical Publishers Association, and the Directory Publishers Association, sub-groups have been set up to investigate these problems in order to make progress.

In that the Code allows for material to be deposited in one institution, and networked securely to other legal deposit libraries, there has been much ongoing work on building a secure network between the libraries. The proposed network would be based on a distributed architecture, with multiple digital stores, and using thin-client technology (Citrix MetaFrame and Microsoft Internet Explorer) to reduce volumes of data being transferred. Unfortunately, despite a well-prepared case, both bids to the Treasury's Invest to Save budget in 2000-1 were unsuccessful, and since then work has been ongoing on a "proof of concept project". A practical mechanism for networking electronic items therefore still looks some way away from implementation.

There has also been ongoing work on the Digital Library System (DLS) to store, preserve and retrieve digital publications within the British Library. In the autumn of 2000, the BL signed a contract with IBM for the supply and development of this system, to be based on standard IBM hardware and software and using the Open Archive Information System (OAIS) model. There has been much work on metadata to be included in such as system, and discussions with other national systems, but uncertainties and problems over its development.

Given the limitations of the voluntary code, and the belief that its purpose was primarily as a pilot phase, there have also been continuing attempts to introduce legislation for the compulsory deposit of electronic publications. The DCMS put forward proposals for such legislation in November 2000, but the bid was not successful. From 1998, such new legislation requires a Regulatory Impact Assessment (RIA), noting in particular the costs of compliance to business, and the BL's provisional Assessment was not approved by publishers. Since then there has been more extensive work, and a contract was awarded in 2002 to Electronic Publishing Services Ltd., to conduct a full RIA. More recently still, there have been efforts to introduce new legislation through a 'handout bill' (in effect, a government-sponsored private-members' bill) in the autumn of 2002. Even if such proposals are successful, it unlikely to take effect before 2004 (Bury, 2002).

Within the NLS Map Library, it is difficult to claim that more than a handful of electronic cartographic products have been received as a result of the Code. Of course, many cartographic publishers maintain and sell online datasets (not covered by the code) or supply paper print publications (Hydrographic Office, Automobile Association, Ordnance Survey small-medium scale mapping) in lieu of electronic products. Given the lack of current expertise within NLS over archiving electronic publications, the relatively low quantities of incoming electronic items is arguably to be welcomed, and it is for this reason that NLS still prefers to acquire conventional paper mapping over its electronic equivalents. However, such a situation undoubtedly means that some current electronic products are being lost to future generations.

3. Map Libraries' Negotiations
Given the delays and limitations of the voluntary code, the six United Kingdom and Ireland legal deposit library map libraries (Bodleian Library Oxford, British Library, Cambridge University Library, the National Libraries of Scotland and Wales, and Trinity College Dublin) have actively sought agreements for the supply of cartographic digital data. These relate to three main publishers: Ordnance Survey (Great Britain), Ordnance Survey Northern Ireland, and Experian Goad. Whilst there has been considerable progress with Ordnance Survey (GB) data, there has, regrettably, been very little progress on OSNI and Experian Goad.

3.1 Ordnance Survey (GB) Digital Data
As described in Fleet (1999), there were long and protracted negotiations with Ordnance Survey over the supply of their digital data to legal deposit libraries. There were also several technical issues to resolve, such as the customisation of software to view the OS data, the need to archive and convert the Land-Line data, and the need to agree developments collectively between libraries, OS and the University of London Computer Centre (ULCC), the BL's preferred electronic archive. Nevertheless, the libraries were very grateful that OS were prepared to send their data to the LDLs, albeit with strictly controlled usage conditions and security agreements, given the absence of compulsory deposit legislation. The succeeding problems and delays in implementing the system have been primarily due to difficulties within libraries, and not OS.

On the positive side, there has been progress on a number of fronts. Whilst the National Library of Scotland has run a pilot viewing system for the public from 1998, the other libraries have subsequently made progress, and the OS Viewer system has been set up to varying degrees in all other libraries during 2002. Several design modifications (tweaking certain functions) were agreed between the LDLs during 1999 and successfully implemented through Dotted Eyes, also (in the process) correcting several software bugs and problems. Although the delays to signing a security agreement between OS, BL and ULCC until August 2000, resulted in a backlog of annual snapshots, progress during 2001-2 has now virtually cleared the backlogs, and Land-Line data for 1998, 1999, 2000, and 2001 has all been converted and passed on to LDLs. As the original agreement covered just Land-Line topographic data (with no contours and limited height information), the OS agreed in August 2000 to supply LandForm PROFILE data to libraries at no additional charge. During 2001 there was progress in re-customising the viewer to incorporate and display this height information. Finally, the various security agreements between the LDLs and OS were finalised during 2000-1, based on the BL/OS agreement of September 1999, the Viewer was licensed for LDLs to use, and a support agreement was drawn up for the maintenance and troubleshooting of the software.

However, the progress has not been smooth or speedy, due to three connected factors:

3.2 Ordnance Survey of Northern Ireland (OSNI)
As with OSGB, OSNI have moved to producing digital data in place of large-scale paper plans, and the LDLs were also interested (to differing degrees) in obtaining this digital data. Given the similarities in format (both used a tile-based National Transfer Format) and concerns over use, it was suggested to OSNI that we might use the OSGB arrangements and security agreements as a suitable template for receiving their data. With the delays in finalising these agreements, this proposal could not be put formally to OSNI until 2001, but they responded very positively to say that, in principle, they were willing to supply their data to the libraries, and were keen to allow sample data to be tested. Unfortunately, the practical investigation of the data and conversion parameters, as well as tweaking the Viewer to use their data required central BL IS staff time, which sadly has not been forthcoming, putting progress on hold during 2002.

3.3 Experian Goad
Large-scale Goad Fire Insurance Plans date back to the late 19th century for several British towns, and provide unique information on business premises, retail outlets, and industrial units through time. Many LDLs received these comprehensively from the late 1960s, with updates every year or two years, for ca. 1,200 UK town centres. Although the maps were supplied under legal deposit as (light-sensitive) dyeline prints, and therefore of low archival stability, the plans were a useful, more frequently updated addition to Ordnance Survey large-scale mapping for town centres. However, in 1998 Experian Goad informed the Copyright Libraries' Agent that as the plans were produced digitally, as printouts on demand, they were not conventionally published as copyright maps. and therefore should not be supplied to LDLs.

In 1999-2000 the LDLs response was to focus on getting OSGB data sorted out, with the hope of then arranging something suitable with Experian Goad, such as obtaining digital Goad plans. The plans themselves can be read through MapInfo, the software used for viewing the OS data. Although there was enhanced querying and customisation facilities with the digital format, there were substantial costs of acquiring the data, and royalties for printouts. As no progress was made, and the Goad plans were not supplied to LDLs from 1999, one LDL mentioned in 2001 that they had acquired a subset of Goad plans to at least continue their paper archive. Amongst other things this highlighted that the plans themselves did seem to carry a similar status to conventional paper publications, and therefore a formal letter was sent via the Copyright Agent in May 2002 to request again that Experian Goad deposit these plans.

4. Conclusions
With publishers' continuing and growing concerns over use of their data in libraries, and the reduction this has had to their revenue, relatively few high-value items are being sent to libraries. It seems that real progress in digital deposit will only be made through compulsory legislation and/or with negotiated arrangements over usage and access. The following general conclusions can also be made:

Whilst the LDLs have co-operated well and have shared certain costs, their conflicting priorities, inadequate distributed expertise, and the expense of duplicated systems within LDLs have all caused problems. From the perspective of publishers and the BL, a centralised online supplier, along the lines of EDINA or MIMAS , would arguably be a more cost-effective, efficient way of managing electronic publications, able to keep pace with technology, and easier to manage.

Bibliography