Expertise for the non-specialist: delivering forest-related information to non-foresters

Chair and organiser: Roger Mills, Oxford University Library Services
Co-ordinator, IUFRO Research Group 6.03.00 Information Services and Knowledge Organization


Page 1:

Expertise for the non-specialist: delivering forest-related information to non-foresters

Chair and organiser: Roger Mills, Oxford University Library Services
Co-ordinator, IUFRO Research Group 6.03.00 Information Services and Knowledge Organization

Page 2:

Forest Research

Projects over many decades have produced a wealth of data, published and unpublished
Now finding uses in other disciplines:
  environmental management
  climate change assessment
  biodiversity conservation
  economic planning
  economics
  politics
  social science
  law
Easy to access with modern technologies
Data frequently needs processing or harmonisation to make it usable
Raises many issues of intervention, explanation and training which fall partly or wholly on the library and information sector

Page 3:

Today’s workshop

Highlight some of the issues
Present case studies
Discuss what we can do to ensure that users unfamiliar with the forestry subject area can make best use of available data
Make a ‘wish list’ for future action – in IUFRO, IAALD, other fora

Page 4:

Trees grow slowly

Not like cabbages – generations needed for controlled study
No equivalent to Rothamsted experiments – started in 1843 and still going
Majority of forest studies carried out for a particular end and data collection not primary purpose

Page 5:

Data gathering

Traditionally:
  Field trials
  Gather data
  Analyse on paper
  Publish conclusions
  Data stays in a drawer

Page 6:

Early computing

Data on tapes, punched cards etc
Physically managed by central computing units
Data preserved though may not be fully catalogued or readable long term

Page 7:

Modern computing

Gathered on portable devices
Analysed on PC
Stored on removable media
No central responsibility, existence known only to researcher
Unknown, unreachable, unreadable
So data is recompiled

Page 8:

Forest data

Time dependent, not repeatable
Time series important: significant variations may occur over relatively short periods
Essential to preserve all historical data we can

Page 9:

Impact of web

Preserving data in a mediated library allows delivery with health warnings
Making it web-accessible leaves it open to misinterpretation
But harmonised data useful in many non-forestry contexts
Problem lies in the harmonisation

Page 10:

DBH

Diameter at Breast Height
How high is your breast?
  1.3m (4’3”) (USA etc)
  1.4m (4’6”) (UK etc)
  1.5m (for ornamental trees).
Decimal conversions also introduce variations: 4’6” is more accurately 1.37m.
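
The rounding point can be checked with a little arithmetic. The sketch below is illustrative only: the helper name `feet_inches_to_m` is made up for this example, and the only fact it relies on is that one inch is exactly 0.0254 m. It compares each imperial breast height quoted on the slide with its rounded metric figure.

```python
# Illustrative check of the imperial-to-metric rounding mentioned on the slide.
INCH_TO_M = 0.0254  # exact, by definition of the international inch


def feet_inches_to_m(feet: int, inches: int = 0) -> float:
    """Convert a height given in feet and inches to metres."""
    return (feet * 12 + inches) * INCH_TO_M


# (label, feet, inches, rounded metric figure quoted on the slide)
conventions = [
    ("4'3\"", 4, 3, 1.3),
    ("4'6\"", 4, 6, 1.4),
]

for label, feet, inches, nominal_m in conventions:
    exact_m = feet_inches_to_m(feet, inches)
    gap_cm = abs(exact_m - nominal_m) * 100
    print(f"{label}: exact {exact_m:.4f} m, quoted {nominal_m} m, gap {gap_cm:.1f} cm")
```

On a tapering stem even a gap of a few centimetres in measurement height changes the diameter recorded, which is why the convention used belongs in the metadata.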

Page 11:

A little knowledge is a dangerous thing

Adding stats for DBH from different areas without conversion will be misleading
Can lead to bad decision making
E.g. in climatology, basing estimates of carbon incorporation on forest volume
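
One way a data service could guard against this is to refuse to pool DBH figures taken at different measurement heights. The following is a minimal sketch under stated assumptions: the records, region names and function names are all hypothetical, and no actual conversion model is attempted.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records: (region, measurement height in m, DBH in cm).
records = [
    ("region_a", 1.3, 31.0),
    ("region_a", 1.3, 28.5),
    ("region_b", 1.4, 30.2),
    ("region_b", 1.4, 27.9),
]


def mean_dbh_by_height(rows):
    """Average DBH separately for each measurement height."""
    groups = defaultdict(list)
    for _region, height_m, dbh_cm in rows:
        groups[height_m].append(dbh_cm)
    return {height_m: mean(values) for height_m, values in groups.items()}


def pooled_mean_dbh(rows):
    """Pool DBH values only when every row shares one measurement height."""
    heights = {height_m for _region, height_m, _dbh in rows}
    if len(heights) > 1:
        raise ValueError(
            f"mixed measurement heights {sorted(heights)}: convert before pooling"
        )
    return mean(dbh_cm for _region, _height, dbh_cm in rows)


print(mean_dbh_by_height(records))
try:
    pooled_mean_dbh(records)
except ValueError as err:
    print("refused:", err)
```

The check only works if the measurement height travels with the data, which is a metadata question rather than a computing one.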

Page 12:

What’s that got to do with librarians?

Aim to make data readily available to all who can use it, without restriction or censorship
Internet helps, but aids unintentional – or intentional – misuse
Answer: better metadata and user education

Page 13:

GFIS

Data harmonization originally an aim of Global Forest Information Service
Not achieved because of manpower required to generate extra metadata defining conversion requirements, or just warning of incompatibilities
Most data not compiled for international use, no funding to provide metadata at source

Page 14:

EU to the rescue

1989 regulations to set up European Forestry Information and Communication System
“well-structured and reliable forest information at European level”
NEFIS: Network for a European Forest Information Service 2003-5
http://www.efi.int/portal/project/nefis

Page 15:

Into operation

European Forest Information and Communication Platform (EFICP)
http://eficp-info.jrc.it/
Long gestation common:
  Political requirement
  Development of prototype
  Study problems
  Development of production system
Now 19 years since original Regulation

Page 16:

Use it or lose it

Communicate existence of system
Make it easy to use and reliable
Must save user’s time
NEFIS project illuminates problems
Many relate to librarians’ traditional expertise:
  Terminology
  Classification
  Quality assessment
  Searchability
  Interoperability
  High-quality metadata

Page 17:

Iterative development

Distributed technology favours new uses/users for existing data
Infrastructure needs:
  Advanced spatio-temporal data collection and information management
  Dissemination and fusion of heterogeneous distributed information
  Sophisticated analysis, modeling and visualization of information
  Designed to outlive current software

Page 18:

Cf Bioinformatics

Single information system holds:
  Sequencing data
  Tools for annotation
  Tools for analysis
  Publications resulting from analysis
E.g. NCBI http://www.ncbi.nlm.nih.gov/

Page 20:

An integrated system for forestry?

Much wider variety of data types
Much wider community of users
And of technical infrastructure
NCBI model bridges data acquisition, analysis and curation
Publishing models increasingly incorporate raw data source with peer-reviewed research

Page 21:

Publishing data

Author compiles dataset containing forest cover statistics spanning multiple jurisdictions and century-long time series
Data acquisition and harmonisation methods recorded in metadata
Publishes package so data remains available long-term for use or further analysis by others, retrievable alongside journal articles

Page 22:

Open Access

Non-subscription environment to ensure wide availability
Requires new approach to research funding
And long-term funding for data curation
That role likely to fall on library community:
  Business and technical expertise in archiving
  Developing and supporting integration and interoperability tools
  Online repositories

Page 23:

Developing standards

NEFIS datasets too different to achieve interoperability
Demonstrated need
EU European Interoperability Framework 2004:
  Technical
  Semantic [precise meaning]
  Organizational
Last two most challenging

Page 24:

Semantic interoperability

Descriptive metadata
Controlled vocabularies
Ontologies
User-nominated terms – requires editor
Tagging
Quality:
  Accuracy
  Logical consistency
  Completeness
  Positional accuracy
  Lineage
Non-censorious indication – ‘quality report’
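
A ‘quality report’ of this kind could be as simple as a record with one free-text statement per quality element, delivered alongside the data rather than used to filter it. The sketch below is an assumption about how that might look, not a NEFIS format: the class name, field types and wording are invented for illustration, and only the five element names come from the slide.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class QualityReport:
    """One optional statement per quality element named on the slide."""

    accuracy: Optional[str] = None
    logical_consistency: Optional[str] = None
    completeness: Optional[str] = None
    positional_accuracy: Optional[str] = None
    lineage: Optional[str] = None

    def render(self) -> str:
        """List every element, marking gaps instead of hiding the dataset."""
        lines = []
        for field in fields(self):
            value = getattr(self, field.name)
            label = field.name.replace("_", " ").capitalize()
            lines.append(f"{label}: {value if value is not None else 'not stated'}")
        return "\n".join(lines)


# Hypothetical example values.
report = QualityReport(
    completeness="two survey years missing",
    lineage="digitised from paper field sheets",
)
print(report.render())
```

Missing elements are reported as ‘not stated’ so the user can judge fitness for purpose; nothing is withheld.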

Page 25:

Data location

Provider’s server
Or central?
If local, owner responsible for metadata management
Interoperability requires metadata on:
  Protocols for query translation
  Mapping of field labels
  Field contents
  Background information
  Associated files
  Related IPR
  Required executables
  Language and character set
  Access control mechanisms
Standards to be agreed so all new compilations and reloaded legacy data have this information
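
The metadata list above lends itself to a simple completeness check plus a field-label mapping. What follows is a minimal sketch, assuming a dictionary-based record: the key names, the sample labels (`BHD_cm`, `Baumart`) and both function names are hypothetical, chosen only to mirror the nine items on the slide.

```python
# Hypothetical key names, one per metadata item listed on the slide.
REQUIRED_KEYS = [
    "query_protocol",
    "field_label_mapping",
    "field_contents",
    "background",
    "associated_files",
    "ipr",
    "required_executables",
    "language_charset",
    "access_control",
]


def missing_metadata(record: dict) -> list:
    """Return the required keys that are absent or empty in a metadata record."""
    return [key for key in REQUIRED_KEYS if not record.get(key)]


def translate_row(row: dict, mapping: dict) -> dict:
    """Rename local field labels to shared labels; unmapped fields are kept as-is."""
    return {mapping.get(label, label): value for label, value in row.items()}


# Hypothetical provider record with only part of the metadata supplied.
record = {
    "query_protocol": "http-get",
    "field_label_mapping": {"BHD_cm": "dbh_cm", "Baumart": "species"},
    "language_charset": "de / UTF-8",
}

print(missing_metadata(record))
print(translate_row({"BHD_cm": 31.0, "Baumart": "Picea abies", "plot": 7},
                    record["field_label_mapping"]))
```

A check like this could run whenever a new compilation or reloaded legacy dataset is registered, so gaps are flagged at source.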

Page 26:

NEFIS Demonstrator

No data harmonization
Showed feasibility of retrieving and analysing data for a single request to multiple servers in multiple countries
Comprises:
  Resource discovery toolkit – searches metadata
  Remote search demonstrator – managing data retrieval from multiple sources
  Visualisation toolkit (VTK) – naïve and expert modelling of retrieved data
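
The idea of one request fanned out to several servers can be sketched in a few lines. This is not the NEFIS software: the in-memory "servers", country codes, dataset titles and function names below are all stand-ins invented for illustration, and, as in the demonstrator, no harmonisation of the returned records is attempted.

```python
from concurrent.futures import ThreadPoolExecutor


def make_server(country, rows):
    """Build a stand-in for a remote server: a function that searches titles."""
    def search(term):
        return [dict(row, source=country) for row in rows if term in row["title"].lower()]
    return search


# Hypothetical national catalogues.
SERVERS = {
    "FI": make_server("FI", [{"title": "Forest cover 1950-2000"}]),
    "DE": make_server("DE", [{"title": "Growing stock by region"},
                             {"title": "Forest cover inventory"}]),
}


def federated_search(term):
    """Send one query to every server in parallel and merge the raw results."""
    with ThreadPoolExecutor() as pool:
        futures = {country: pool.submit(search, term.lower())
                   for country, search in SERVERS.items()}
        results = []
        for country in sorted(futures):  # fixed order for reproducible output
            results.extend(futures[country].result())
    return results


for hit in federated_search("Forest cover"):
    print(hit["source"], "-", hit["title"])
```

Each hit keeps its `source` tag so the user can see which provider, and hence which local conventions, the record came from.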

Page 27:

EDA

Exploratory Data Analysis
Unbiased examination of data to detect patterns, trends, relationships rather than answer preconceived question
Mirrors bioinformatics approach
NEFIS data specially prepared
Adoption of common standards could allow development of VTK with no need for human intervention in preparing data

Page 28:

Librarians are key

In:
  Curating data
  Developing and supporting implementation of standards
  Ensuring ready access to data
  Promoting use
Universal Data Control – UDC…
It’s classification, Captain, but not as we know it… or maybe it is!
So let’s do it….