FY 2017 NGGDPP Proposal Information Summary

11
FY2017 NGGDPP Proposal South Carolina Geological Survey vii FY 2017 NGGDPP Proposal Information Summary Name of the State Geological Survey: South Carolina Geological Survey Project Title: Geologic Map GIS Data Migration to GeMS Data Model Principal Investigator(s): Tanner Arrington South Carolina Geological Survey 5 Geology Road Columbia, SC, 29212 803-896-7714, [email protected] Authorized Institutional Mr. Alvin Taylor Representative: Director, SC Department of Natural Resources 1000 Assembly Street Columbia, South Carolina, 29201-0167 803-734-4007, [email protected] Amount Requested: Salaries $26,988 Fringe Benefits $10,795 Travel Expenses $900 Other Direct Costs $0 Indirect Costs $4,216 Grand Total $42,899 Proposed Start Date: 07/01/2017 Proposed Duration: 06/30/2018 Has this proposal been submitted to any other agency for funding: no Active NGGDPP-related grants: none

Transcript of FY 2017 NGGDPP Proposal Information Summary

FY2017 NGGDPP Proposal South Carolina Geological Survey

vii

FY 2017 NGGDPP Proposal Information Summary

Name of the State Geological Survey: South Carolina Geological Survey

Project Title: Geologic Map GIS Data Migration to GeMS Data Model

Principal Investigator(s): Tanner Arrington

South Carolina Geological Survey

5 Geology Road

Columbia, SC, 29212

803-896-7714, [email protected] Authorized Institutional Mr. Alvin Taylor

Representative: Director, SC Department of Natural Resources

1000 Assembly Street

Columbia, South Carolina, 29201-0167

803-734-4007, [email protected] Amount Requested: Salaries $26,988 Fringe Benefits $10,795 Travel Expenses $900 Other Direct Costs $0 Indirect Costs $4,216 Grand Total $42,899 Proposed Start Date: 07/01/2017 Proposed Duration: 06/30/2018

Has this proposal been submitted to any other agency for funding: no

Active NGGDPP-related grants: none

FY2017 NGGDPP Proposal South Carolina Geological Survey

viii

Project Summary

The South Carolina Geological Survey (SCGS) proposes to develop a workflow for migrating

existing geologic map GIS databases into the GeMS data model (formerly NCGMP09). The workflow will

be used to migrate GIS databases from quadrangles identified in three priority areas of South Carolina.

The GeMS data model is much more comprehensive than current SCGS database design. Migrating to

this data model, which other state geological surveys and the USGS National Cooperative Geologic

Mapping Program (NCGMP) are adopting, will make geologic map GIS data at the SCGS more valuable in

the both short- and long-term. The migration promotes the goals of the SCGS Long Range Data

Preservation Plan, the NCGMP, and the National Geological and Geophysical Data Preservation Program

(NGGDDP) by preserving data in a standardized format that enhances accessibility and usability. The

project will result in the following products: 1) quadrangle maps in priority areas in South Carolina will

be migrated into the GeMS data model, 2) documentation and tools will be developed for continuing

the process in the future and informing current data collection efforts, and 3) metadata records entered

into the National Digital Catalog. The migrated quadrangle databases will be immediately beneficial to

SCGS and other partners because geologic map data are used as the foundation for ongoing research in

the identified priority areas, such as hydrologic modeling and coastal change studies. The migrated

databases, and the workflow developed from the process, will be a beneficial data preservation task for

the SCGS, partners, and the public because of the robust and standardized data schema. Without this

data migration task, SCGS geologic map GIS holdings are at risk of becoming obsolete due to the lack of

crucial data and metadata encoded in the current databases.

FY2017 NGGDPP Proposal South Carolina Geological Survey

ix

Table of Contents

Negotiated Rate Agreement/Indirect Cost Rate Agreement .............................................. i

FY 2017 NGGDPP Proposal Information Sheet ..................................................................vii

Project Summary ............................................................................................................... viii

Table of Contents ............................................................................................................... ix

Project Description ............................................................................................................. 1

a. Introduction ........................................................................................................... 1

i. Project Background and Justification ....................................................... 1

ii. Improving Databases to Enhance Understanding .................................... 2

iii. Databases and Archiving System for Preservation ................................... 3

iv. Updating Digital Data to New Formats ..................................................... 4

v. Support from NGMDB ............................................................................... 4

vi. Prior NGGDPP Results ............................................................................... 4

b. Project Procedures ................................................................................................. 4

i. Prioritizing Quad Maps for Migration ....................................................... 4

ii. Meeting the Challenges of Migrating to GeMS ........................................ 6

iii. Project Personnel and Workflow .............................................................. 6

iv. Generated Products .................................................................................. 7

v. Conformance with SCGS Long-Range Data Preservation Plan ................. 7

vi. Data Rescue and Preservation Workshop ................................................ 7

Data Management Plan ...................................................................................................... 8

Data Rescue and Preservation Workshop Request ............................................................ 9

Detailed Budget ................................................................................................................ 10

Appendices

Appendix A – Letter of Support ........................................................................................ 12

Appendix B – Project Personnel Experience and CVs ....................................................... 14

FY2017 NGGDPP Proposal South Carolina Geological Survey

1

Project Description

INTRODUCTION

The legislatively described duties of the South Carolina Department of Natural Resources (SCDNR), Geological Survey, state that the Geological Survey (SCGS) shall provide geologic advice and assistance to a variety of other agencies involved in land use, economic development, and environmental protection. Geologic mapping is a core function of the SCGS, which has more than 300 geologic map publications and numerous unpublished maps in digital development available for decision makers and the public. Modern geologic maps are generated digitally from field data using Geographic Information Systems (GIS). Standardization of these digital materials is critical for long-term stewardship of geologic information. Preserving GIS data in robust and standardized databases increases accessibility in both the short- and long-term.

This project proposes to preserve existing GIS databases for published geologic quadrangles by converting them to the Geologic Map Schema (GeMS, formerly NCGMP09) database design developed by the National Cooperative Geologic Mapping Program of the U.S. Geological Survey. The integration of GeMS into SCGS workflows will be documented. The GIS databases for geologic quadrangles that are migrated will be made available for download on the SCGS website.

GeMS is designed to encode the core content that is found on a single traditional geologic map produced by the USGS or a state geological survey into a GIS database. GeMS is designed to be flexible enough to meet the needs of a variety of use cases. It has undergone a series of revisions for meeting the diverse needs of the geologic mapping community. The GeMS data model is currently being finalized, and it is expected to be published by USGS in the spring (David Soller, personal communication January 2017). The design of the data model is “focused on the transfer and archiving of map data” (NCGMP09 documentation).

Project Background and Justification Our customers consist of a variety of federal, state, local agencies, environmental consultants, and private individuals. Some SCGS customers were surveyed from the SCGS website. Of the 109 survey respondents, 89 say they are interested in geologic maps and geologic map data. Respondents prefer multiple formats of geologic maps, including hard copy paper maps (38.2%), digital document of image (65.17%), GIS data (33.17%), and interactive web maps (53.93%). The results of this ongoing survey suggest that GIS data and web maps are highly desirable for SCGS customers. The proposed data migration project will help existing and future map products meet these customer needs.

This project is proposed because the SCGS does not have a complete data model for developing and preserving geologic map GIS data. While a standard cartographic workflow exists for producing the final geologic map product (a PDF map), the GIS databases are bare-bones and do not include much information that is found on the map, i.e. unit descriptions or unit age. Essential metadata is also under-developed. This limits internal accessibility, makes delivering digital map data to customers more difficult, and is inadequate for preserving geologic map GIS data.

This project will address data preservation of geological GIS datasets in two ways. First, migrating existing databases from geologic quadrangles in critical areas to the GeMS standard will enhance accessibility and preserve the data. Second, the knowledge gained from the migration process and GeMS will be documented for future data preservation and ongoing mapping projects.

FY2017 NGGDPP Proposal South Carolina Geological Survey

2

Figure 1. Comparison of the existing SCGS database and the NCGMP09 database.

Figure 2. Comparison of the data associated with geologic unit polygons in each database.

Migrating existing GIS databases to the GeMS data model has both short-term and long-term benefits, which meet NGGDPP program goals in the following categories:

Improving databases to enhance understanding, interpretation, and use of data;

Improving existing databases and developing an archiving system for preserving a collection of digital data;

Updating digital data to new formats to maintain accessibility as older digital storage technology becomes obsolete and newer technology becomes available;

Improving Databases to Enhance Understanding The primary benefit of this project is more useful, accessible, and sustainable geologic map GIS data, which is an economic benefit. Availability of geologic information provides a relatively low-cost

FY2017 NGGDPP Proposal South Carolina Geological Survey

3

alternative to expensive investigations that would only duplicate existing information. It is in the best economic interest of South Carolina to increase the accessibility and usability of geologic map data.

The GeMS data model is a robust schema that encodes all the data from a traditional geologic map, plus additional metadata (Figure 1 and Figure 2). It preserves the backbone of - and adds value to - the visual map product. The current SCGS GIS databases contain the minimum amount of data necessary to produce a map and requires the presence of the mapping geologist’s for interpretation. The GeMS data model, on the other hand, is designed to represent and preserve the geologist’s interpretation, making it useful on its own.

The results of the data migration will provide immediate benefits to the SCGS and the public. It will eliminate the process of preparing data and metadata on an as-needed basis (e.g. for a customer request, for building an application) because the data already will have been prepared. This saves the SCGS time and allows us to deliver data to the costumer more quickly. Further, data delivered to customers will be more robust, and thus require fewer follow-up requests about data provenance, clarifications, and ancillary data. The data will be in a standard schema that the customer can rely on, which gives the customer confidence in our product.

Geologic map data are often used as foundational data sources for other research. With GIS data in the GeMS data model, merging of datasets for regional map compilations, onshore-offshore geologic maps, hydrologic modeling, hazards research, and other current research priorities will be easier. This is a benefit to SCGS and to all the partners who use geologic map GIS data.

Databases and Archiving System for Preservation The immediate benefits listed above are also long-term benefits. Robust databases with adequate metadata are more useable for future users, especially after the originator of the data has moved on. They will also be more accessible, or discoverable, because of the standardized schema and metadata required.

One of the critical aspects GeMS is that it encodes the geologist’s interpretation. This includes defining terms like ‘location approximate’, where ‘approximate’ may have different meaning between two geologists, or in two different locations. Data should be usable and valuable when mapping staff are no longer available to answer questions about the particular dataset. By migrating existing databases, the SCGS will preserve the geologist’s knowledge for future users internally and externally.

Some field data may exist that were used in the interpretation and subsequent production of a map, but that were not incorporated into the database because the GIS structure was not in place to support it. Similarly, sometimes field data are collected, but cannot be represented cartographically at 1:24,000 scale. These data can be added to GeMS data model, further enhancing the value of the database and the map product. This is important because future users may have new information that makes field data relevant, or that changes the way a dataset is interpreted.

Data preservation can be addressed in the data collection process. A well-structured, standardized database can inform standardized data collection workflows. Geologic mapping in the field is a main component of the SCGS. Recently, SCGS field geologists began developing new digital mapping techniques using tablets in the field and a cloud database. The digital data collection process is informed by the GeMS data model so that field geologists can be mindful of the necessary information for the database. This ensures that geologists are collecting information that is important for data preservation.

Another key component of this project is the documentation of the GeMS integration with SCGS workflows. This will include documenting the data migration process, defining terminology used in SCGS

FY2017 NGGDPP Proposal South Carolina Geological Survey

4

geologic maps, and outlining and implementing field collection guidelines with SCGS geologists. The work of documentation is important for the continuity of this data preservation effort.

Updating Digital Data to New Formats Currently, many SCGS geologic map data are stored in an ESRI Personal Geodatabase, an older GIS database built on top of Microsoft Access. The newer File Geodatabase format is preferred because it does not rely on Microsoft Access technology and has fewer limitations. Migrating these databases into a File Geodatabase with the GeMS schema will ensure that the data are archived in a storage format that is widely used and consistent. This data migration project will position our GIS data holdings to keep up with future technology developments. As technology continues to change quickly, it is important to have robust and standardized datasets so that they do not become obsolete.

Migrating our databases now into the GeMS data model will make it easier to keep pace with future technology changes. Future migration tasks can be completed efficiently because data are standardized.

Support from NGMDB David Soller, project lead for the USGS - National Geologic Map Database (NGMDB) and the GeMS data model, has provided a letter of support for this project (Appendix A). The NGMDB project is an important customer. The project is tasked with building a national repository of geologic map information that can be queried through standard criteria. The GeMS data model promotes the goals of the national archive by standardizing the way digital geologic map data are formatted. The SCGS can contribute to and promote the same goal by following the GeMS standard.

Prior NGGDPP Results The SCGS has participated in previous NGGDPP grant opportunities. The first major project was an inventory of the SCGS Core Repository. Over 1400 cores were inventoried. The outcome of this project was a much better grasp of the Repository’s holdings, of which 239 had completed files and were submitted to the National Digital Catalog. The second project was the development and implementation of a database to accommodate scanning and cataloging of drill-logs, one of the primary data holdings within the SCGS. The drill-log database and document management system were developed with the help of the South Carolina Department of Natural Resources (SCDNR) Technology Development Program. The drill-log cataloging is ongoing. Work has begun to develop a web-map based application for displaying these records to the public.

PROJECT PROCEDURES The goals of the project can be divided into three parts:

Migrate existing GIS data to a database design based on the GeMS schema

Documenting the workflow for building GeMS databases for SCGS map data and developing custom tools for the task

Making these databases available on the internet for easy download

Prioritizing Quad Maps for Migration Quadrangles mapped through the STATEMAP program will be migrated to the GeMS data model. The SCGS has mapped and published 175 quadrangles as a part of the STATEMAP program. These existing GIS databases are prioritized for migration on the basis of criteria from two categories: customer needs and SCGS knowledge (Table 1). Prioritizing maps will help to ensure that the most useful maps are made available sooner.

FY2017 NGGDPP Proposal South Carolina Geological Survey

5

Customer needs SCGS knowledge

Areas of known concern or interest across the state Quality of the geologic mapping

Relevance to current projects in SCGS Quality of existing GIS database

Relevance to other customers’ ongoing projects Geological staff available for map interpretation

Responses to a website survey

Table 1. Categories for prioritizing quadrangle map migration

Coastal and coastal plain quadrangles are a first priority for migration (Figure 3). Areas along the coast continue to increase in population, such as the Grand Strand Area. Research priorities in this area include: groundwater modeling and saltwater intrusion monitoring; coastal geologic hazards and vulnerability due to flooding, coastline change, earthquakes, and sinkholes; and growing attention to conservation efforts in coastal environments. From the web survey results, coastal and coastal plain maps are the most requested maps.

Some piedmont quadrangles are a priority for migration because of increasing populations, current water availability concerns, geologic hazards along the Eastern Piedmont Fault System, and mineral extraction.

Figure 3. Map of the Priority Areas for data migration.

Figure 3 outlines the priority areas for map migration. There are three priority areas. Priority Area 1 is

the highly developed Grand Strand area along the northern section of coast. Priority Area 2 is the Low

Country, which includes the environmentally sensitive ACE Basin. Priority Area 3 is the Aiken-Columbia-

Camden area. Five (5) quadrangle maps will be migrated from each Priority Area, for a total of fifteen

(15) quadrangles.

FY2017 NGGDPP Proposal South Carolina Geological Survey

6

Meeting the Challenges of Migrating to GeMS It has been recognized by other state geological surveys and USGS efforts that migrating to the NCGMP09 (predecessor to GeMS) data model from existing data models is a complicated task. Several talks and posters at the 2015 Digital Mapping Techniques meeting highlighted some of the workflows and issues that were encountered. Washington State Division of Geology and Earth Resources (WS DGER) has published extensive documentation on a pilot project. These resources have been reviewed by the SCGS. Much of the groundwork for this process has already begun at the SCGS. Test cases have been completed on a Coastal Plain quadrangle and a Piedmont quadrangle. Some migration tools have been developed and writing of a guide document is underway.

The simplicity of the existing SCGS data model is an advantage for this project. The bare-bones databases will make the technical aspect of the migration less complicated. Much of the information will be entered new by referencing the map and consulting with staff geologists. This alleviates some of the complications reported by other Geological Surveys who have tested the data model with complex geologic maps.

Project Personnel and Workflow The personnel chosen for this project encompass the geologic and GIS experience needed to complete the migration tasks appropriately. The data migration tasks are completed in GIS, but will require geologic expertise for interpretation of some key attributes and metadata. The geologists will also advise and review the documentation for collection of field data as it relates to data preservation. GIS workflows and tools will be developed by the GIS manager for quality assurance. Quality control tasks for finished databases will be implemented, which will include running a validation tool that was developed for the GeMS data model. Table 2 outlines the project personnel and their roles. The time allotted for each person is shown in the budget.

Staff Role

Tanner Arrington – GIS Manager

Project lead – Quality assurance, tool development, GIS process documentation, GIS quality control, website development

C. Scott Howard Ph.D. – SCGS Chief Geologist

Geology lead – Select priority quadrangles, geologic map interpretation, quality control of geologic information in finished databases

Staff (to be hired)

GIS data migration tasks, write documentation, write metadata, database quality control

Robert Morrow, IV – Piedmont Geologist

Geologic map interpretation, advise on documentation

Will Doar, Ph.D.– Coastal Plain Geologist

Geologic map interpretation, advise on documentation

Table 2. List of project personnel and their roles.

A common response from those who tested GeMS data migration projects is that it would be beneficial to have someone dedicated to the project (Digital Mapping Techniques 2015, Salt Lake City, UT). It is more efficient to have one person who becomes very familiar with the data migration process than several that try to approach it part-time, and never have the time to know the process well. That is why we are proposing to hire staff who will dedicate all of their time to this task. This person also will be responsible for documenting the workflow, so that their knowledge of the process is recorded and can inform geologic data collection and future GIS data migration projects.

FY2017 NGGDPP Proposal South Carolina Geological Survey

7

Priority geologic quadrangle maps will be selected by chief geologist Dr. Scott Howard based on the criteria above. Hired staff will consult with GIS Manager Tanner Arrington and Dr. Howard to review the maps and discuss potential challenges. The data migration process will be initiated by hired staff. Mr. Arrington will ensure that hired staff have proper tools and GIS capabilities for developing databases and migrating the data. Staff will consult Dr. Howard, geologist Robert Morrow (Piedmont), or geologist Dr. Will Doar (Coastal Plain) about GeMS fields that require geologic interpretation. They will document their progress, write metadata for each database, and help to write the documentation of the migration process. Once a quadrangle has been migrated, it will go through a quality check by hired staff and Dr. Howard. Mr. Arrington will develop a web page to distribute the databases on the SCGS website.

Background and expertise information about the project personnel can be found in Appendix B, along with each person’s curriculum vitae.

Generated Products The primary products of this work are databases that have been migrated into the GeMS data model, and documentation outlining the workflow for migrating GIS data and collecting new data using the GeMS data model. Some of the expected documents and tools that will be developed include:

A comprehensive migration guide document for continuing data migration and assisting future map production with GeMS databases.

Python tools for ArcGIS.

New geologic GIS databases based on the GeMS data model for published quadrangle maps made available on SCGS servers and the SCGS website.

Metadata records will be completed for each map database migrated to the GeMS data model, and then translated into the NGGDPP metadata schema for upload to the National Digital Catalog (NDC). Metadata will be initially generated with the ArcGIS NCGMP09 toolbox and Evan Thoms’ (USGS) metadata toolbox, which are designed to write metadata specifically for GeMS. Then the data will extracted to the required data elements for the NDC.

The data management plan in the next section describes how the results of this project will be made accessible. SCGS’s goal is to make our data holdings as accessible as possible because it provides a benefit to us through time savings, and a benefit our customers. The GIS databases that result from this project, along with future databases, will be made available for download on the SCGS website along with the geologic maps.

Conformance with SCGS Long-Range Data Preservation Plan Migrating data to the GeMS data model addresses a common goal between the data preservation plan, STATEMAP mapping plans, and SCGS organizational plans: develop better databases for our data holdings. The SCGS plans to develop better databases so that years of data collection will be useful and accessible for future generations of users. While the data collected served an immediate need, the SCGS recognizes that data can have value in the future, sometimes unforeseen in the present. Therefore, it is important to the SCGS that data are preserved in accessible formats for future potential uses. For example, the SCGS continues to scan and input drill-logs into a document management system and database. Work is also ongoing in working with the Hydrology Section of SCDNR to catalog and share data beneficial to both parties, including drill-logs, geophysical-logs and well records.

Data Rescue and Preservation Workshop We request matching NGGDPP funds to sponsor travel to the workshop in September. Details are outlined in the budget.

FY2017 NGGDPP Proposal South Carolina Geological Survey

8

Data Management Plan

The resources that are expected to be preserved are GIS databases from which South Carolina Geological Survey (SCGS) geologic quadrangle maps are produced. The GIS databases are bare-bones, holding only the minimal amount of spatial and tabular data necessary from which to construct a geologic map. Databases corresponding to published geologic maps in priority areas of South Carolina will be chosen for data migration into the Geologic Map Schema (GeMS) data model. GeMS is a robust GIS data model developed specifically for geologic maps.

The GeMS data model is being published by USGS and submitted to the Federal Geographic Data Committee (FGDC) as a standard for geologic map GIS data. The GeMS data model includes metadata built into the schema of the database (called feature-level metadata), which allows for some metadata to be associated with specific records within the database. It also includes standard practices for recording database-level metadata. Database-level metadata will be translated into the NGGDPP metadata schema for submitted the metadata records to the National Digital Catalog.

SCGS is a state agency. All records, reports, data sets, and notes are available through FOIA requests; however, we freely supply information on request. Access to GIS data used for the creation of geologic maps is available by contacting the GIS Manager. We impose no restrictions on making our data available.

SCGS imposes no restrictions on the re-use, re-distribution, or the development of derivative products from SCGS data sets. Data descriptions and use restrictions are included in the metadata to inform the user of the intended purpose, scale and scope of the data set. We ask that original data source be acknowledged.

The SCGS stores all GIS databases from the mapped quadrangles on a specific server. These databases are maintained and available to the public upon request. In addition, the SCGS has a subset of these databases available for download from the SCGS website. The databases available on the website are developed with the old, bare-bones schema. The SCGS will replace these databases with those that result from this project. The web pages will continue to be used to distribute future databases as they are migrated following the successful completion of this project. In addition, the metadata records submitted to the National Digital Catalog will be archived by the USGS.