ESRI_UC_2007
Metadata for geoscientists
Donald W. Downey
Earth Scientist
CA RG #7487 TX RG #1539 AAPG CPG #5838 GISP #51955
Chevron Exploration & Production Technology Company (EPTC)
Earth Science Technology Department (ESD)
Exploration and New Ventures Team
6001 Bollinger Canyon Road, C-1188
San Ramon, CA 94583-0946
Tel 925 842 3448
mailto:[email protected]
27th ESRI International User’s Conference
June 20, 2007
Petroleum User’s Group metadata website
http://www.pug-steering.com/
Definitions
Metadata is... searchable information about a data resource.
or… “Metadata is hidden information in a computer file that may contain potentially dangerous or embarrassing information or lead to an accidental disclosure.”
http://blogs.adobe.com/acrolaw/2005/10/metadata_and_pd.html
Metadata for geologic data
• Seismic lines (2D, 3D data, navigation, balance)
• Well logs and cross section lines
• Maps (structure, isopach, facies)
• Stratigraphic columns
• Tabular data (picks, core analysis, geochem)
Who is the owner? Where is it located?
When was this data created?
Why is this data important?
Searchers vs. taggers
The taggers believe in adding complete metadata so we can search
The searchers believe in searching everything using powerful search tools
Neither viewpoint gets the job done...
We are getting overwhelmed by data!
Many datasets have legal restrictions!
AAPG Code of Ethics
• Members shall not use or divulge any employer's or client's confidential information without their permission and shall avoid conflicts of interest that may arise from information gained during geological investigations.
• Members shall freely recognize the work done by others, avoid plagiarism, and avoid the acceptance of credit due others.
• Members shall endeavor to cooperate with others in the profession and shall encourage the ethical dissemination of geological knowledge.
http://www.aapg.org/business/codethic.cfm
What is the problem?
• Complicated editors
• Generic style sheets
• Numerous metadata fields
• Poor synchronization
• Poor metadata persistence
• Very few geologists are actively entering metadata!
Metadata workflow rules
The plan must be easier than the present metadata editing workflow!
What is the desired state?
• Author recognized
• Ownership rights
• Legal restrictions
• Metadata is entered for important files!
What is my solution?
• Remember that AAPG Ethics requires citation and protection of others’ datasets and intellectual property
• Auto-fill higher-level information by cascading directory path information from enclosing folders and project work orders
• Auto-fill lower-level metadata fields using data analysis
• Improve synchronization between bibliographic and spatial metadata
Standard citation format
Maintain the spirit of the standard bibliographic citation for digital data!
North American Stratigraphic Code, American Association of Petroleum Geologists Bulletin (v. 67, no. 5, p. 841-875), 1983.
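One way to carry this citation layout over to digital data is to store the bibliographic elements as separate metadata fields and render them on demand. The sketch below is only an illustration: the `Citation` class and its field names are assumptions, not part of any metadata standard. The values are taken from the citation above.

```python
from dataclasses import dataclass


@dataclass
class Citation:
    """Bibliographic elements; the field names are illustrative assumptions."""
    title: str
    publication: str
    volume: int
    number: int
    pages: str
    year: int


def format_citation(c: Citation) -> str:
    """Render the elements in the layout of the example citation."""
    return (f"{c.title}, {c.publication} "
            f"(v. {c.volume}, no. {c.number}, p. {c.pages}), {c.year}.")


stratigraphic_code = Citation(
    title="North American Stratigraphic Code",
    publication="American Association of Petroleum Geologists Bulletin",
    volume=67, number=5, pages="841-875", year=1983,
)
print(format_citation(stratigraphic_code))
```

Keeping the elements separate means the same record can feed both a printed reference list and searchable metadata fields.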
Metadata Tips and Tricks
• Key metadata elements
• Editing in ArcCatalog
• Auto-updates, keywords
• Templates & enclosures
• Metadata can get lost when re-saving files
• Don’t put all the metadata into a single document
• Don’t put metadata into all of your documents
• Do use metadata .xml templates
Metadata needs of a geologist
Catalog/Search/Usability/Document
For spatial datasets, provide enough information for users to work with those datasets in a GIS
Information you should provide includes: author, owner, editor, coordinate system, scale and accuracy, data currentness
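The list of information above can double as a simple completeness check. A minimal sketch, assuming a metadata record is held as a plain dictionary; the key names and the sample record are hypothetical.

```python
# Key names are assumptions that mirror the items listed on the slide.
REQUIRED_FIELDS = (
    "author", "owner", "editor", "coordinate_system",
    "scale", "accuracy", "currentness",
)


def missing_fields(record: dict) -> list:
    """Return the required fields that are absent or blank in a record."""
    return [
        field for field in REQUIRED_FIELDS
        if not str(record.get(field) or "").strip()
    ]


# Hypothetical, partly filled record.
record = {
    "author": "J. Doe",
    "owner": "Example Co.",
    "coordinate_system": "WGS 84",
    "scale": "1:50,000",
}
print(missing_fields(record))  # ['editor', 'accuracy', 'currentness']
```

A check like this could run before a dataset is shared, so that gaps are reported to the author rather than discovered by the next user.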
Raw data vs. interpreted data
In New Ventures hydrocarbon exploration, the most important data is the original raw field data.
Even excellent previous interpretations become data for the next round of interpretations because most new big discoveries come from interpretation of new geologic play concepts!
Metadata completeness
• Data type
• Data purpose
• Audience
• Company policy
Metadata workplan
• Create metadata
• Long-term maintenance
• QC and Validation
• Sharing permissions
Image metadata workflow
Raster images generation
Metadata is created during...
• Project approval process
• Initial image generation
• Image processing
• Georeferencing
• Editing of vector features
• Project results report
Microsoft Office to Adobe Acrobat
In Microsoft Word (without a document open), go to the Adobe Acrobat PDF menu and choose “Change Conversion Settings”. Select “Convert Document Information” and then click OK. This will pass the Document Properties to the PDF file.
Microsoft NTFS File Catalog
Directory paths are metadata: create a table of original filepaths and current filepaths for addition to metadata keywords, where $folder(1) = “Top Level Path” and $folder(n) = “Containing Folder”. Cascade folder names into the metadata of the files inside.
Synchronization with graphics
[Screenshots of metadata panels: ACDSee Database; Adobe Acrobat Properties (two views); image Properties with EXIF, IPTC annotation and XMP annotation]
Synchronization with Microsoft
Microsoft NTFS/Office
Keyword workflow
Extracting metadata keywords
• Capture text in:
  project summary
  attribute tables
  text in document
  similar datasets
• Sort by unique words
• Sort by capitalization
  placenames
  stratigraphy
  lithology
  paleontology
• Select keywords
• Validation tables
• Spatial analysis
Validation (Lookup) Tables
Standardized lists to validate
Data Domains
Keywords: Countries, basins, company names
Wellname and seismic line naming conventions
Stratigraphic column converts pick nomenclature
Project and software parameters
Horizontal & vertical scaling, color tables
Processing parameters: grid increment, contour interval
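The first steps of the keyword workflow (capture text, find unique words, sort by capitalization, check against a validation table) can be sketched in a few lines. This is an illustration under stated assumptions: the sample text and the small lookup table are made up for the example, and a real validation table would come from company standards.

```python
import re
from collections import Counter

# Illustrative lookup table; assumed for this example only.
VALID_KEYWORDS = {"Permian", "Texas", "Wolfcamp"}


def candidate_keywords(text: str) -> Counter:
    """Count each distinct word, keeping its original capitalization."""
    return Counter(re.findall(r"[A-Za-z][A-Za-z'-]+", text))


def split_by_capitalization(counts: Counter) -> tuple:
    """Capitalized words are more likely to be place or formation names."""
    capitalized = sorted(w for w in counts if w[0].isupper())
    lowercase = sorted(w for w in counts if w[0].islower())
    return capitalized, lowercase


def validate(words: list, lookup: set) -> list:
    """Keep only the words found in the validation (lookup) table."""
    return [w for w in words if w in lookup]


# Hypothetical project summary text.
summary = ("Structure map of the Wolfcamp shale in the Permian basin, Texas. "
           "Wolfcamp picks from well logs.")

counts = candidate_keywords(summary)
capitalized, _ = split_by_capitalization(counts)
print(capitalized)                            # ['Permian', 'Structure', 'Texas', 'Wolfcamp']
print(validate(capitalized, VALID_KEYWORDS))  # ['Permian', 'Texas', 'Wolfcamp']
```

The lookup table is what removes "Structure", which is capitalized only because it starts a sentence. Capitalization narrows the candidates; validation makes the final selection.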
Project Summaries to Metadata
Utilize the pre-existing information in Project Proposals (Work Orders)
• Extract Project Name, Owner and Project Purpose
• Internal Charge Code field can serve as a key field to interlink with project time-writing and with the files generated for the project†
† Interfacing with the business environment: linking personnel timewriting and file creation allows management of careers and assets
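Using the charge code as a key field amounts to a lookup from each file record into the table of work orders. A minimal sketch, assuming both tables are available as Python dictionaries; every charge code, name and path below is hypothetical.

```python
# Hypothetical work orders, keyed by internal charge code.
work_orders = {
    "CC-1001": {
        "project_name": "Example Basin Study",
        "owner": "Example Team",
        "purpose": "Regional screening",
    },
}

# Hypothetical file records; only some carry a charge code.
files = [
    {"path": "maps/structure_top.shp", "charge_code": "CC-1001"},
    {"path": "notes/scratch.txt", "charge_code": None},
]


def fill_from_work_order(file_record: dict, orders: dict) -> dict:
    """Copy project name, owner and purpose from the matching work order."""
    order = orders.get(file_record.get("charge_code"))
    if order is None:
        # No matching work order: leave the record unchanged.
        return dict(file_record)
    return {**file_record, **order}


enriched = [fill_from_work_order(f, work_orders) for f in files]
for record in enriched:
    print(record)
```

Files without a charge code pass through untouched, which makes them easy to list for manual follow-up.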
Descriptive folder hierarchy
Many users organize their data files using folder names.
Capture this by auto-inserting or cascading folder metadata
Create a table of original filepaths and current filepaths
$filefolder_original(1) = “Top Level Path”
$filefolder_original(n) = “Containing Folder”
• Cascade the folder names into metadata fields
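The folder cascade can be sketched with the standard library alone. The example below assumes Windows-style paths; the function names, the record layout and the sample paths are hypothetical. Level 1 corresponds to the top-level folder and level n to the containing folder, as in the notation above.

```python
from pathlib import PureWindowsPath


def folder_levels(filepath: str) -> dict:
    """Map level numbers to folder names (1 = top level, n = containing folder)."""
    path = PureWindowsPath(filepath)
    # Skip the drive or share anchor (e.g. "C:\") so level 1 is a folder name.
    folders = [part for part in path.parent.parts if part != path.anchor]
    return dict(enumerate(folders, start=1))


def cascade(original_path: str, current_path: str) -> dict:
    """Build one row of the original/current path table with cascaded keywords."""
    levels = folder_levels(original_path)
    return {
        "original_path": original_path,
        "current_path": current_path,
        "top_level_folder": levels.get(1),
        "containing_folder": levels.get(len(levels)),
        "keywords": list(levels.values()),
    }


# Hypothetical file that was moved from a project folder to an archive.
row = cascade(
    r"C:\Projects\ExampleBasin\Seismic\line_01.sgy",
    r"D:\Archive\2007\line_01.sgy",
)
print(row["keywords"])  # ['Projects', 'ExampleBasin', 'Seismic']
```

Recording the original path alongside the current one preserves the descriptive folder names even after files are reorganized or archived.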
Metadata for geoprocessing
Models are geoprocessing workflows
• Model metadata for casual users
  • Primary purpose is to search for data
  • Does not need details of processing
  • Create metadata after processing steps are done
• Model metadata for specialists
  • Primary purpose is for QC analysis
  • Details of processing and results of analysis
  • Need to create and edit while doing processing
Folders & geodatabases
• Manually create metadata using a metadata editor and save as "metadata.xml" within the folder
• Save an HTML file as "metadata.htm"
• Add enclosures such as an index map graphic
• Metadata for both personal & multi-user geodatabases is stored internally.
http://support.esri.com/knowledgeBase/documentation/FAQs/sde_/webhelp802/ArcCatalog/Metadata_Support.htm
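As an alternative to the manual editor, a folder-level "metadata.xml" could also be generated by a script. The sketch below is an assumption, not the workflow described above: it borrows element names from the FGDC CSDGM short-name convention (`idinfo`, `citeinfo`, `origin`, `title`, `themekey`, `useconst`), fills only a handful of fields, and does not produce a complete or validated record. All sample values are hypothetical.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def build_metadata(title, originator, use_constraints, keywords):
    """Build a minimal metadata tree (element names assumed from FGDC CSDGM)."""
    root = ET.Element("metadata")
    idinfo = ET.SubElement(root, "idinfo")
    citeinfo = ET.SubElement(ET.SubElement(idinfo, "citation"), "citeinfo")
    ET.SubElement(citeinfo, "origin").text = originator
    ET.SubElement(citeinfo, "title").text = title
    theme = ET.SubElement(ET.SubElement(idinfo, "keywords"), "theme")
    for keyword in keywords:
        ET.SubElement(theme, "themekey").text = keyword
    ET.SubElement(idinfo, "useconst").text = use_constraints
    return ET.ElementTree(root)


def save_folder_metadata(folder, tree):
    """Write the tree as 'metadata.xml' inside the given folder."""
    target = Path(folder) / "metadata.xml"
    tree.write(str(target), encoding="utf-8", xml_declaration=True)
    return target


# Hypothetical values for illustration.
tree = build_metadata(
    title="Example structure maps",
    originator="Example Team",
    use_constraints="Internal use only",
    keywords=["ExampleBasin", "Seismic"],
)
print(ET.tostring(tree.getroot(), encoding="unicode"))
```

Calling `save_folder_metadata("some_folder", tree)` would place the file next to the data it describes. The keyword list could be supplied by a folder cascade or keyword extraction step.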
Metadata import and export
Import/Export Metadata Buttons
.xml templates are useful for updating metadata, but is a database a better solution for synchronizing metadata?
What can the community do?
Utilize basic bibliographic metadata standards
(author, title, owner, legal disclaimers)
Create simple editors and style sheets
Improve synchronization and persistence (templates, autofill and keyword generation)
Create additional standards for specialization
Provide a basic solution (ownership, legal, technical) which interfaces with the types of geologic data and software used in our offices!