Post on 22-Dec-2015
Implementing Metadata
Marjorie M K Hlava, President
Access Innovations, Inc.
Albuquerque, NM
+1-505-998-0800
www.accessinn.com
www.dataharmony.com
mhlava@accessinn.com
1 Subject Metadata
2 Indexing Legacy Content
3 Systems Integration
4 Forward-Flow Indexing
5 Using Your Indexing in Search
6 Metadata Maintenance
7 Author Submission
8 Enhanced Search
9 Analytics
Essential Component of Successful Metadata Strategy
A controlled vocabulary*
• Authority file
• Taxonomy
• Thesaurus
• Ontology
…forms the basis of semantic initiatives.
More than a repository of indexing terms, a taxonomy should:
• Outline the field or area of business
• Illustrate the relationships between concepts
• Describe the areas of expertise of authors, staff, editors, users, customers
• Drive analytics that enable data-based decisions
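To make "relationships between concepts" concrete, here is a minimal sketch of how a taxonomy might be held in memory. The class names, field names, and sample terms are illustrative assumptions, not the structure of any particular product.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Term:
    """One concept in a controlled vocabulary (field names are illustrative)."""
    label: str                                   # preferred term
    synonyms: set = field(default_factory=set)   # non-preferred entry terms
    broader: Optional[str] = None                # parent concept; None for a top term
    related: set = field(default_factory=set)    # associative links


class Taxonomy:
    def __init__(self):
        self.terms = {}

    def add(self, label, broader=None, synonyms=(), related=()):
        self.terms[label] = Term(label, set(synonyms), broader, set(related))

    def narrower(self, label):
        """Children of a term, derived from the broader links."""
        return sorted(t.label for t in self.terms.values() if t.broader == label)

    def path(self, label):
        """Chain from the top term down to the given term."""
        chain = []
        while label is not None:
            chain.append(label)
            label = self.terms[label].broader
        return chain[::-1]


# Hypothetical sample vocabulary.
tax = Taxonomy()
tax.add("Energy")
tax.add("Renewable energy", broader="Energy")
tax.add("Solar power", broader="Renewable energy", synonyms={"photovoltaics"})
tax.add("Wind power", broader="Renewable energy", related={"Solar power"})

print(tax.narrower("Renewable energy"))  # ['Solar power', 'Wind power']
print(tax.path("Solar power"))           # ['Energy', 'Renewable energy', 'Solar power']
```

Storing only the broader link and deriving narrower terms keeps the hierarchy consistent when terms are moved.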
1 Subject Metadata: Taxonomy Construction
Critical Building Block for Semantic Activities
2 Semantic Enrichment (Indexing Legacy Content)
Classifies Content for Use in Search/Analytics
• Apply subject metadata to the repository of content
• Greatest early tangible benefit of metadata development
• Semantically enrich content with subject terms to enhance search and browse capabilities
• Analyze legacy content to produce data for analysis and to enable data-based decisions
• Add an author name disambiguation routine to this process
   o Associates each unique author with the topics on which they publish
   o A process we call “Semantic Fingerprinting”
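The enrichment step above can be sketched as a batch pass that attaches subject terms to each legacy record. This is a deliberately naive string matcher under assumed data shapes (`ENTRY_TERMS`, the `text` and `subjects` fields); production indexing engines use far richer rules.

```python
import re

# Hypothetical vocabulary: each entry term (lowercase) maps to its preferred term.
ENTRY_TERMS = {
    "solar power": "Solar power",
    "photovoltaics": "Solar power",
    "wind power": "Wind power",
    "wind turbines": "Wind power",
}


def index_text(text):
    """Return the sorted preferred terms whose entry terms occur in the text."""
    lowered = text.lower()
    found = set()
    for entry, preferred in ENTRY_TERMS.items():
        # Word boundaries avoid matching inside longer words.
        if re.search(r"\b" + re.escape(entry) + r"\b", lowered):
            found.add(preferred)
    return sorted(found)


def enrich(records):
    """Batch step: attach a 'subjects' list to every legacy record."""
    return [{**rec, "subjects": index_text(rec["text"])} for rec in records]


legacy = [
    {"id": "a1", "text": "Advances in photovoltaics and wind turbines."},
    {"id": "a2", "text": "A history of the printing press."},
]
for rec in enrich(legacy):
    print(rec["id"], rec["subjects"])
```

Records that come back with an empty subject list are useful too: they point to gaps in the vocabulary.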
3 Systems Integration
Incorporate Indexing Software into Workflow
• Metadata management is middleware designed to integrate with the production platform.
• Integrations with
   o Publishing platforms
   o CMS applications
   o Article submission systems
   o Websites
   o Etc.
• Incorporate call-outs to the metadata from inside any of them via API or web service calls
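A call-out from a CMS or submission system might look like the sketch below. The endpoint URL, the request fields, and the `{"terms": [...]}` response shape are all assumptions made for illustration; consult your metadata server's actual API.

```python
import json
import urllib.request

# Hypothetical endpoint; substitute whatever your metadata server exposes.
INDEX_URL = "https://metadata.example.org/index"


def build_index_request(doc_id, text, url=INDEX_URL):
    """Package a document as a JSON POST request for the indexing service."""
    payload = json.dumps({"id": doc_id, "text": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_index_response(body):
    """Extract subject terms from the assumed response shape {"terms": [...]}."""
    return json.loads(body).get("terms", [])


def index_document(doc_id, text, send=urllib.request.urlopen):
    """Call out to the service; `send` is injectable so it can be stubbed in tests."""
    with send(build_index_request(doc_id, text)) as response:
        return parse_index_response(response.read())
```

Keeping request building and response parsing separate from the network call lets each host system reuse the same two functions with its own HTTP client.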
Systems Integration: Incorporating the Metadata and Indexing Tools into Existing Publishing Pipeline
4 Forward-Flow Indexing (New Content)
Automated or Machine-Aided Process
Call the metadata servers at many points:
• Send content (daily, or in real time) to be automatically indexed and return enriched content
• Suggest taxonomy terms in real time to a human indexer
• Batch index legacy content
• …and many other uses.
The goal is to streamline the entire process, making it as efficient as possible while adding accurate subject metadata that you can leverage at various points in the publishing cycle.
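One way to combine the automated and machine-aided modes is to route candidate terms by a confidence score. The scores and both thresholds below are assumptions for illustration; the slides do not say how an indexing engine ranks its suggestions.

```python
def route_terms(scored_terms, auto_threshold=0.8, suggest_threshold=0.4):
    """Split scored candidate terms into auto-applied and human-review lists.

    scored_terms maps a taxonomy term to a score in [0, 1] (assumed input).
    """
    applied, suggested = [], []
    # Highest score first; ties broken alphabetically for stable output.
    for term, score in sorted(scored_terms.items(), key=lambda kv: (-kv[1], kv[0])):
        if score >= auto_threshold:
            applied.append(term)      # fully automated path
        elif score >= suggest_threshold:
            suggested.append(term)    # shown to a human indexer
    return applied, suggested


candidates = {"Solar power": 0.93, "Energy policy": 0.55, "Wind power": 0.2}
applied, suggested = route_terms(candidates)
print(applied)    # ['Solar power']
print(suggested)  # ['Energy policy']
```

Raising `auto_threshold` shifts work toward the human indexer; lowering it shifts toward full automation.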
Forward-Flow Indexing: Simplify the Metadata and Subject Indexing Process in the Content Production and Posting Cycle
5 Search Presentation Layer for Web
Leverage Indexing for Search & Browse
Implement the Metadata-Enriched Data in Search to Leverage Your Semantic Indexing
Leverage your semantic indexing by producing a rich search and browse environment that gives your users faster, more accurate search results.
Use it in the search or web presentation layer (website).
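A browse experience in the presentation layer usually starts from an inverted index of subject terms to content. The following is a minimal sketch with assumed record fields (`id`, `subjects`), not a description of any particular search platform.

```python
from collections import defaultdict


def build_browse_index(records):
    """Map each subject term to the ids of the records indexed with it."""
    browse = defaultdict(list)
    for rec in records:
        for subject in rec["subjects"]:
            browse[subject].append(rec["id"])
    return dict(browse)


def facet_counts(browse):
    """Subject facets for a results page, most populated first."""
    return sorted(
        ((subject, len(ids)) for subject, ids in browse.items()),
        key=lambda pair: (-pair[1], pair[0]),
    )


records = [
    {"id": "a1", "subjects": ["Solar power", "Energy storage"]},
    {"id": "a2", "subjects": ["Solar power"]},
    {"id": "a3", "subjects": ["Wind power"]},
]
browse = build_browse_index(records)
print(browse["Solar power"])  # ['a1', 'a2']
print(facet_counts(browse))   # [('Solar power', 2), ('Energy storage', 1), ('Wind power', 1)]
```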
6 Taxonomy/Metadata Maintenance
Iterative Refinement of Vocabulary
Keeping It Up to Date
Subject metadata is a living document and must be maintained to ensure maximum effectiveness over time.
Your taxonomy will need to be updated as:
• New concepts are introduced into your field
• Your content base expands to cover new topics
• Terminologies and usage practices change over time
• Indexing is reviewed and adjustments are made
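A simple report can help decide where the vocabulary needs attention. The sketch below flags records that received no subject terms and taxonomy terms that were never applied; both signals and the record shape are assumptions, and the slides do not prescribe a review method.

```python
def maintenance_report(records, vocabulary):
    """Summarize indexing gaps that may call for a taxonomy update."""
    used = {subject for rec in records for subject in rec["subjects"]}
    return {
        # Content the current vocabulary could not describe at all.
        "unindexed": sorted(rec["id"] for rec in records if not rec["subjects"]),
        # Terms that never fire: candidates for review, merging, or new rules.
        "unused_terms": sorted(set(vocabulary) - used),
    }


records = [
    {"id": "a1", "subjects": ["Solar power"]},
    {"id": "a2", "subjects": []},
]
report = maintenance_report(records, ["Solar power", "Wind power"])
print(report)  # {'unindexed': ['a2'], 'unused_terms': ['Wind power']}
```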
7 Metadata-Based Add-Ons
Auto-Editor/Reviewer Assignment, &c.
SmartSubmit
• Suggest keywords from the taxonomy to authors right when they submit new content
• Allow authors to suggest missing terms
• Use indexing to assign peer reviewers
• Automatically bin editors to journals via subject-appropriate content
• Get early data on what topics are being submitted for publication
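Reviewer assignment from indexing can be sketched as ranking reviewers by how many subject terms they share with the submission. The reviewer names, expertise lists, and the overlap-count scoring are hypothetical; a real system would likely weight terms and check conflicts of interest.

```python
def rank_reviewers(submission_terms, reviewer_profiles, top_n=2):
    """Rank reviewers by the number of subject terms shared with a submission."""
    submission = set(submission_terms)
    scored = []
    for name, expertise in reviewer_profiles.items():
        overlap = len(submission & set(expertise))
        if overlap:  # skip reviewers with no topical match
            scored.append((overlap, name))
    # Most overlap first; ties broken alphabetically.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored[:top_n]]


profiles = {
    "Reviewer A": ["Solar power", "Energy storage"],
    "Reviewer B": ["Wind power"],
    "Reviewer C": ["Solar power"],
}
print(rank_reviewers(["Solar power", "Energy storage"], profiles))
# ['Reviewer A', 'Reviewer C']
```

The same function can bin editors to journals if the profiles describe each journal's scope instead of each reviewer's expertise.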
Semantic Fingerprinting
• Extract and normalize author and institution names from your data
• Associate each author and institution (journal, book, etc.) with the topics on which they publish and build a subject profile for each
• Great for characterizing entities for sales and marketing purposes
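The two steps above (normalize names, then build a subject profile) can be sketched as follows. The normalization rule here is intentionally naive and is an assumption; real author disambiguation also has to separate different people who share a name.

```python
from collections import Counter, defaultdict


def normalize_name(name):
    """Naive normalization: 'Last, First' becomes 'first last', lowercase, no periods."""
    name = name.replace(".", "").strip()
    if "," in name:
        last, first = [part.strip() for part in name.split(",", 1)]
        name = f"{first} {last}"
    return " ".join(name.lower().split())


def build_fingerprints(records):
    """Count how often each normalized author is associated with each subject."""
    profiles = defaultdict(Counter)
    for rec in records:
        for author in rec["authors"]:
            profiles[normalize_name(author)].update(rec["subjects"])
    return profiles


records = [
    {"authors": ["Doe, Jane"], "subjects": ["Solar power", "Energy storage"]},
    {"authors": ["Jane Doe", "R. Roe"], "subjects": ["Solar power"]},
]
fingerprints = build_fingerprints(records)
print(fingerprints["jane doe"].most_common(1))  # [('Solar power', 2)]
```

The same counting works for institutions, journals, or books by swapping the `authors` field for the entity of interest.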
8 Metadata-Based Search Features
Type-Ahead Functions, Content Recommender
Metadata-Driven Options
Leverage the metadata to:
• Let your users browse the taxonomy for topics and return relevant content
• Power a type-ahead box in search
Recommender Engine: You can also use the subject metadata to drive users to content with the same topical indexing as the content they’re viewing
• Rather than a behavior-based recommender (Amazon, for example, promotes content based on other users)
• Recommend content based on the subject areas tagged
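Both features can be sketched in a few lines: type-ahead as a prefix match against taxonomy terms, and the recommender as a ranking by shared subject terms. The function names, sample data, and the simple overlap score are assumptions for illustration.

```python
def type_ahead(prefix, terms, limit=5):
    """Taxonomy terms that start with what the user has typed so far."""
    prefix = prefix.lower()
    return sorted(t for t in terms if t.lower().startswith(prefix))[:limit]


def recommend(current_id, index, limit=3):
    """Recommend items that share subject terms with the item being viewed."""
    current = set(index[current_id])
    scored = []
    for doc_id, subjects in index.items():
        if doc_id == current_id:
            continue
        shared = len(current & set(subjects))
        if shared:
            scored.append((shared, doc_id))
    # Most shared subjects first; ties broken by id.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored[:limit]]


index = {
    "a1": ["Solar power", "Energy storage"],
    "a2": ["Solar power"],
    "a3": ["Printing"],
    "a4": ["Energy storage", "Solar power", "Wind power"],
}
print(type_ahead("so", ["Solar power", "Soil science", "Wind power"]))
# ['Soil science', 'Solar power']
print(recommend("a1", index))  # ['a4', 'a2']
```

Because the ranking depends only on indexing, it works for brand-new content that has no usage history yet.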
9 Metadata-Enabled Data-Driven Analytics
Investigate Content, Authors, Users, &c.
Make data-based decisions with analytics and visualizations powered by your metadata and semantic enrichment.
See topics trending
• Over time
• By publication
• By author and institution
• By user behavior
Understanding your content drives sales and marketing, content curation, and editorial decisions.
The possibilities are endless.
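As one small example of such analytics, a topic trend over time is a count of indexed records per year. The record fields (`year`, `subjects`) and the sample data are assumptions; the same grouping works by publication, author, or institution.

```python
from collections import Counter


def topic_trend(records, subject):
    """Number of records indexed with a subject, grouped by publication year."""
    counts = Counter(rec["year"] for rec in records if subject in rec["subjects"])
    return dict(sorted(counts.items()))


records = [
    {"year": 2013, "subjects": ["Solar power"]},
    {"year": 2015, "subjects": ["Solar power", "Energy storage"]},
    {"year": 2015, "subjects": ["Solar power"]},
    {"year": 2015, "subjects": ["Wind power"]},
]
print(topic_trend(records, "Solar power"))  # {2013: 1, 2015: 2}
```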