VINERy: A Visual IDE for Information Extraction



System Architecture

(4) User Study
• Experts
• Novice Users
• Much less training needed: minutes vs. days
• Unanimously found the system useful and easy to use

“[Compared to what he has used before] This is 100 times better.” - A subject

(2) UI Components

(3) VAQL to AQL

Yunyao Li*, Elmer Kim**, Marc A. Touchette, Ramiya Venkatachalam, Hao Wang
*IBM Research – Almaden, **Treasure Data, Inc., IBM Silicon Valley Lab


(1) VAQL (Visual Annotation Query Language)

• Extract Constructs
  • Atomic
    • Pre-built
    • Dictionary
    • Regular Expression
    • Literal
    • Proximity
  • Composite
    • Sequence Pattern
    • Union
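To make the distinction between atomic and composite extract constructs concrete, here is a minimal Python sketch. It is not VAQL or AQL and does not reflect how VINERy is implemented; the function names (`dictionary`, `regex`, `sequence`, `union`), the span representation, and the `max_gap` parameter are all assumptions chosen for illustration.

```python
import re
from typing import List, Tuple

Span = Tuple[int, int]  # (begin, end) character offsets, end exclusive


def dictionary(text: str, terms: List[str]) -> List[Span]:
    """Atomic construct: match any listed term, case-insensitively, on word boundaries."""
    pattern = r"\b(?:" + "|".join(re.escape(t) for t in terms) + r")\b"
    return [m.span() for m in re.finditer(pattern, text, re.IGNORECASE)]


def regex(text: str, pattern: str) -> List[Span]:
    """Atomic construct: match a regular expression."""
    return [m.span() for m in re.finditer(pattern, text)]


def sequence(left: List[Span], right: List[Span], max_gap: int) -> List[Span]:
    """Composite construct: a left span followed by a right span,
    separated by at most max_gap characters (hypothetical parameter)."""
    return [
        (lb, rend)
        for (lb, le) in left
        for (rb, rend) in right
        if 0 <= rb - le <= max_gap
    ]


def union(*span_lists: List[Span]) -> List[Span]:
    """Composite construct: merge the results of several extractors."""
    return sorted({s for spans in span_lists for s in spans})


text = "Call Alice at 555-1234 or Bob at 555-9876."
names = dictionary(text, ["alice", "bob"])
phones = regex(text, r"\d{3}-\d{4}")
contacts = sequence(names, phones, max_gap=4)
everything = union(names, phones)

for begin, end in contacts:
    print(text[begin:end])
```

In this sketch the composite constructs take the outputs of atomic ones as inputs, which mirrors how the poster groups Sequence Pattern and Union separately from the atomic extractors.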

Information extraction is a critical building block for a wide range of emerging applications.

To satisfy the increasing text analytics demands of real-world applications, it is crucial to lower the barrier to entry and empower novices to develop high-quality IE extractors.

VINERy is SystemT’s latest effort towards this goal.

VINERy automatically generates performant and readable AQL programs for execution and, if needed, for further development in AQL.
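As a rough picture of what generating AQL from a visual construct could look like, the following Python sketch renders a single regular-expression extractor as an AQL `create view ... extract regex` statement. The function `regex_node_to_aql`, its parameters, and the idea of a one-to-one template are assumptions made for illustration; the poster does not describe VINERy's actual translation procedure.

```python
def regex_node_to_aql(view_name: str, pattern: str, column: str = "match") -> str:
    """Render a hypothetical regex extractor node as AQL text.

    The statement shape (create view / extract regex / on D.text / from Document D)
    follows standard AQL; everything about the node itself is assumed.
    """
    return (
        f"create view {view_name} as\n"
        f"  extract regex /{pattern}/\n"
        f"    on D.text as {column}\n"
        f"  from Document D;\n"
        f"\n"
        f"output view {view_name};"
    )


aql = regex_node_to_aql("PhoneNumber", r"\d{3}-\d{4}")
print(aql)
```

Emitting one named view per construct is one way the generated program could stay readable enough to be developed further by hand in AQL.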

Available as part of IBM BigInsights since version 4.0.


• Refinement Constructs
  • Projection
  • Expression
  • Consolidation
  • Filter
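Two of the refinement constructs listed above, Consolidation and Filter, can be sketched in Python as operations over extracted spans. This is an illustration under stated assumptions, not VINERy's implementation: the function names are hypothetical, and the consolidation policy shown (drop any span contained within another) is only one of several possible policies.

```python
from typing import Callable, List, Tuple

Span = Tuple[int, int]  # (begin, end) character offsets, end exclusive


def consolidate(spans: List[Span]) -> List[Span]:
    """Keep only spans that are not contained within another, different span."""
    unique = sorted(set(spans))
    return [
        s
        for s in unique
        if not any(o != s and o[0] <= s[0] and s[1] <= o[1] for o in unique)
    ]


def filter_spans(text: str, spans: List[Span],
                 predicate: Callable[[str], bool]) -> List[Span]:
    """Keep only spans whose covered text satisfies the predicate."""
    return [s for s in spans if predicate(text[s[0]:s[1]])]


overlapping = [(0, 5), (0, 11), (6, 11), (20, 25)]
consolidated = consolidate(overlapping)

text = "Alice met bob"
candidates = [(0, 5), (10, 13)]
capitalized = filter_spans(text, candidates, lambda t: t[0].isupper())
```

Here `consolidated` keeps the longest of the overlapping candidates, and `capitalized` keeps only the span covering "Alice".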

• Document Viewer
• Canvas
• Project Pane
• Extractor Catalog
• Result Grid
• Property Pane

How does VINERy compare to other existing ways of performing IE tasks?

[Chart: ratings on a scale from 1 (much worse) to 5 (much better) for Learning Curve, Time Required, Ease of Use, and Effort Required]
