Semantic Search in E-Discovery

Post on 27-Jan-2015

David Graus

Research on the application of text mining and information retrieval for fact finding in regulatory investigations

Who’s Involved?

Prof. dr. Maarten de Rijke: Director, Intelligent Systems Lab, UvA

David van Dijk, MSc.: Researcher E-Discovery, CREATE-IT applied research

Dr. Hans Henseler: Lector E-Discovery, CREATE-IT applied research

Menno Israël, MSc.: Team leader, Knowledge and Expertise Centre for Intelligent Data Analysis (Kecida), NFI

David Graus, MSc.: PhD Candidate, Semantic Search in E-Discovery, UvA

Zhaochun Ren, MSc.: PhD Candidate, Semantic Search in E-Discovery, UvA

Introduction

What is Semantic Search in E-Discovery?

- Retrieving and securing digital forensic evidence
- From emails, forums, etc.

Challenge

- Finding out who knew what, from whom, and when
- Generic search is not the answer

Finding evidence for E-Discovery

- We don’t know what we’re looking for
- What we’re looking for might be deliberately hidden
- Communication might be very domain-specific, contextualized or incomplete

Task

- Retrieve all relevant traces
- Highly iterative search process
- Support (re)formulating questions and hypotheses

How do we approach this?

- Two subprojects:
  - Information Retrieval
    - Finding material of unstructured nature from large collections
  - Information Extraction/Text Mining
    - Discovering patterns in data

How do we approach this?

- Information Retrieval
  - Integrating structure/context of data in retrieval models
    - Capturing forum and email context
    - Conversational search
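
As an illustration of what capturing email context could mean in practice, the sketch below groups messages into threads by following their `In-Reply-To` headers, so that a short reply can be indexed together with the conversation around it. This is only a minimal sketch under stated assumptions, not the project's retrieval model: the sample messages, `build_threads` and `thread_context` are hypothetical names introduced here.

```python
from collections import defaultdict
from email import message_from_string

# Hypothetical sample messages; a real collection would come from mailbox exports.
RAW_MESSAGES = [
    "Message-ID: <1@example.org>\nFrom: alice@example.org\n"
    "Subject: Q3 figures\n\nPlease review the numbers.",
    "Message-ID: <2@example.org>\nIn-Reply-To: <1@example.org>\n"
    "From: bob@example.org\nSubject: Re: Q3 figures\n\nThey look off to me.",
    "Message-ID: <3@example.org>\nIn-Reply-To: <2@example.org>\n"
    "From: alice@example.org\nSubject: Re: Q3 figures\n\nLet's discuss offline.",
]


def build_threads(raw_messages):
    """Group messages into threads by following In-Reply-To links to a root."""
    messages = [message_from_string(raw) for raw in raw_messages]
    # Map each message id to the id it replies to (None for a thread starter).
    parent = {m["Message-ID"]: m["In-Reply-To"] for m in messages}

    def root_of(message_id):
        # Walk upwards while the parent is a message we actually hold;
        # the `seen` set guards against malformed, cyclic reply chains.
        seen = set()
        while parent.get(message_id) in parent and message_id not in seen:
            seen.add(message_id)
            message_id = parent[message_id]
        return message_id

    threads = defaultdict(list)
    for m in messages:
        threads[root_of(m["Message-ID"])].append(m)
    return threads


def thread_context(thread):
    """Concatenate the bodies of a thread into one context string."""
    return " ".join(m.get_payload().strip() for m in thread)


threads = build_threads(RAW_MESSAGES)
for root, thread in threads.items():
    print(root, len(thread), thread_context(thread))
```

A retrieval model could then score a message against a query using both its own text and the thread context, which is one simple way to bring structure into ranking.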

How do we approach this?

- Information Extraction/Text Mining
  - Extracting structured knowledge from user generated content
    - Semantic pre-processing
    - Social network inference
    - Information maps
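
To make social network inference concrete, the sketch below counts how often each sender addresses each recipient, based on the `From`, `To` and `Cc` headers of a few emails. It is a rough illustration under stated assumptions rather than the method used in the project: the sample messages and the `infer_network` helper are hypothetical, and a real system would likely also use message content and timestamps.

```python
from collections import Counter
from email import message_from_string
from email.utils import getaddresses

# Hypothetical sample messages used only for illustration.
SAMPLE_EMAILS = [
    "From: Alice <alice@example.org>\n"
    "To: bob@example.org, Carol <carol@example.org>\n"
    "Subject: Draft\n\nFirst version attached.",
    "From: bob@example.org\nTo: alice@example.org\n"
    "Subject: Re: Draft\n\nThanks.",
    "From: alice@example.org\nTo: bob@example.org\n"
    "Cc: carol@example.org\nSubject: Final\n\nFinal version attached.",
]


def infer_network(raw_messages):
    """Return a Counter mapping (sender, recipient) pairs to message counts."""
    edges = Counter()
    for raw in raw_messages:
        msg = message_from_string(raw)
        # getaddresses parses header values into (display name, address) pairs.
        senders = [addr for _, addr in getaddresses(msg.get_all("From", []))]
        recipients = [
            addr
            for _, addr in getaddresses(
                msg.get_all("To", []) + msg.get_all("Cc", [])
            )
        ]
        for sender in senders:
            for recipient in recipients:
                edges[(sender.lower(), recipient.lower())] += 1
    return edges


network = infer_network(SAMPLE_EMAILS)
for (sender, recipient), count in network.most_common():
    print(f"{sender} -> {recipient}: {count}")
```

The resulting weighted, directed edges are a starting point for asking who communicated with whom; attaching dates to each edge would extend this towards the "when" part of the question.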

How do we approach this?

- Information Retrieval <-> Information Extraction

Current work (first steps)

- Information Retrieval
  - Twitter Mining (as a form of conversational search)
- Information Extraction/Text Mining
  - Entity linking (for semantic document enrichment)
- TREC/TAC benchmarking events
  - TREC Legal Track 2011 (2013?)

Contributions

- xTAS: Open source text analysis toolkit
- iColumbo: Internet monitoring framework
- Used by:
  - Internet Recherche Netwerk
  - Koninklijke Bibliotheek
  - Beeld en Geluid
  - ... You?

Semantic search in E-discovery

- David Graus
- d.p.graus@uva.nl
