Nuxeo 5.3 and Semantic R&D

28
Copyright © Ch'ti JUG – License Creative Commons 2.0 France Ch’ti Ch’ti JUG JUG Nuxeo 5.3 and Semantic Nuxeo 22 octobre 2009

Transcript of Nuxeo 5.3 and Semantic R&D

Page 1: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Nuxeo 5.3 and Semantic Nuxeo

22 octobre 2009

Page 2: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Page 3: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Nuxeo 5.3 Architecture and Semantic R&D

Nuxeo - The company Nuxeo EP - Architecture overview Nuxeo DM 5.3 - new features

• Open Social• Windows SharePoint Services

Nuxeo Semantic R&D• Scribo / IKS / Eureka• The Semantic Web• Named Entities Recognition• Semantic Hashing for fast image lookup

Page 4: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

The company

Nuxeo editor: Nuxeo EP• Open Source ECM

Support subscriptions: Nuxeo Connect CEVA integrations Nuxeo DM - Cloud Edition Custom developments and Consulting

Page 5: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Nuxeo

Page 6: Nuxeo 5.3 and Semantic R&D

Ch’ti Ch’ti JUGJUG

Copyright © Nuxeo – License Creative Commons 2.0 France

Nuxeo is hiring

http://www.nuxeo.com/en/societe/jobs

Page 7: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Nuxeo EP – Architecture

Page 8: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Nuxeo Core (Repository)

Nuxeo Core

JCRJCR

Core Services

Core Repository

LifecycleLifecycle Doc typesDoc types

EventsEvents VersioningVersioning

SecuritySecurity

Content RepositoryContent Repository

VCSVCS

QueryQuery

AdapterAdapter AdapterAdapter

Page 9: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Nuxeo EP – Main services

Repository (Document Types / Version / ACL / Lifecycle)

UserManager + Directories (User / Groups – SQL + LDAP)

Workflow (JBPM) Audit (Record Management) Relation Service (Jena document

links) Tag service ...

Page 10: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Nuxeo DM demo

Page 11: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Apogée – Nuxeo RCP

Page 12: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Nuxeo Digital Asset Management

Page 13: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Other applications

Nuxeo Digital Asset Management• Multimedia collection management• Fully Ajaxified filter based UI• Video / Audio streaming• Similarity search

Nuxeo Correspondence• FAX & Snail Mails imports• Smart routing workflows

Page 14: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Semantic R&D

Scribo – IKS – Eureka Semantic Web? Named Entities Recognition Semantic Relationships Extraction Automated document categorization Keyphrase (e.g. Tag) extraction Semantic Hashing for similarity search

Page 15: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Semantic Web

Linked entities and properties in a global namespace

Page 16: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Named Entities Recognition

Find occurrences of:• Persons• Places (Countries, Cities, Seas, …)• Organizations (Companies,

Gouvernments, NGOs, …)

Based on NLP algorithms• Explicit Linguistic Rules• Machine Learning (e.g. SVMs, CRFs, ...)

Links to Wikipedia / DBPedia / Freebase, ...

Page 17: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Scribo UIMA Architecture

Page 18: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Embedded UIMA chain

Page 19: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Page 20: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

NER Demo – Scribo Worbench

Page 21: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Semantic Hashing

GIST Image extraction• Vector of 960 float parameters

Semantic Hashing• Int64 code that preserves semantics

Algorithms• Stacked Denoising Autoencoders• Locality Sensitive Hashing• Spectral Hashing• Deep Belief Networks

Page 22: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Autoencoders - step 1

Encoder 0

960 floats  ­ GIST of noised image

400 floats – intermediate code

960 floats  ­ GIST of noiseless image

Decoder 0

Page 23: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Autoencoders – step 2

Encoder 0

960 floats  ­ GIST of noised

400 floats – intermediate code

Encoder 1

64 floats – intermediate code

Decoder 1

400 floats  for encoder_0(noiseless image)

Page 24: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Autoencoders – step 3

Encoder 0

960 floats  ­ GIST code

400 floats – intermediate code

Encoder 1

64 floats – intermediate codeBinary Threshold

64 bits Integer semantic hash

Page 25: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Expected results on pictures

Results from: Learning to Hash with Binary Reconstructive Embeddings – 2009 ­ Brian Kulis, Trevor Darrell

Page 26: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Questions ?

Page 27: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Licence

Les photos et logos appartiennent à leurs auteurs respectifs

Le contenu de la présentation est sous licence Creative Commons 2.0 France• Contrat Paternité• Pas d'Utilisation Commerciale• Partage des Conditions Initiales à

l'Identique http://creativecommons.org/licenses/by-nc-sa/2.0/fr/

27

Page 28: Nuxeo 5.3 and Semantic R&D

Copyright © Ch'ti JUG – License Creative Commons 2.0 France

Ch’ti Ch’ti JUGJUG

Cocktail

Merci pour votre attention

Merci à Ineat Conseil pour son sponsoring

28