Nuxeo 5.3 and Semantic R&D
-
Upload
olivier-grisel -
Category
Technology
-
view
2.488 -
download
0
Transcript of Nuxeo 5.3 and Semantic R&D
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Nuxeo 5.3 and Semantic Nuxeo
22 octobre 2009
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Nuxeo 5.3 Architecture and Semantic R&D
Nuxeo - The company Nuxeo EP - Architecture overview Nuxeo DM 5.3 - new features
• Open Social• Windows SharePoint Services
Nuxeo Semantic R&D• Scribo / IKS / Eureka• The Semantic Web• Named Entities Recognition• Semantic Hashing for fast image lookup
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
The company
Nuxeo editor: Nuxeo EP• Open Source ECM
Support subscriptions: Nuxeo Connect CEVA integrations Nuxeo DM - Cloud Edition Custom developments and Consulting
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Nuxeo
Ch’ti Ch’ti JUGJUG
Copyright © Nuxeo – License Creative Commons 2.0 France
Nuxeo is hiring
http://www.nuxeo.com/en/societe/jobs
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Nuxeo EP – Architecture
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Nuxeo Core (Repository)
Nuxeo Core
JCRJCR
Core Services
Core Repository
LifecycleLifecycle Doc typesDoc types
EventsEvents VersioningVersioning
SecuritySecurity
Content RepositoryContent Repository
VCSVCS
QueryQuery
AdapterAdapter AdapterAdapter
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Nuxeo EP – Main services
Repository (Document Types / Version / ACL / Lifecycle)
UserManager + Directories (User / Groups – SQL + LDAP)
Workflow (JBPM) Audit (Record Management) Relation Service (Jena document
links) Tag service ...
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Nuxeo DM demo
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Apogée – Nuxeo RCP
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Nuxeo Digital Asset Management
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Other applications
Nuxeo Digital Asset Management• Multimedia collection management• Fully Ajaxified filter based UI• Video / Audio streaming• Similarity search
Nuxeo Correspondence• FAX & Snail Mails imports• Smart routing workflows
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Semantic R&D
Scribo – IKS – Eureka Semantic Web? Named Entities Recognition Semantic Relationships Extraction Automated document categorization Keyphrase (e.g. Tag) extraction Semantic Hashing for similarity search
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Semantic Web
Linked entities and properties in a global namespace
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Named Entities Recognition
Find occurrences of:• Persons• Places (Countries, Cities, Seas, …)• Organizations (Companies,
Gouvernments, NGOs, …)
Based on NLP algorithms• Explicit Linguistic Rules• Machine Learning (e.g. SVMs, CRFs, ...)
Links to Wikipedia / DBPedia / Freebase, ...
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Scribo UIMA Architecture
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Embedded UIMA chain
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
NER Demo – Scribo Worbench
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Semantic Hashing
GIST Image extraction• Vector of 960 float parameters
Semantic Hashing• Int64 code that preserves semantics
Algorithms• Stacked Denoising Autoencoders• Locality Sensitive Hashing• Spectral Hashing• Deep Belief Networks
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Autoencoders - step 1
Encoder 0
960 floats GIST of noised image
400 floats – intermediate code
960 floats GIST of noiseless image
Decoder 0
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Autoencoders – step 2
Encoder 0
960 floats GIST of noised
400 floats – intermediate code
Encoder 1
64 floats – intermediate code
Decoder 1
400 floats for encoder_0(noiseless image)
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Autoencoders – step 3
Encoder 0
960 floats GIST code
400 floats – intermediate code
Encoder 1
64 floats – intermediate codeBinary Threshold
64 bits Integer semantic hash
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Expected results on pictures
Results from: Learning to Hash with Binary Reconstructive Embeddings – 2009 Brian Kulis, Trevor Darrell
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Questions ?
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Licence
Les photos et logos appartiennent à leurs auteurs respectifs
Le contenu de la présentation est sous licence Creative Commons 2.0 France• Contrat Paternité• Pas d'Utilisation Commerciale• Partage des Conditions Initiales à
l'Identique http://creativecommons.org/licenses/by-nc-sa/2.0/fr/
27
Copyright © Ch'ti JUG – License Creative Commons 2.0 France
Ch’ti Ch’ti JUGJUG
Cocktail
Merci pour votre attention
Merci à Ineat Conseil pour son sponsoring
28