Post on 05-Jan-2016
cTAKES: Demo
Clinical Text Analysis and Knowledge Extraction System
James Masanz
Mayo Clinic
UIMA CAS Visual Debugger (CVD)
• Provided by / part of UIMA
• Run a pipeline against free text or, with an appropriate first annotator, against XML such as a CDA document
• View the annotations created (“debugger”)
• Export annotations to XML (XCAS or XMI)
cTAKES: Components
• Sentence boundary detection (OpenNLP technology)
• Tokenization (rule-based)
• Morphologic normalization (NLM’s LVG)
• POS tagging (OpenNLP technology)
• Shallow parsing (OpenNLP technology)
• Named Entity Recognition
• Negation and context identification (NegEx)
• Dependency parser
• Drug Profile module
• Smoking status classifier
• CEM normalization module
Extend Earlier Example
Tamoxifen 20 mg po daily started on March 1, 2005 for 6 mo.
Aspirin prn.
Fx history of breast cancer. History of migraines.
Sentences
Tokens
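To make the sentence and token steps concrete, here is a minimal sketch in plain Java that segments the example text into typed character spans. It is an illustration only: the class SegmentationSketch, the Span type, and both regular expressions are assumptions made for this sketch. cTAKES itself uses an OpenNLP-based sentence detector and its own rule-based tokenizer, and this naive splitter would mishandle abbreviations such as "Dr." in the middle of a sentence.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SegmentationSketch {

    /** A typed character span over the document, loosely modeled on a UIMA annotation (hypothetical type). */
    static final class Span {
        final String type;
        final int begin;
        final int end;

        Span(String type, int begin, int end) {
            this.type = type;
            this.begin = begin;
            this.end = end;
        }

        String coveredText(String doc) {
            return doc.substring(begin, end);
        }
    }

    // Naive rule: a sentence runs up to the first '.', '!' or '?' that is
    // followed by whitespace or the end of the input. Text after the last
    // terminator is ignored by this sketch.
    private static final Pattern SENTENCE =
            Pattern.compile("\\S.*?[.!?](?=\\s|$)", Pattern.DOTALL);

    // Naive rule: a token is a run of word characters or one punctuation mark.
    private static final Pattern TOKEN = Pattern.compile("\\w+|[^\\w\\s]");

    static List<Span> sentences(String doc) {
        List<Span> result = new ArrayList<>();
        Matcher m = SENTENCE.matcher(doc);
        while (m.find()) {
            result.add(new Span("Sentence", m.start(), m.end()));
        }
        return result;
    }

    static List<Span> tokens(String doc, Span sentence) {
        List<Span> result = new ArrayList<>();
        Matcher m = TOKEN.matcher(doc);
        // Restrict matching to the sentence; offsets stay relative to the document.
        m.region(sentence.begin, sentence.end);
        while (m.find()) {
            result.add(new Span("Token", m.start(), m.end()));
        }
        return result;
    }

    public static void main(String[] args) {
        String doc = "Tamoxifen 20 mg po daily started on March 1, 2005 for 6 mo. "
                + "Aspirin prn. Fx history of breast cancer. History of migraines.";
        for (Span s : sentences(doc)) {
            System.out.println("Sentence: " + s.coveredText(doc));
            for (Span t : tokens(doc, s)) {
                System.out.println("  Token [" + t.begin + "," + t.end + "): " + t.coveredText(doc));
            }
        }
    }
}
```

Keeping document-relative offsets on every span mirrors how annotations are displayed in CVD: each one points back into the original text rather than holding a copy of it.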
Chunks
Windows for Lookup
Named Entity
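The lookup-window and named-entity steps can be pictured as a dictionary search restricted to a small window of tokens. The sketch below is a toy illustration under stated assumptions: the class LookupSketch, the method names, the three-entry dictionary, and the category labels are all hypothetical, and the greedy longest-match strategy is a simplification chosen for clarity, not a description of the cTAKES dictionary lookup implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class LookupSketch {

    /** Toy dictionary (hypothetical entries): lowercased term -> category label. */
    static Map<String, String> toyDictionary() {
        Map<String, String> dict = new HashMap<>();
        dict.put("tamoxifen", "Medication");
        dict.put("breast cancer", "Disorder");
        dict.put("cancer", "Disorder");
        return dict;
    }

    /**
     * Scans one lookup window left to right. At each position it tries the
     * longest remaining token sequence first, so "breast cancer" wins over
     * the shorter entry "cancer".
     */
    static List<String> lookup(List<String> window, Map<String, String> dictionary) {
        List<String> mentions = new ArrayList<>();
        int i = 0;
        while (i < window.size()) {
            int matchedEnd = -1;
            for (int j = window.size(); j > i; j--) {
                String candidate = String.join(" ", window.subList(i, j))
                        .toLowerCase(Locale.ROOT);
                if (dictionary.containsKey(candidate)) {
                    mentions.add(candidate + " -> " + dictionary.get(candidate));
                    matchedEnd = j;
                    break;
                }
            }
            // Skip past a match; otherwise advance by one token.
            i = (matchedEnd != -1) ? matchedEnd : i + 1;
        }
        return mentions;
    }

    public static void main(String[] args) {
        List<String> window = Arrays.asList("Fx", "history", "of", "breast", "cancer");
        for (String mention : lookup(window, toyDictionary())) {
            System.out.println(mention);
        }
    }
}
```

Restricting the search to a window keeps the number of candidate token sequences small, which matters when the dictionary is a large terminology rather than three entries.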
Questions?
Live Demo of CVD