Weakly supervised segmentation and Firma convenzione ...
Transcript of Weakly supervised segmentation and Firma convenzione ...
Firma convenzione
Politecnico di Milano e Veneranda Fabbrica
del Duomo di Milano
Aula Magna β Rettorato
Mercoledì 27 maggio 2015
Weakly supervised segmentation and recognition of surgical gestures
Tutor: Elena De Momi, PhD
Co-tutor: Danail Stoyanov, PhD
Co-tutor: Hirenkumar Nakawala, PhD
Student: Beatrice van Amsterdam
850736
Academic Year: 2016-2017
Nome Cognome, assoc.prof. ABC Dept.
IntroductionRobot-Assisted Minimally Invasive Surgery (RAMIS)
Robot-Assisted Minimally Invasive
Surgery (RAMIS)
β Video and robot kinematics recoding
β Lack of tactile sensationββ Improved visibility (3D)
β Improved precision and ergonomic comfortβ Reduced trauma, pain, recovery time, costs
β Extended operation time and learning curve
Minimally Invasive Surgery (MIS)
β Reduced trauma, pain, recovery time, costs
β Lack of tactile sensationβ Kinematic restrictions
β Reduced visibility (2D)
Open surgery
βMaximum visibilityβ Tactile sensationβ No kinematic restrictionsβ Substantial trauma and painβ Long recovery time
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
IntroductionBackground clinical problem
Surgical technical errors are the mostcommon reason (30%) for post-surgicalcomplications including re-operation andre-admission. (Stahel, 2014)
(Birkmeyer, 2013)
The issue of surgical errors has becomeparticularly relevant with the advent ofnew surgical techniques that offer greatadvantages to the patient but require ahigh level of visuomotor skill from thesurgeon, such as MIS and RAMIS. (White,
2015)
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
IntroductionAction recognition for Computer Assisted Interventions
β Extended learning curve Computational models for surgical skill assessment
β Critical and time-consuming tasks Intra-operative assistance and automation
Surgery involves dexterous human motion. The eventual goal for studying surgical
motion could improve safety and effectiveness of surgical patient care (Gao, 2014) .
Beatrice van Amsterdam
ACTION RECOGNITION
Understand where the surgeonneeds improvement
Short segments are easier toautomate
Nome Cognome, assoc.prof. ABC Dept.
IntroductionAction granularity
Beatrice van Amsterdam
Procedure
Phase
Activity
Surgeme
Dexeme
E.g. Suturing
E.g. Closure
E.g. Lumbar Disc Herniation
E.g. Grasping the needle
E.g. Turning left
(Lalys, 2014)
Nome Cognome, assoc.prof. ABC Dept.
Related work
Supervised Approaches
β’ Hidden Markov Models (Tao, 2012)
β’ Conditional Random Fields (Tao, 2013)
β’ Deep Learning (Lea, 2017)
Unsupervised Approaches
β’ Gaussian Mixture Models (Murali, 2016)
β’ Hidden Markov Models unsup. (Tanwani, 2016)
β’ Bottom-up clustering (Fard, 2016)
Weakly supervised Approaches
β’ Video retrieval (Quellec, 2014)
β’ CNN (Wang, 2017)
β’ Ordering constrained clustering (Bojanowski, 2014)
Beatrice van Amsterdam
β Good resultsβ Manual labelling (burden, subjective)β Fixed set of classes
β No manual labelling
β Poorer performanceβ Classes learnt from data
β Better performance than unsupervised methodsβ β Minor manual labellingβ Fixed set of classes
Nome Cognome, assoc.prof. ABC Dept.
Aim of the work
New approach: employ weak supervision to optimize
unsupervised algorithm initialization
Beatrice van Amsterdam
Many unsupervised algorithms rely on iterative schemes: initialization problem
Expected outcome: improved alignment between
predicted and ground truth segmentation
Nome Cognome, assoc.prof. ABC Dept.
Materials and MethodsThe JIGSAWS dataset
Beatrice van AmsterdamBeatrice van Amsterdam
JIGSAWS (Gao, 2014) is a public dataset featuring suturing demonstrations collected fromeight surgeons with different skill level (Expert, Intermediate, Novice) using the da VinciSurgical System.
Cartesian positions (3 variables)Rotation matrix (9 variables)Linear velocities (3 variables)
Angular velocities (3 variables)Gripper angle (1 variable)
The motion of each manipulator isdescribed by a local frame attached at itsend-effector using 19 kinematic variables:
Expert: > 100 training hours Novice: < 10 training hours
Nome Cognome, assoc.prof. ABC Dept.
Materials and MethodsPre-processing
Rotation matrix converted into
quaternion
Euclidean distance signals
computation
Low pass filter
fc = 1.5 Hz
(Despinoy, 2016)
Normalization to zero mean and unit variance
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
Materials and MethodsSimultaneous action segmentation and recognition
After model training, each sample is assigned to itsmost likely mixture component, i.e. its most likelysurgeme label.
Each demonstration π π‘ β βπ can be treated as anoisy realization of a switched linear dynamicalsystem. Each regime π¨π corresponds to a surgeme.
which is equivalent to solving multiple linearregression problems, when L=1 (Moldovan, 2015).
π π‘ + 1 = π¨ππ π‘ + π π‘ ,
π¨π β {π¨π, β¦ , π¨π΅}, π¨π β βπΓπ
argminπ¨π
π¨ππΏπ β πΏπ+π ,
πΏπ = [π π‘ ,β¦ , π π ] β βπΓπ
Action recognition can then be performed byfitting a Gaussian Mixture Model (GMM) to theaugmented state π π‘ ,
π π‘ = π π‘ , π π‘ + 1 ,β¦ , π π‘ + πΏ ,
π π‘ β βππΏ
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
Materials and MethodsGround truth segmentation redefinition: local linearity
Beatrice van AmsterdamBeatrice van Amsterdam
πΏ = 3 (Murali, 2016) π π‘ = π π‘ , β¦ , π π‘ + πΏ = π π‘ , π π‘ + 1 , π π‘ + 2 , π π‘ + 3
π π‘π π‘ + 1π π‘ + 2π π‘ + 3
π π‘
Nome Cognome, assoc.prof. ABC Dept.
Materials and MethodsGround truth segmentation redefinition: action dictionary
L1: positioning and pushing needle through the tissue
L2: reaching for the needle with the left hand
L3: pulling suture with the left hand
L4: transferring needle from left to right
L5: extracting suture from the tissue with the left hand
L6: reaching for the needle with the right hand;
L7: transferring needle from right to left
L8: using right hand to help tighten suture
L9: dropping suture
L10: moving to end points
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
Materials and MethodsWeakly supervised initialization
Beatrice van Amsterdam
GMM0
w
Nome Cognome, assoc.prof. ABC Dept.
Materials and MethodsOffline segmentation
Beatrice van Amsterdam
w
w
Nome Cognome, assoc.prof. ABC Dept.
Evaluation protocolExtrinsic evaluation metrics
π¨πππππππ =ππ
ππ + πΉπ + πΉπ
πΉπππππ =ππ
ππ + πΉπ
π·ππππππππ =ππ
ππ + πΉπ
π β πππππ =2 Γ ππππππ πππ Γ π πππππ
ππππππ πππ + π πππππ
π΅πππππππππ π΄πππππ π°ππππππππππ (π΅π΄π°) =πΌ πΎ, π
π» πΎ π» π
πΉπππππ π΄πππππΎπππππππ (πΉπ΄πΎ)
π·ππππππππ π΄πππππΎπππππππ (π·π΄πΎ)
π β ππππππ΄πππππΎπππππππ (ππ΄πΎ)
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
Evaluation protocolIntrinsic evaluation metrics
πΊπππππππππ π°ππ ππ (πΊπ°) = ππππ ππΊπ° , ππΊπ° =πβπ
max(π,π), 0 < ππΊπ° < 1
πΉπππππππ πΊπππππππππ π°ππ ππ (πΉπΊπ°) =ππΌ
Ground Truth SI
β
β
β
π
π
π
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
Experiments and ResultsExperimental protocol
Beatrice van Amsterdam
β Ground truth redefinition
β Initialization technique
β Feature selection
β Increased data variability
Experts only
Full dataset
Nome Cognome, assoc.prof. ABC Dept.
Experiments and ResultsGround truth redefinition (experts)
*p<0.05 McNemarβs test
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
Experiments and ResultsGround truth redefinition (experts)
*p<0.05 π2 test for independent proportions
Beatrice van Amsterdam
Proposed annotation
Original annotation
Nome Cognome, assoc.prof. ABC Dept.
Experiments and ResultsInitialization technique (experts)
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
Experiments and ResultsFeature selection (experts)
McNemarβs test
Beatrice van Amsterdam
Nome Cognome, assoc.prof. ABC Dept.
Experiments and ResultsFeature selection (experts)
Beatrice van Amsterdam
output
GT
GToutput
Nome Cognome, assoc.prof. ABC Dept.
Experiments and ResultsExtended dataset
Beatrice van Amsterdam
< (Lea, 2017) supervised (85% Accuracy)< (Fard, 2016) unsupervised (0.78 Recall, 0.74 Precision)> (Murali, 2016) unsupervised GMM-based (0.31 NMI)
Nome Cognome, assoc.prof. ABC Dept.
Conclusion and future work
Experimental results on real surgical kinematic trajectories during a training exerciseconfirm that weakly supervised initialization signicantly outperforms standard task-agnostic initialization methods, leading to 13% improvement of NMI.
Automatic recognition of surgical gestures is an important preliminary step in thedevelopment of computational models for surgical skill assessment as well as intra-operative assistance and automation algorithms.
Future work:
β’ Including contextual and semantic information from the video data.
β’ Extending the kinematic data with the trajectories of the da Vinci arm joints.
β’ Introducing transition probabilities between different actions (HMM).
Beatrice van Amsterdam