LICT Human-Machine-Interface
Transcript of LICT Human-Machine-Interface
BIODATA ANALYSIS&
VISUALIZATION Jan Aerts
Faculty of Engineering - ESAT/SCD
http://saaientist.blogspot.com@jandot
Tuesday 1 February 2011
Involved in genomics research:
•chicken, cow, human genome DNA sequencing
•search for genetic variation responsible for phenotype/disease
Issues with
•filtering: finding the correct set of parameters
•pattern searching: grasping the significance and effect of the mutations
=> visual analyticsTuesday 1 February 2011
A. Filtering
Investigating parameter space...
Tuesday 1 February 2011
putative mutations
filter 1
filter 2
A B C
different settings for filters
filter 3
Tuesday 1 February 2011
AB
C
What we find...
Tuesday 1 February 2011
AB
C
What we find...
State of the art: run many filter pipelines and take intersection
Tuesday 1 February 2011
AB
C
What we should have found...
Tuesday 1 February 2011
parameter-space
Tuesday 1 February 2011
Tuesday 1 February 2011
Tuesday 1 February 2011
Tuesday 1 February 2011
sometimes: bypass parameter-setting
Tuesday 1 February 2011
Tuesday 1 February 2011
Tuesday 1 February 2011
Tuesday 1 February 2011
Aim: use interactive visualization of the “raw” data to:
•peep inside the black box
•get feel for the data
•get feel for how filter settings influence each other
Tuesday 1 February 2011
Aim: use interactive visualization of the “raw” data to:
•peep inside the black box
•get feel for the data
•get feel for how filter settings influence each other
Eradicate disease
Tuesday 1 February 2011
B. Pattern searching
Making sense of our data...
Tuesday 1 February 2011
Tuesday 1 February 2011
Tuesday 1 February 2011
Typical example: gene networks=> can we identify patterns?
same network
Tuesday 1 February 2011
How do these networks differ?
Tuesday 1 February 2011
Hive Plots, taken from http://mkweb.bcgsc.ca/linnet/
Tuesday 1 February 2011
Aim: help researchers make sense of complicated data:
•gene networks
•structural variation in the genome
• linked data
• ...
Tuesday 1 February 2011
Aim: help researchers make sense of complicated data:
•gene networks
•structural variation in the genome
• linked data
• ...
Eradicate disease
Tuesday 1 February 2011
Hurdles:
•big data (millions/billions of datapoints)
=> makes interactivity difficult
solution: indexing methods, data formats, dimensionality reduction, ...
• visual encoding
Tuesday 1 February 2011
Tools
Tuesday 1 February 2011
User groups:
researcher => clinician => patient
Tuesday 1 February 2011
Bioinformatics
Visualization
Tuesday 1 February 2011
So:
•visual analytics: visually identifying patterns in large datasets to inform on statistical analysis
•use visualization to make sense of complex data
Tuesday 1 February 2011