Quadratic PLS applied to external preference ...Quadratic PLS applied to external preference...
Transcript of Quadratic PLS applied to external preference ...Quadratic PLS applied to external preference...
Quadratic PLS applied to external preference mappingexternal preference mapping
Stéphane Verdun Véronique Cariou El Mostafa QannariStéphane Verdun, Véronique Cariou, El Mostafa Qannari
ONIRIS Sensometrics and Chemometrics LaboratoryONIRIS, Sensometrics and Chemometrics Laboratory,Nantes, France
July, 10-13, 20121 11th International Sensometrics, Rennes
Introduction
Context of Preference Mapping:• relate data on consumers’ preference to information on products
• identify attributes drivers of liking
Usual implementation of external preference mapping:
consumers
Setup a perceptual
products map
Fit into the products map
sensorydescriptors
consumers’ preference
ratingsts
products map
cts
productssensoryratingsratings
Y (n x p)
prod
uct
prod
uc
ratings
X (n x q)
2 11th International Sensometrics, RennesJuly, 10-13, 2012
( p) ( q)
Preference mapping
Extension to:• sensory descriptors as a single consensus data table vs multiple data
blocks vs multiway data
e an
d ud
es
bute
s
• information on consumers with a L-shaped data structure
Z’ (r x p)
usag
eat
titu
attri
b
consumersBuild a perceptualFit into the
sensorydescriptors
Z (r x p)
consumers’ preference
ratings
ucts
p pproducts mapproducts map
ucts
productssensoryratings
Y (n x p)prod
u
prod
3 11th International Sensometrics, RennesJuly, 10-13, 2012
See Dijksterhuis & al. (2005) ; Lengard & al. (2006) ; Endrizzi & al. (2010) ; Mage & al. (2012).
Preference mapping
Adapt model to fit:
• vectorial vs polynomial models
Liki
ng
Liki
ngLi
king
Liki
ng
Liki
ng
CP1CP2CP1CP2 CP1CP2 CP1CP2CP1CP2
Setting up of the products perceptual map:
f PCA PLS• use of PCA vs PLS
4 11th International Sensometrics, RennesJuly, 10-13, 2012
Plan
• Models of Quadratic PLS
• Extension of Hoskuldsson’s approach to several Y : QPLS2
• α - Regularization of QPLS2α Regularization of QPLS2
• Application to Preference Mapping : The Coffee dataset
• Conclusion
5 11th International Sensometrics, RennesJuly, 10-13, 2012
Models of Quadratic PLS
From PLS: t u
Y(n x p)X(n x q) Liki
ng
linear functionbetween u and t
Y(n x p)X(n x q)
Sweet
’
max cov2(t,u)
t u
w’p’
c’q’
To quadratic PLS:quadratic functionbetween u and t
Sweet
Liki
ng
6 11th International Sensometrics, RennesJuly, 10-13, 2012
Models of Quadratic PLS
Gnanadesikan (1977), Wold & al. (1984), …t u
Y
t ulinear functionbetween u and t
XX (x 2)X (x *xk) Xmax cov2(t,u)
X (xj )X (xj xk)
Wold & al. (1989), Baffi, Martin and Morris (1999) t u 1 Estimate c b a q adratic
Y
t u
X
1. Estimate c by a quadraticregression of u on t
2 Update w by a Newton1 2
w’p’ c’q’
2. Update w by a Newton-Raphson-like linearization of the quadratic inner relationestimated by linear PLS
7 11th International Sensometrics, RennesJuly, 10-13, 2012
p q estimated by linear PLS
Models of Quadratic PLS : Höskuldsson’s approach
Höskuldsson (1992) extends the PLS1 criterion with the d ti d i t ti tquadratic and interactions terms.
At step h, he seeks to maximize:),²(cov...),²(cov),²(cov),²(cov 11
2−++++ hhhhh ttyttytyty
given t1, …, th-1 already known, and th=Xh-1wh with ||wh ||=1.
On the basis of this criterion, Verdun & al. (2012) have proposed a revision of the original algorithm in order to guarantee convergence and optimality.
This latter one is extended in the case of QPLS2.
Plan
• Models of Quadratic PLS
• Extension of Hoskuldsson’s approach to several Y : QPLS2
• α - Regularization of QPLS2α Regularization of QPLS2
• Application to Preference Mapping : The Coffee dataset
• Conclusion
9 11th International Sensometrics, RennesJuly, 10-13, 2012
Extension of Hoskuldsson’s approach to several Y: QPLS2QPLS2
The criterion can be modified to handle the case where Y has several variables.
At a step h, the algorithm seeks a component uh=Yh-1ch and a component th=Xh 1wh that maximise: co po e t th h-1wh t at a se
),²(cov...),²(cov),²(cov),²(cov 112
−++++ hhhhhhhhh ttuttututu ),(),(),(),( 11 hhhhhhhhh
10 11th International Sensometrics, RennesJuly, 10-13, 2012
Extension of Hoskuldsson’s approach to several Y: some limitationssome limitations
The criterion uses the covariance between u on the one hand, and t and t2 on the other hand. These variables can be at very different scale levels.
If t2 variance is large compared to the variance of t, the objective criterion will be dominated by t2.j y
11 11th International Sensometrics, RennesJuly, 10-13, 2012
Plan
• Models of Quadratic PLS
• Extension of Hoskuldsson’s approach to several Y : QPLS2
• α - Regularization of QPLS2α Regularization of QPLS2
• Application to Preference Mapping : The Coffee dataset
• Conclusion
12 11th International Sensometrics, RennesJuly, 10-13, 2012
α - Regularization of QPLS2
A parameter α is introduced to balance the linear and the
quadratic terms 0≤ α ≤1 :quadratic terms, 0≤ α ≤1 :
( ) ),²(cov1),²(cov 2hhhh tutu αα −+
α is set to the value that leads to the highest RY2 coefficient
of the quadratic model. q
D i th d fl ti t Y i d fl t d i t d t2During the deflation step, Y is deflated using t and t2.
13 11th International Sensometrics, RennesJuly, 10-13, 2012
Plan
• Models of Quadratic PLS
• Extension of Hoskuldsson’s approach to several Y : QPLS2
• α - Regularization of QPLS2α Regularization of QPLS2
• Application to Preference Mapping : The Coffee dataset
• Conclusion
14 11th International Sensometrics, RennesJuly, 10-13, 2012
Application to Preference Mapping : The Coffee dataset (ESN 1996)dataset (ESN, 1996)
Data
• 8 coffees
• 160 french consumers who have been partitionned in 3160 french consumers who have been partitionned in 3 clusters
• Quantitative descriptive analysis: 23 sensory descriptors• Quantitative descriptive analysis: 23 sensory descriptors• Smell descriptors : chocolate, intensity, moisty, sweet...
• Taste descriptors : sour, chocolate, metallic...
15 11th International Sensometrics, RennesJuly, 10-13, 2012
Application to Preference Mapping : The Coffee dataset
Results of the Hierarchical Clustering:
16 11th International Sensometrics, RennesJuly, 10-13, 2012
Application to Preference Mapping : The Coffee dataset
Products map obtained by QPLS:~ 79% of the variance of X explained by t1 and t2
~ 80% of the variance of Y explained by the quadratic model (t1, t2)
17 11th International Sensometrics, RennesJuly, 10-13, 2012
Application to Preference Mapping : The Coffee dataset
RY2 explained for the three clusters
18 11th International Sensometrics, RennesJuly, 10-13, 2012
Application to Preference Mapping : The Coffee dataset– Clusters– Clusters
Consumers like both smooth coffee with chocolateodors and strong coffee with intense odor
Consumers like both the green coffee with a lot ofperfume and the bitter coffee with an intenseaftertaste and an odor of roasted coffee
19 11th International Sensometrics, RennesJuly, 10-13, 2012
Plan
• Models of Quadratic PLS
• Extension of Hoskuldsson’s approach to several Y : QPLS2
• α - Regularization of QPLS2α Regularization of QPLS2
• Application to Preference Mapping : The Coffee dataset
• Conclusion
20 11th International Sensometrics, RennesJuly, 10-13, 2012
Conclusion
QPLS presents several advantages for preference mapping:
• the criterion is very explicit and in line with PLS regression
• the (new) algorithm is simple and convergentthe (new) algorithm is simple and convergent.
But :
• in order to enhance the interpretation of the model, there is a need to operate a model selection.
21 11th International Sensometrics, RennesJuly, 10-13, 2012
Bibliography
Baffi, G., Martin, E. B., & Morris, A. J. (1999). Non-linear projection to latent structures revisited: the quadratic PLS algorithm. Computers and Chemical Engineering, 23(3), 395-411.
G & ( ) C S fDijksterhuis, G., Martens, H., & Martens, M. (2005). Combined Procrustes analysis and PLSR for internal and external mappingof data from multiple sources. Computational Statistics and Data Analysis, 48(1), 47-62.
Endrizzi, I., Gasperi, F., Calo, D. G., & Vigneau, E. (2010) Two-step procedure for classifying consumers in a L-structured datacontext. Food Quality and Preference, 21(3), 270-277.
ESN (1996). A European sensory and consumer study. A case study on coffee. European Sensory Network, ChippingCampden, Gloucestershire.
Gnanadesikan, R. (1977). Methods for statistical data analysis of multivariate observations. Wiley, New York, 1977.
Höskuldsson A (1992) Quadratic PLS regression Journal of Chemometrics 6(6) 307-334Höskuldsson, A. (1992). Quadratic PLS regression. Journal of Chemometrics, 6(6), 307-334.
Lengard, V. r., & Kermit, M. (2006). 3-Way and 3-block PLS regressions in consumer preference analysis. Food Quality andPreference, 17(3-4), 234-242.
Mage, I., Menichelli, E., & Naes, T. (2012). Preference mapping by PO-PLS: Separating common and unique information inl d t bl k F d Q lit d P f 24(1) 8 16several data blocks. Food Quality and Preference, 24(1), 8-16.
Verdun, S., Hanafi, M., Cariou, V. & Qannari, E. M. (2012), Quadratic PLS1 regression revisited. J. Chemometrics, 26: 384–389.
Wold, S., Kettaneh-Wold, N., & Skagerberg, B. (1989). Nonlinear PLS modeling. Chemometrics and Intelligent Laboratory g g ( ) g g ySystems, 7(1-2), 53-65.
Wold, S., Albano, C., Dunn Ill, W.J., Edlund,U., Esbensen, K., Geladi, P., Helberg, S., Johansson, E., Lindberg, W. & Sjostrom, M. (1984). Multivariate data analysis in chemistry, in B.R. Kowalski (Eds), Chemometrics, Mathematics and Statistics in Chemistry, Reidel, Dordrecht, p. 17.
July, 10-13, 201222 11th International Sensometrics, Rennes
QPLS, What else ?
Thank you for your attentionThank you for your attention
23 July, 10-13, 2012 11th International Sensometrics, Rennes