174,99 €
Sensory testing and measurement are the main functions of sensory analysis. In recent years, the sensory and consumer field has evolved to include both difference testing and similarity testing, and new sensory discrimination methods such as the tetrads have received more attention in the literature.
This second edition of Sensory Discrimination Tests and Measurements is updated throughout and responds to these changes and includes:
Mainly intended for researchers and practitioners in the sensory and consumer field, the book is a useful reference for modern sensory analysis and consumer research, especially for sensometrics.
Sie lesen das E-Book in den Legimi-Apps auf:
Seitenzahl: 917
Veröffentlichungsjahr: 2015
Cover
Title Page
Copyright
Dedication
Preface
References
Acknowledgments
About the companion website
Chapter 1: Introduction
1.1 Sensometrics
1.2 Sensory tests and measurements
1.3 A brief review of sensory analysis methodologies
1.4 Method, test, and measurement
1.5 Commonly used discrimination methods
1.6 Classification of sensory discrimination methods
Chapter 2: Measurements of sensory difference/similarity: Thurstonian discriminal distance
2.1 Measurement of sensory difference/similarity
2.2 Thurstonian discriminal distance, δ or
d
′
2.3 Variance of
d
′
2.4 Tables and R/S-Plus codes for
d
′ and variance of
d
′
2.5 Computer-intensive approach to Thurstonian models of the “M + N” test
2.6 Estimates of population and group
d
′
Chapter 3: Measurements of sensory difference/similarity: area under ROC curve in Signal Detection Theory
3.1 Area measure of sensory difference/similarity
3.2 ROC curve functions
3.3 Estimations of the parameters of ROC curves
3.4 Estimations of variances of estimators
3.5 R/S-Plus codes for estimations of parameters for the three ratings methods
3.6 Estimates of population
R
-index in replicated ratings
Chapter 4: Difference testing
4.1 Binomial model for difference testing
4.2 Difference tests using forced-choice methods
4.3 Power analysis for tests for one proportion
4.4 Discrimination tests using methods with response bias
4.5 Power analysis of tests for two proportions
4.6 Efficiency comparisons of difference tests
4.7 Difference tests for
d
′ and
R
-index
Chapter 5: Similarity (equivalence) testing
5.1 Introduction
5.2 Similarity tests using the Two-Alternative Forced Choice (2-AFC) method
5.3 Similarity testing using forced-choice methods
5.4 Similarity tests using methods with response bias
5.5 Similarity tests using ratings of the A–Not A, Same–Different, and A–Not AR
5.6 Similarity tests for continuous data
5.7 Similarity tests for correlated data
5.8 Confidence interval for similarity evaluation
5.9 Controversy over similarity (equivalence) tests in statistical and sensory literature
Chapter 6: Bayesian approach to discrimination tests
6.1 Introduction
6.2 One-proportion two-sided tests
6.3 One-proportion one-sided tests
6.4 Two-proportion tests
6.5 Thurstonian
d′
for Bayesian estimate of proportion
Chapter 7: Modified discrimination tests
7.1 Modified Triangular test
7.2 Degree of Difference test
7.3 Double discrimination tests
7.4 Preference tests with a “no preference” option
7.5 Discrimination tests with pseudo-correct responses (forgiveness)
Chapter 8: Multiple-sample discrimination tests
8.1 Multiple-sample comparison based on proportions
8.2 Multiple-sample comparison based on ranks
8.3 Multiple-sample comparison based on categories
8.4 Multiple-sample comparison based on ratings
8.5 Multiple-sample comparison based on paired comparisons
Chapter 9: Replicated discrimination tests: beta-binomial model
9.1 Introduction
9.2 BB distribution
9.3 Estimation of the parameters
9.4 Applications of the BB model in replicated tests
9.5 Testing power and sample size
Chapter 10: Replicated discrimination tests: corrected beta-binomial model
10.1 Introduction
10.2 CBB distribution
10.3 Estimation of parameters in the CBB model
10.4 Statistical testing for parameters in a CBB model
10.5 Testing power and sample size
10.6 CBB and Thurstonian models for replicated discrimination methods
Chapter 11: Replicated discrimination tests: Dirichlet–multinomial (DM) model
11.1 DM distribution
11.2 Estimation of the parameters of a DM model
11.3 Applications of the DM model in replicated ratings and discrimination tests
11.4 Testing power for DM tests
11.5 DM model in a meta-analysis for usage and attitudinal (U&A) data
Chapter 12: Measurements of sensory thresholds
12.1 Introduction
12.2 Standard dose–response model
12.3 Model for responses with an independent background effect
12.4 Model for overdispersed responses
Chapter 13: Measurements of sensory risk with negative sensory effects
13.1 Benchmark dose methodology
13.2 Estimation of BMD from quantal data
13.3 Estimation of BMD from replicated quantal data
13.4 Estimation of BMD from continuous data
Chapter 14: Measurements of time intensity
14.1 Introduction
14.2 Smoothing and graphical presentation of T-I data
14.3 Analysis based on parameters of smoothed T-I curves
14.4 Multivariate data analysis for T-I data
14.5 Functional data analysis for T-I data
Chapter 15: Measurements of sensory shelf life
15.1 Introduction
15.2 Determination of SSL using R package and R codes
15.3 Numerical examples
Chapter 16: Measurements of the performance of a trained sensory panel and panelists
16.1 Criteria for assessing performance
16.2 Estimations of ICC from different types of data
16.3 Statistical tests for ICCs
16.4 Other indices for evaluation of panel data
16.5 Assessing the discriminability of trained sensory panelists and panels
References
Chapter 17: Measurements of consumer emotions and psychographics
17.1 Introduction
17.2 Measurements of consumer positive and negative emotions
17.3 Psychographics
17.4 Propensity score analysis
Chapter 18: Measurements of the relative importance of attributes
18.1 Introduction
18.2 Determination of the relative importance of attributes based on averaging over orderings
18.3 Determination of the relative importance of attributes based on variable transformation
18.4 Determination of the relative importance of attributes based on Breiman's RF
18.5 Determination of the relative importance of attributes based on fuzzy measures and the Choquet integral
18.6 Meta-analysis of the relative importances of attributes
18.7 Adaptive penalty analysis combining both directional effects and the relative importance of JAR attributes to overall liking
Appendix A: List of R/S-Plus codes, data files, and packages used in the book
References
Author Index
Subject Index
End User License Agreement
xiii
xiv
xv
xvii
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
Cover
Table of Contents
Preface
Begin Reading
Chapter 2: Measurements of sensory difference/similarity: Thurstonian discriminal distance
Figure 2.1 Theoretical and empirical psychometric functions.
Figure 2.2 Empirical psychometric functions for the Specified and Unspecified “” test with M = N.
Chapter 3: Measurements of sensory difference/similarity: area under ROC curve in Signal Detection Theory
Figure 3.1 Geometric presentation of ROC and indices: in normal-deviate (z) scales, AUC, and Gini-index.
Chapter 4: Difference testing
Figure 4.1 Powers of statistical tests using forced-choice methods. TR, Triangular; 2AFC, Two-Alternative Forced Choice; 3AFC, Three-Alternative Forced Choice; 4AFC, Four-Alternative Forced Choice; DUTR, Duo–Trio; TETU, Unspecified Tetrad; TETS, Specified Tetrad; DUPAR, Dual Pair (4IAX).
Figure 4.2 Powers of statistical tests using methods with response bias
Figure 4.3 Difference testing powers for three versions of ratings methods
Chapter 5: Similarity (equivalence) testing
Figure 5.1 Power of similarity testing using the 2-AFC method .
Figure 5.6 Power of similarity testing using the 2-AFC method (, 100, 200, or 300).
Figure 5.7 Statistical powers of similarity tests using the four forced-choice methods (2-AFC, Duo–Trio, 3-AFC, and Triangular) with , , , with a step of 0.01, and .
Figure 5.8 Powers for similarity tests using the 2-AFC method for , , and sample sizes , based on (a) exact binomial and (b) normal approximation with a continuity correction.
Figure 5.9 Similarity test powers for ratings of three methods (similarity limit ).
Chapter 6: Bayesian approach to discrimination tests
Figure 6.1 Beta prior distributions.
Figure 6.2 Highest posterior density (HPD) interval. Note: For a specified credible level, we can find a density value and two points and in a posterior beta distribution such that the density at and equals and the area (the probability) under the density curve and between and approximately equals the specified credible level.
Figure 6.3 Prior and posterior beta distributions. Note: The prior distribution is a beta distribution with mean and variance . The posterior distribution is a beta distribution with and . The observations are and the sample size is .
Figure 6.4 Posterior distribution of in a 3-AFC test. Note: The prior distribution of
p
is noninformative. The observed number of correct responses is and the total number of responses is
n
= 100. The independent background effect is .
Figure 6.5 Posterior distribution of log-odds ratio . Note: The posterior distribution with = 0.6 and = 0.0804. The posterior probability of is 0.983.
Figure 6.6 Posterior distribution of the difference of proportions . Note: The posterior distribution with and . The posterior probability of is 0.958.
Chapter 7: Modified discrimination tests
Figure 7.1 Powers of conventional and double discrimination methods .
Figure 7.2 Thurstonian model for the 2-AC with both placebo pairs and test pairs.
Figure 7.3 Comparisons of the powers of Two-out-of-Five difference tests with true correct and pseudo-correct responses .
Figure 7.4 Comparison of the powers of Two-out-of-Five similarity tests with true correct and pseudo-correct responses (, similarity limit ).
Chapter 8: Multiple-sample discrimination tests
Figure 8.1 Multiple comparisons in Example 8.4.2.
Figure 8.2 Predicted probabilities of response for product purchase intent.
Figure 8.3 Estimated relative abilities (worths) of products.
Chapter 12: Measurements of sensory thresholds
Figure 12.1 Fit of the logit model to the data in Table 12.1.
Figure 12.2 Fit of the logit model to the data in Table 12.2.
Figure 12.3 Fit of logit model to the data in Table 12.3.
Chapter 13: Measurements of sensory risk with negative sensory effects
Figure 13.1 Log-logit model for BMD and BMDL based on the data in Table 13.1.
Figure 13.2 Log-logit model combining a CBB log-likelihood for BMD and BMDL based on the replicated testing data in Table 13.2.
Figure 13.3 Output of BMDS for the data in Table 13.3.
Figure 13.4 Output of BMDS for the data in Table 13.4.
Chapter 14: Measurements of time intensity
Figure 14.1 T-I curve for the data in Table 14.1.
Figure 14.2 T-I curve and its 95% confidence intervals for the data in Table 14.1.
Figure 14.3 First and second principal curves.
Figure 14.4 Loadings on the first and second principal curves.
Figure 14.5 Three groups of curves and their averages.
Figure 14.6 Values of GCV for selection of the smooth parameter .
Figure 14.7 Fit to function of the data in Table 14.1.
Figure 14.8 Charts for the functional data objects “tidat5fd” and “tidat123fd.”
Figure 14.9 Mean and standard deviation curves for the functional data object “tidat5fd.”
Figure 14.10 First and the second derivatives of the functional data object “cexa.fd.”
Figure 14.11 First and the second derivatives of the mean of the functional data object “tidat5fd.”
Figure 14.12 Ninth curve in the functional data object “tidat5fd.”
Figure 14.13 Estimated variance–covariance surface and contour plot for the bivariate functional data object “tidat5bifd.”
Figure 14.14 First two principal component functions or harmonics in Example 14.5.11.
Figure 14.15 Effect of the first two principal component functions or harmonics as perturbations of the mean curve (the solid line).
Figure 14.16 Scores of nine curves on the first two principal components.
Figure 14.17 First two pairs of canonical weight functions in Example 14.5.2. Solid curves correspond to attribute A and dotted curves to attribute B.
Figure 14.18 Scores for the first two pairs of canonical variables plotted against each other.
Figure 14.19 Regression coefficients estimated in order to predict T-I for all products and for each product invidually.
Figure 14.20 Predicted mean for each product.
Figure 14.21 Predicted mean for each of three products.
Figure 14.22 Fittings for each of three products.
Figure 14.23 Observed pointwise statistic values and critical values for a functional F-test.
Figure 14.24 fANOVA for three groups of curves.
Figure 14.25 Observed pointwise statistic values and critical value for a functional t-test.
Figure 14.26 fANOVA for first derivative curves.
Figure 14.27 fANOVA for second derivative curves.
Chapter 15: Measurements of sensory shelf life
Figure 15.1 Failure curve based on Weibull distribution for the “ssldat” data.
Figure 15.2 Survival curve based on the nonparametric KM model for the “ssldat” data. Note: The thicker line is the estimate and the thinner lines pointwise 95% confidence interval.
Figure 15.3 Failure curve based on Weibull distribution for the “ssldat2” data.
Figure 15.4 Survival curve based on nonparametric KM model for the “ssldat2” data. Note: The thicker line is the estimate and the thinner lines pointwise 95% confidence interval.
Chapter 16: Measurements of the performance of a trained sensory panel and panelists
Figure 16.1 Ratings quality for 10 attributes.
Figure 16.2 Ratings quality for 10 panelists.
Chapter 17: Measurements of consumer emotions and psychographics
Figure 17.1 Loadings of factors on PA and NA items.
Figure 17.2 Multiple comparisons for well-being rating means for the four regions.
Figure 17.3 Approximate weight of evidence (AWE) for clusters and clustering tree.
Figure 17.4 Range of PSs, by treatment group .
Chapter 18: Measurements of the relative importance of attributes
Figure 18.1 Relative importance of attributes in Example 18.2.1.
Figure 18.2 Relative importance of attributes in Example 18.2.2.
Figure 18.3 Relative importance of attributes in Example 18.4.1.
Figure 18.4 Relative importance of attributes in Example 18.4.2.
Figure 18.5 Relative-importance values of JAR attributes to overall liking for a product in Example 18.7.1
Chapter 1: Introduction
Table 1.1 Classifications of sensory discrimination methods
Chapter 2: Measurements of sensory difference/similarity: Thurstonian discriminal distance
Table 2.1
d
′ and
B
value for variance of
d
′ for the 2-AFC method
Table 2.8
d
′ and
B
value for variance of
d
′ for the Dual Pair method
Table 2.9
d
′ and
B
value for variance of
d
′ for the double 2-AFC method
Table 2.12
d
′ and
B
value for variance of
d
′ for the double Triangular method
Table 2.11
d
′ and
B
value for variance of
d
′ for the double Duo–Trio method
Table 2.13 R/S-Plus codes for forced-choice methods
Table 2.14 R/S-Plus codes for A–Not A and Same–Different methods
Table 2.15 Different versions of the “M + N” test
Table 2.16 Theoretical chance probabilities of the “M + N” test
Table 2.17
d
′ and
B
values for the Specified Two-out-of-Five test (“dp321f”)
Table 2.18
d
′ and
B
values for the Unspecified Two-out-of-Five test (“dp322f”)
Table 2.19
d
′ and
B
values for the Unspecified “M + N” with M = 3 and N = 1 test (“dp312”)
Table 2.20
d
′ and B values for the Specified Hexagon test (M = N = 3)(“dp331f”)
Table 2.21
d
′ and B values for the Unspecified Hexagon test (M = N = 3) (“dp332f”)
Table 2.22
d
′ and
B
values for the Specified Octad test (M = N = 4) (“dp441f”)
Table 2.23
d
′ and B values for the Unspecified Octad test (M = N = 4)(“dp442f”)
Table 2.24 R/S-Plus codes used to produce some simulation-derived psychometric functions
Table 2.25
Table 2.26
Chapter 3: Measurements of sensory difference/similarity: area under ROC curve in Signal Detection Theory
Table 3.1 Practical meanings of the values of the measures
Table 3.2
Chapter 4: Difference testing
Table 4.1 2 × 2 Table for data from a monadic A–Not A test
Table 4.2 Data for Example 4.6
Table 4.3 2 × 2 Table for data from a mixed A–Not A test
Table 4.4 Data for Example 4.9
Table 4.6 Data for Example 4.4.5
Table 4.7 Sample sizes (
N
) required for a power of 0.8 using mixed-design A–Not A and Same–Different tests
Table 4.8 Expectations under a null hypothesis
Table 4.9 Expectations under an alternative hypothesis
Table 4.5 2 × 2 Table for data from a paired A–Not A test
Table 4.10 Sample sizes for significance at
α
≤ 0.05 and power at 0.8 using the paired design A–Not A and Same–Different tests (McNemar's test, one-sided test)
Table 4.11 Sample sizes for significance at
α
≤ 0.1 and power at 0.8 using the paired design A–Not A and Same–Different tests (McNemar's test, one-sided test)
Table 4.12 Power efficiencies for discrimination methods (
α
= 0.05, power = 0.8)
Table 4.13 Generalized power efficiencies for forced-choice methods
Chapter 5: Similarity (equivalence) testing
Table 5.1 Critical number selecting a product in similarity testing using the 2-AFC method (α = 0.05)
Table 5.2 Critical number selecting a product in similarity testing using the 2-AFC method (α = 0.1)
Table 5.3 Sample size (pairs) required for similarity testing using the 2-AFC method at 0.8 testing power (α = 0.05)
Table 5.4 Sample size (pairs) required for similarity testing using the 2-AFC method for 0.8 testing power (α = 0.1)
Table 5.5 Powers for similarity testing using the 2-AFC method for , and sample sizes
n
= 57–70
Table 5.6 Sample sizes required to reach 0.8 power for similarity testing using the A–Not A and Same–Different methods (α = 0.05,
h
= 1)
Table 5.7 Sample sizes required to reach 0.8 power for similarity testing using the A–Not A and Same–Different methods (α = 0.1,
h
= 1)
Table 5.8 Critical values for paired
t
-tests for similarity (α = 0.05)
Table 5.9 Overall liking for new and current products (Example 5.7.1)
Table 5.10 Purchase intent for new and current products (Example 5.7.2)
Chapter 6: Bayesian approach to discrimination tests
Table 6.1 Interpretations of the Bayes factor
Table 6.2
Table 6.3 Data of a numerical example for an equivalence test using the A–Not A method
Chapter 7: Modified discrimination tests
Table 7.1 Data for the Gridgeman model
Table 7.2 Expanded Table for the Gridgeman two-stage Triangular test
Table 7.3 Probabilities of weight total
w
T
(
N
= 2)
Table 7.4 Total probabilities of weight total
w
T
(
N
= 2)
Table 7.5 Data for Example 7.1.2
Table 7.6 Ratings in Example 7.2.1
Table 7.7 Frequencies of ratings for 100 subjects in a monadic Degree of Difference test
Table 7.8 Frequencies of ratings for 250 subjects in a mixed Degree of Difference test
Table 7.9 Frequencies of ratings for 100 subjects in a paired Degree of Difference test
Table 7.10 Sample sizes required to reach 0.8 power (with a continuity correction) (
α
= 0.05) in double discrimination methods
Table 7.11 Proportions of correct responses (
p
c
) corresponding to Thurstonian
δ
(or
d
′) in forced-choice methods
Table 7.12 Categories in the two-visit method
Table 7.13 Data for Example 7.4.1
Table 7.14 Data in Example 7.4.11
Chapter 8: Multiple-sample discrimination tests
Table 8.1 Upper
α
point of the range of
m
independent and identically distributed standard normal variables
Table 8.2 Data for Example 8.1.2
Table 8.3 Critical values for the Friedman test
Table 8.4 Upper
α
point of the studentized range distribution with parameter
t
and degrees of freedom
Table 8.5 Rank data for Example 8.2.1
Table 8.6 Ranked data in a BIB design (
t
= 7,
k
= 3,
n
= 7,
r
= 3,
λ
= 1)
Table 8.7 Anderson Table for ranked data
Table 8.8 Ranking numbers of three varieties of snap beans (Example 8.2.3)
Table 8.9 Homeowner rankings of four grasses (Example 8.2.4)
Table 8.10
Table 8.11
Table 8.12 Frequencies for the ratings in Table 8.11 by panelist
Table 8.13 Frequencies for the ratings in Table 8.11 by product
Table 8.14 Predicted probabilities of responses concerning product aftertaste
Table 8.15 Predicted probabilities of response for product purchase intent
Table 8.16 Aggregated forced-choice preference data (number of panelists who prefer a product in a given row to a product in a given column)
Table 8.17 Predicted preference probabilities for each pair among four products
Table 8.18 Data for Example 8.5.2
Table 8.19 Predicted probabilities of preferences
Table 8.20 Table for the Thurstonian model of Torgerson's method of triads
Chapter 9: Replicated discrimination tests: beta-binomial model
Table 9.1 Data for Example 9.1.1
Table 9.2 Data for Example 9.1.2
Table 9.3 Data in Example 9.3.2
Table 9.5 Critical proportions for replicated similarity testing using the paired comparison method (
Δ
= 0.1,
α
= 0.05)
Table 9.4 Numbers preferring sample A in a replicated similarity test using the paired comparison method (
k
= 100,
n
= 4) (Example 9.4.5)
Table 9.6 Data for a replicated monadic A–Not A test (Example 9.4.6)
Table 9.7 Pooled data for a replicated monadic A–Not A test (Example 9.4.6)
Table 9.8 Distribution of purchases of a brand over 12 weeks (Example 9.4.7)
Table 9.9 Sample sizes (
k
) required to reach 0.8 power for a replicated similarity preference test and a nondirectional 2-AFC test (
α
= 0.1)
Chapter 10: Replicated discrimination tests: corrected beta-binomial model
Table 10.1
Table 10.2
Table 10.3 Data for a replicated Triangular test where
n
= 3,
k
= 30 (Example 10.3.5)
Chapter 11: Replicated discrimination tests: Dirichlet–multinomial (DM) model
Table 11.1 Probabilities of a DM distribution (
n
= 3,
m
= 3,
g
= 9,
π
= (0.5, 0.2, 0.3))
Table 11.2 Data for a replicated preference test with a “no preference” option (
k
= 30,
n
= 4,
m
= 3)
Table 11.3 Housing satisfaction data
Table 11.4 Summary of data in Table 11.3
Table 11.5 Frequencies of ratings for 25 subjects with 4 replications in a replicated paired designed Degree of Difference test
Table 11.6 Summary of data in Table 11.5
Table 11.7 Data for a replicated mixed A–Not A test
Table 11.8 Pooled data for a replicated mixed A–Not A test
Table 11.9 Evaluations of three products by 10 panelists with five replications (1 = “ acceptable, ” 0 = “ unaccepTable ”)
Table 11.10 Response patterns for three products (1 = “ acceptable, ” 0 = “ unaccepTable ”)
Table 11.11 Numbers of response patterns for each panelist
Table 11.12 Counts of responses for a CO (Check One) question
Table 11.13 Example of a CATA (Check All That Apply) question
Table 11.14 Responses to the first item, “Morning, just after waking up,” and the last item, “Just before going to bed,” in the CATA question in Table 11.13
Chapter 12: Measurements of sensory thresholds
Table 12.1 Dose response data for Example 12.2.1 (from Bliss 1935, Prentice 1976)
Table 12.2 Data for Example 12.2.2
Table 12.3 Data in Example 12.3.1
Table 12.4 Data for Example 12.3.2 (Hoekstra 1987, Morgan 1992, p. 95)
Table 12.5 Neonatal acute toxicity to trichloromethane (Example 12.4.1)
Table 12.6 Maximum-likelihood parameter estimates and variance–covariance matrix for the parameters of the dose–response model based on a beta-binomial distribution
Table 12.7 Maximum likelihood parameter estimates and variance–covariance matrix for the parameters of the dose–response model based on a CBB distribution
Table 12.8 Data for Examples 12.4.2 and 12.4.3
Chapter 13: Measurements of sensory risk with negative sensory effects
Table 13.1 Consumer response data for Example 13.2.1
Table 13.2 Data summary of consumer responses for Example 13.3.1
Table 13.3 Summarized data for Example 13.4.1
Table 13.4 Summarized data for Example 13.4.2
Chapter 14: Measurements of time intensity
Table 14.1 Data for Example 14.2.1
Table 14.2 Averaged
I
max
values for samples
Table 14.3 Alpha upper quantile for adaptive Neyman test for curves
Chapter 15: Measurements of sensory shelf life
Table 15.1 Data in Example 15.3.1 (“ssldat”) (“ 1 ” = “ yes ”; “ 2 ” = “ no ”)
Table 15.2 Data in Example 15.3.2 (“ssldat2”) (“ 1 ” = “ yes ”; “ 2 ” = “ no ”)
Chapter 16: Measurements of the performance of a trained sensory panel and panelists
Table 16.1 Ratings for
k
random samples given by
n
panelists in a panel
Table 16.2 ANOVA one-way random effects model
Table 16.3 Numerical example for continuous data
Table 16.4 Numerical example for ordinary data
Table 16.5 Ranks of ordinary data
Table 16.6 Numerical example for ranking data
Table 16.7 Numerical example for multiple-choice data
Table 16.8 ICCs (
r
I
) for Cronbach's coefficient alpha (
r
α
) = 0.70 and
n
= 3 : 15
Table 16.9 Ratings quality for 10 attributes
Table 16.10 Ratings quality for 10 panelists
Table 16.11 Format of the data summary for the “ M + N ” experiment
Table 16.12 Probabilities of making correct selections in Fisher's exact test using the specified “ M + N ” method with M = N
Table 16.13 Cumulative probabilities (
p
-values in a one-sided test) of making correct selections in Fisher's exact test using the specified “ M + N ” method with M = N
Table 16.14 Outcome of Fisher's “The Lady Tasting Tea” experiment
Chapter 17: Measurements of consumer emotions and psychographics
Table 17.1 Positive and Negative Affect Schedule (PANAS)
Table 17.2 Arizona Integrative Outcomes Scale (AIOS)
Table 17.3 Consumer involvement profile (CIP)
Table 17.4 Example of CIP
Table 17.5 Price–quality consciousness
Table 17.6 Food neophobia scale (FNS)
Table 17.7 General neophobia scale (GNS)
Table 17.8 Brief Sensation Seeking Scale, 4-Item (BSSS-4)
Table 17.9 Sensation Seeking, 2-Item (SS2)
Table 17.10 General self-efficacy (GSE)
Table 17.11 Medication Adherence Self-Efficacy Scale
Table 17.12 Resilience
Table 17.13 Well-being rating means of the four regions
Table 17.14 Treatment effects on overall liking
Chapter 18: Measurements of the relative importance of attributes
Table 18.1 Relative importance values and 95% confidence intervals of attributes in Example 18.2.1
Table 18.2 Relative importance of attributes to purchase intent in Example 18.2.2
Table 18.3 Relative importance of panel descriptive attributes for consumer overall liking, determined by Breiman's RF
Table 18.4 Data for Example 18.5.1
Table 18.5 Relative-importance vectors and covariance matrices for three studies (Example 18.6.1)
Table 18.6 Group and population relative-importance vectors and covariance matrices
Table 18.7 Results of a penalty analysis for a product (
n
= 120)
Table 18.8 Relative-importance values of JAR attributes to overall liking in Example 18.7.1
Appendix A: List of R/S-Plus codes, data files, and packages used in the book
Table .1 List of R/S-Plus codes used in the book
Table A.2 List of some data files used in the book
Table A.3 List of R/S-Plus packages used in the book
Second Edition
Jian Bi
Sensometrics Research and Service
Richmond, Virginia, USA
This edition first published 2015 © 2015 by John Wiley & Sons, Ltd.
Registered office: John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK
Editorial offices: 9600 Garsington Road, Oxford, OX4 2DQ, UK The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK 111 River Street, Hoboken, NJ 07030-5774, USA
For details of our global editorial offices, for customer services and for information about how to apply for permission to reuse the copyright material in this book please see our website at
www.wiley.com/wiley-blackwell.
The right of the author to be identified as the author of this work has been asserted in accordance with the
UK Copyright, Designs and Patents Act 1988.
All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by the UK Copyright, Designs and Patents Act 1988, without the prior permission of the publisher.
Designations used by companies to distinguish their products are often claimed as trademarks. All brand names and product names used in this book are trade names, service marks, trademarks or registered trademarks of their respective owners. The publisher is not associated with any product or vendor mentioned in this book.
Limit of Liability/Disclaimer of Warranty: While the publisher and author(s) have used their best efforts in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose. It is sold on the understanding that the publisher is not engaged in rendering professional services and neither the publisher nor the author shall be liable for damages arising herefrom. If professional advice or other expert assistance is required, the services of a competent professional should be sought.
Library of Congress Cataloging-in-Publication Data applied for.
ISBN: 9781118733530
A catalogue record for this book is available from the British Library.
Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic books.
Cover image: ©istock.com/traffic_analyzer
To my family
The second edition of this book is similar to the first in that it focuses on sensory tests and measurements from a statistical perspective. However, it expands greatly upon the first in the following aspects:
The second edition extends the discussion of sensory measurement from Thurstonian discriminal distance
(
Chapter 2
) to the area (
R
-index and Gini-index) under the ROC curve in Signal Detection Theory (
Chapter 3
) and to wider sensory measurements, including sensory threshold (
Chapter 12
), sensory risk (
Chapter 13
), time-intensity (
Chapter 14
), sensory shelf life (
Chapter 15
), the performance of a trained sensory panel and panelists (
Chapter 16
), consumer emotions and psychographics (
Chapter 17
), and the relative importance of attributes (
Chapter 18
).
The second edition extends the discussion of sensory discrimination tests from main difference tests (
Chapter 4
) to similarity/equivalence tests (
Chapter 5
) and Bayesian tests (
Chapter 6
).
Chapters 7
–
11
discuss novel methods for modified discrimination tests, multiple-sample discrimination tests, and replicated discrimination tests.
More R/S-Plus built-in programs, packages, and codes are used in the second edition. Some of the tables for statistical tests used in the first edition are replaced by R/S-Plus codes. The R/S-Plus codes and some of the data files used in the book are listed in Tables A.
1
and A.
2
in Appendix A and are available from the companion Web site,
www.wiley.com/go/bi/SensoryDiscrimination
. The R packages (R Development Core Team 2013) used in the book are listed in Table A.
3
in Appendix A and can be downloaded from
www.r-project.org
.
The title of the book has been changed to reflect the expanded and changed contents of the second edition. The title of the first edition was
Sensory Discrimination Tests and Measurements: Statistical Principles, Procedures, and Tables
, while is the title of the second edition is
Sensory Discrimination Tests and Measurements: Sensometrics in Sensory Evaluation
.
The book is organized as follows:
Chapter 1
briefly describes sensory discrimination methods.
Chapter 2
and
3
discuss sensory effect measurement using distance index, Thurstonian
, and the area indices
R
-index and Gini-index.
Chapter 4
–
6
discuss sensory discrimination tests, including difference testing, similarity (equivalence) testing, and the Bayesian approach to discrimination testing.
Chapter 7
and
8
discuss modified and multiple-sample discrimination tests.
Chapter 9
–
11
discuss replicated discrimination tests based on the beta-binomial (BB) model, the corrected beta-binomial (CBB) model, and the Dirichlet–multinomial (DM) model, respectively.
Chapters 12
–
18
discuss diverse and specific sensory measurements in a broad sense, from measurements of sensory threshold (
Chapter 12
) to measurements of the relative importance of attributes (
Chapter 18
).
The assumed readers of the book are researchers and practitioners in the sensory and consumer field, as well as anyone who is interested in sensometrics. The book is intended to be a useful reference for modern sensory analysis and consumer research, especially for sensometrics. It is different in its objective from the textbooks widely used in the sensory field (e.g., Amerine et al. 1965, Stone and Sidel 2004, Meilgaard et al. 2006, Lawless and Heymann 2010) and from common guidebooks (e.g., Chambers and Wolf 1996, Kemp et al. 2009). It is also different in perspective and focus from the books on quantitative sensory analysis and applied statistics in sensory and consumer research (e.g., O'Mahony 1986b, Næs and Risvik 1996, Meullenet et al. 2007, Mazzocchi 2008, Gacula et al. 2009, Næs et al. 2010, Lawless 2013), although it has some topics in common with these.
Amerine, M. A., Pangborn, R. M., and Roessler, E. B. 1965.
Principles of Sensory Evaluation of Food
. Academic Press, New York.
Chambers, E. VI and Wolf, M. B. 1996.
Sensory Testing Methods
(2nd ed.). ASTM Manual Series MNL. ASTM International, West Conshohocken, PA.
Gacula, M., Singh, J., Bi, J., and Altan, S. 2009.
Statistical Methods in Food and Consumer Research
(2nd ed.). Elsevier/Academic Press, Amsterdam.
Kemp, S. E., Hollowood, T., and Hort, J. 2009.
Sensory Evaluation. A Practical Handbook
. John Wiley & Sons, Chichester.
Lawless, H. T. 2013.
Quantitative Sensory Analysis: Psychophysics, Models and Intelligent Design
. Wiley-Blackwell, Oxford.
Lawless, H. T. and Heymann, H. 2010.
Sensory Evaluation of Food: Principles and Practice
(2nd ed.). Springer, New York.
Mazzocchi, M. 2008.
Sensory for Marketing and Consumer Research
. Sage, Los Angeles, CA.
Meilgaard, M. C., Civille, G. V., and Carr B. T. 2006.
Sensory Evaluation Technique
(4th ed.). CRC Press, Boca Raton, FL.
Meullenet, J. F., Xiong, R., and Findlay, C. J. 2007.
Multivariate and probabilistic Analysis of Sensory Science Problems
. Blackwell, Ames, IA.
Næs, T. and Risvik, E. 1996.
Multivariate Analysis of Data in Sensory Science
. Elsevier, Amsterdam.
Næs, T., Brockhoff, P. B., and Tomic, O. 2010.
Statistics for Sensory and Consumer Science
. John Wiley & Sons, Chichester.
R Development Core Team. 2013.
R: A Language and Environment for Statistical Computing
. R Foundation for Statistical Computing, Vienna. Available from:
http://www.R-project.org
/ (last accessed April 14, 2015).
Stone, H. and Sidel, J. 2004.
Sensory Evaluation Practices
(3rd ed.). Academic Press, Amsterdam.
I am grateful to Professor Michael O'Mahony, Professor Hye-Seong Lee, Dr. Carla Kuesten, Dr. Herbert Meiselman, Julia Chung, Yaohua Feng, and Pooja Chopra, who are the co-authors of our papers published in recent years.
I would like to thank the Wiley editor David McDade and project editor Audrie Tan for their encouragement in the completion of this project.
I wish to dedicate this book to my wife, Yulin, and my daughter, Cindy.
Jian Bi
Richmond, Virginia
December 2014
This book is accompanied by a companion website:
www.wiley.com/go/bi/SensoryDiscrimination
The website includes:
R/S-Plus codes for downloading
Data files for downloading
The lists of R/S-Plus codes and data files available on the website are provided in Appendix A on page (insert page number) of this book.
This book is about sensometrics, focusing on sensory discrimination tests and measurements in the domain of sensory analysis. Sensometrics is a subfield of, or an area related to, sensory and consumer science. According to Brockhoff (2011), “Sensometrics is the scientific area that applies mathematical and statistical methods to model data from sensory and consumer science.” It is similar to psychometrics in psychology, biometrics in biology, chemometrics in chemistry, econometrics in economy, politimetrics in macropolitics, environmetrics in environmental sciences, and so on. Sensometrics has experienced rapid growth in both academia and industry within the last 2 or 3 decades. It plays an important role in modern sensory analysis and consumer research, especially in the coming Big Data era.
The basic functions of sensory analysis are to provide reliable sensory measurements and to conduct valid tests. Statistical hypothesis testing is the theoretical basis of sensory tests. Statistical tests include both difference tests and similarity (equivalence) tests. The Thurstonian model (Thurstone 1927) and Signal Detection Theory (SDT) (Green and Swets 1966, Macmillan and Creelman 2005) are the theoretical basis for sensory effect measurement. Psychometric functions provide invariable indices that are independent of the methods used for measurements. Notably, the Thurstonian discriminal distance (or ) (ASTM 2012) and the area (R-index) under the receiver operating characteristic (ROC) curve in SDT have been widely accepted and are used in both food and sensory fields. Daniel M. Ennis (1993, 1998, 2003) and Michael O'Mahony (1979, 1992), among others, should be particularly thanked for their insight and foresight in introducing the methodologies into these fields and for tirelessly promoting their research and application over recent decades.
Sensory measurement takes on a broad range of meanings and contents. Besides sensory effect measurement using Thurstonian discriminal distance and area under ROC curve, the following measurements can also be regarded as different types of sensory measurement: sensory threshold measurement, sensory risk measurement, time intensity measurement, sensory shelf life measurement, trained sensory panel/panelist performance measurement, consumer emotions and psychographics measurement, and attribute relative importance measurement. Specific statistical methodologies are used for different types of sensory measurement.
Sensory analysis can be divided into two types: laboratory sensory analysis and consumer sensory analysis. In laboratory sensory analysis, a trained panel is used as an analytical instrument to measure the sensory properties of products. In consumer sensory analysis, a sample of a specified consumer population is used to test and predict consumer responses to products. These have different goals and functions, but share some methodologies.
Discriminative analysis and descriptive analysis are the main classes of methodology for both laboratory and consumer sensory analyses. Discriminative analysis includes discrimination tests and measurements. In this book, discrimination tests are used to determine whether a difference exists between treatments for confusable sensory properties of products (difference test), or whether a difference is smaller than a specified limit (similarity/equivalence test), usually using a two-point scale or a rating or ranking scale. Discrimination measurements are used to measure, on a particular index, the extent of the difference/similarity. There are two sources of sensory differences: intensity and preference. A discrimination test is used when testing difference/similarity of intensity; a preference test is used when testing difference/similarity of preference. Descriptive analysis is used to determine, on a rating scale, how much of a specific characteristic difference exists among products (quantitative descriptive analysis) or to characterize a product's sensory attributes (qualitative descriptive analysis). Quantitative descriptive analysis for preference is also called “acceptance testing.”
Acceptance or preference testing is of very limited value for a laboratory panel (Amerine et al. 1965) but is valuable in a consumer analysis setting. Laboratory discrimination testing, using a trained panel under controlled conditions, is referred to as “Sensory Evaluation I,” while consumer discrimination testing, using a sample of untrained consumers under ordinary consumption (eating) conditions, is referred to as “Sensory Evaluation II” (O'Mahony 1995). Confusion of the two will lead to misleading conclusions. Controversy over whether the consumer can be used for discrimination testing ignores the fact that laboratory and consumer discrimination tests have different goals and functions.
The distinction between discriminative analysis and quantitative descriptive analysis is not absolute from the viewpoint of modern sensory analysis. The Thurstonian model and SDT (see Chapters 2, 3) can be used for both discriminative analysis and quantitative descriptive analysis. The Thurstonian (or ), a measure of sensory difference/similarity, can be obtained from any kind of scale used in discriminative and descriptive analysis. A rating scale, typically used in descriptive analysis, is also used in some modified discrimination tests.
The following types of analysis are the important topics and methodologies of sensory analysis: sensory threshold analysis, sensory risk analysis, time intensity analysis, sensory shelf life analysis, trained sensory panel/panelist performance analysis, consumer emotions and psychographics analysis, and sensory attribute relative importance analysis.
This book is primarily concerned with methodology, mainly from a statistical point of view, of sensory discrimination tests and measurements, including laboratory and consumer sensory analyses.
In this book, a distinction is made among three terms: “sensory discrimination method,” “sensory discrimination test,” and “sensory discrimination measurement.”
In sensory discriminative analysis, certain procedures are used for experiments. These procedures are called discrimination methods (e.g., the Duo–Trio method, the Triangular method, the ratings method).
When discrimination procedures are used for statistical hypothesis testing, or when statistical testing is conducted for the data from a discrimination procedure, the procedure is called discrimination testing (e.g., the Duo–Trio test, the Triangular test, the ratings test). In this book, discrimination testing is referred to as both difference testing and similarity/equivalence testing for both preference and intensity (Chapters 4, 5). Bayesian statistical tests are also discussed, in Chapter 6. In Chapter 7, some modified discrimination tests are discussed. Multiple-sample discrimination tests are discussed in Chapter 8. Replicated discrimination tests are discussed in Chapters 9–11.
When discrimination procedures are used to measure, or, in other words, when an index (e.g., Thurstonian (or ) or R-index) is produced using the data from a discrimination procedure, the procedure is called a discrimination measurement (e.g., Duo–Trio measurement, Triangular measurement, ratings of the A–Not A measurement). Effect measurement includes distance measure and area measure R-index (or Gini-index). Besides the effect measurement discussed in Chapters 2, 3, other sensory measurements are discussed in Chapters 12–18. Both sensory testing and measurement are of importance and are useful. However, generally speaking, sensory measurement is more important and more useful in practice. Sensory measurements provide indices of the magnitude of sensory effects.
The Two-Alternative Forced Choice (2-AFC) method (Green and Swets 1966): This method is also called the paired comparison method (Dawson and Harris 1951, Peryam 1958). With this method, the panelist receives a pair of coded samples, A and B, for comparison on the basis of some specified sensory characteristic. The possible pairs are AB and BA. The panelist is asked to select the sample with the strongest (or weakest) sensory characteristic. The panelist is required to select one even if he or she cannot detect the difference.
The Three-Alternative Forced Choice (3-AFC) method (Green and Swets 1966): Three samples of two products, A and B, are presented to each panelist. Two of them are the same. The possible sets of samples are AAB, ABA, BAA or ABB, BAB, BBA. The panelist is asked to select the sample with the strongest or the weakest characteristic. The panelist has to select a sample even if he or she cannot identify the one with the strongest or the weakest sensory characteristic.
The Four-Alternative Forced Choice (4-AFC) method (Swets 1959): Four samples of two products, A and B, are presented to each panelist. Three of them are the same. The possible sets of samples are AAAB, AABA, ABAA, BAAA or BBBA, BBAB, BABB, ABBB. The panelist is asked to select the sample with the strongest or the weakest characteristic. The panelist is required to select a sample even if he or she cannot identify the one with the strongest or weakest sensory characteristic.
The Triangular (Triangle) method (Dawson and Harris 1951, Peryam 1958): Three samples of two products, A and B, are presented to each panelist. Two of them are the same. The possible sets of samples are AAB, ABA, BAA, ABB, BAB, and BBA. The panelist is asked to select the odd sample. The panelist is required to select one sample even if he or she cannot identify the odd one.
The Duo–Trio method (Dawson and Harris 1951, Peryam 1958): Three samples of two products, A and B, are presented to each panelist. Two of them are the same. The possible sets of samples are A: AB, A: BA, B: AB, and B: BA. The first one is labeled as the “control.” The panelist is asked which of the two test samples is the same as the control sample. The panelist is required to select one sample to match the “control” sample even if he or she cannot identify which is the same as the control.
The Unspecified Tetrad method (Lockhart 1951): Four stimuli, two of A and two of B, are used, where A and B are confusable and vary in the relative strengths of their sensory attributes. Panelists are told that there are two pairs of putatively identical stimuli and to sort them into their pairs.
The Specified Tetrad method (Wood 1949): Four stimuli, two of A and two of B, are used, where A and B are confusable and vary in the relative strengths of their sensory attributes. Panelists are told that there are two pairs of putatively identical stimuli and to indicate the two stimuli of specified A or B.
The Dual Pair (4IAX) method (Macmillan
et al
. 1977): Two pairs of samples are presented simultaneously to the panelist. One pair is composed of samples of the same stimuli, AA or BB, while the other is composed of samples of different stimuli, AB or BA. The panelist is told to select the most different pair of the two pairs.
The “
” method (Lockhart 1951):
samples with M sample A and N sample B are presented. The panelist is told to divide the samples into two groups, of A and B. There are two versions of the method: specified and unspecified. This is a generalization of many forced-choice discrimination methods, including the Multiple-Alternative Forced Choice (m-AFC), Triangle, and Specified and Unspecified Tetrad. The “
” with larger M and N can be regarded as a specific discrimination method with a new model. Unlike the conventional difference tests using the “
” with small M and N based on a binomial model, the “
” with larger M and
can reach a statistical significance in a single trial for only one “
” sample set based on a hypergeometric model. The methods that use a new model are particularly useful for assessing the discriminability of sensory panels and panelists; these are discussed in Chapter 16.5.
The A–Not A method (Peryam 1958): Panelists are familiarized with samples A and Not A. One sample, which is either A or Not A, is presented to each panelist. The panelist is asked if the sample is A or Not A.
The A–Not A with Remind (A–Not AR) method (Macmillan and Creelman 2005): Unlike the A–Not A, which is a single-sample presentation, a reminder (e.g., sample A) is provided before each test sample (sample A or Not A) in order to jog the panelist's memory.
The Same–Different method (see, e.g., Pfaffmann 1954, Amerine
et al
. 1965, Macmillan
et al
. 1977, Meilgaard
et al
. 1991, among others, for the same method under different names): A pair of samples, A and B, is presented to each panelist. The four possible sample pairs are AA, BB, AB, and BA. The panelist is asked if the two samples that he or she received are the same or different.
The ratings methods discussed in the book include ratings of the A–Not A, A–Not AR, and Same–Different methods.
Sensory discrimination methods are typically classified according to the number of samples presented for evaluation, as single-sample (stimulus), two-sample, three-sample, or multiple-sample methods. This classification is natural, but it does not reflect the inherent characteristics of the methods. In this book, the discrimination methods are classified according to the decision rules and cognitive strategies they involve. This kind of classification may be more reasonable and profound. In the following chapters, we will see how methods in the same class correspond to the same types of statistical model and decision rules.
There are two different types of instruction in the discrimination method. One type involves asking the panelists to indicate the nature of difference in the products under evaluation; for example, “Which sample is sweeter?” (the 2-AFC and the 3-AFC methods); or “Is the sample A or Not A?” (the A–Not A method). The other type compares the distance of difference; for example, “Which of the two test samples is the same as the control sample?” (the Duo–Trio method); “Which of these three samples is the odd one out?” (the Triangular method); or “Are these two samples the ‘same’ or ‘different’?” (the Same–Different method). The two types involve different cognitive strategies and result in different percentages of correct responses. Hence, the discrimination methods can be divided into these two types: methods using the “skimming” strategy and methods using the “comparison of distance” strategy (O'Mahony et al. 1994). The two types of methods can also called specified or unspecified method.
Response bias is a basic problem with sensory discrimination methods. Many authors hav eaddressed this problem (e.g. Torgerson 1958, Green and Swets 1966, Macmillan and Creelman 2005, O'Mahony 1989, 1992, 1995). Sensory discrimination methods are designed for the detection and measurement of confusable sensory differences. There is no response bias if the difference is large enough, but response bias may occur when the difference between two products is so small that a panelist makes an uncertain judgment. In this situation, how large a difference can be judged as a difference may play a role in the decision process. Criterion variation (strictness or laxness of a criterion) causes response bias. A response bias is a psychological tendency to favor one side of a criterion. Response bias is independent of sensitivity. This is why the methods with response bias (e.g., the A–Not A and the Same–Different methods) can also be used for difference testing. However, response bias affects test effectiveness (power).
Forced-choice procedures can be used to stabilize decision criteria. Hence, most sensory discrimination methods are designed as forced-choice procedures. A forced-choice procedure must have at least three characteristics: (1) Two sides of a criterion must be presented. The two sides may be “strong” and “weak,” if the criterion is about the nature of the difference between products. The two sides may be “same” and “different,” if the criterion is about the distance of the difference between products. Because a single sample or two samples of the same type cannot contain two sides of a criterion, evaluating a single sample or the same type of sample is not a forced-choice procedure. Because a single pair of samples or a pair of samples of the same type cannot contain two sides of a criterion concerning the distance of a difference, evaluating a single sample pair or a pair of samples of the same type is not a forced-choice procedure, either. (2) A panelist should be instructed that the samples presented for evaluation contain the two sides of a criterion. (3) A response must be given in terms of one clearly defined category. The “don't know” response is not allowed.
In conventional discrimination tests using forced-choice methods, such as the “” method with small M and N, we cannot get a statistical conclusion from a response for only one set of samples, because even for the perfect response for a set of the samples, the chance probability (e.g., 1/3 in the 3-AFC) is still larger than any acceptable significance level. Hence, multiple sets of “” samples are needed. A binomial model is used for analysis of the proportion of correct responses. However, we can get a conclusion based on responses in a table for only one set of “” samples with larger M and N in a Fisher's exact test.
The responses in forced-choice methods are binary. The responses in the methods with response bias may be binary or ratings. The ratings of the methods represent degrees of sureness of a judgment or different decision criteria. For example, the responses in an A–Not A test are “A”/“Not A” (i.e., 1 or 2). The responses in a ratings of the A–Not A test may be a six-point scale with (1, 2, 3, 4, 5, 6) corresponding to (A, A?, A??, N??, N?, N).
Table 1.1 describes the classifications of sensory discrimination methods.
Table 1.1 Classifications of sensory discrimination methods
Requiring the nature of difference
Comparing distance of difference
Forced-choice methods
Based on multiple sets of samples
2-AFC
Duo–Trio
3-AFC
Triangular
4-AFC
Unspecified Tetrad
Specified Tetrad
Dual-Pair (4IAX)
Based on one set of samples
Specified “
” with larger M and N
Unspecified “
” with larger M and N
Methods with response bias
Binary response
A–Not A
Same–Different
A–Not AR
Ratings response
Ratings of A–Not A
Ratings of Same–DifferentRatings of A–Not AR
