26262.pdf

When dealing with a wine, it is of interest to be able to predict its quality based on chemical and/or sensory variables. There is no agreement on what wine quality means, or how it should be assessed and it is often viewed in intrinsic (physicochemical, sensory) or extrinsic (price, prestige, conte...

Bibliographic record details
Language: English
Published: Firenze University Press 2022
Available online: https://books.fupress.com/doi/capitoli/978-88-5518-461-8_44
id oapen-20.500.12657-56372
record_format dspace
spelling oapen-20.500.12657-56372 2022-06-02T03:26:14Z Chapter Prediction of wine sensorial quality: a classification problem Carpita, Maurizio GOLIA, Silvia wine quality categorical classifier Bayes classifier 2022-06-01T12:21:01Z 2022-06-01T12:21:01Z 2021 chapter ONIX_20220601_9788855184618_557 2704-5846 9788855184618 https://library.oapen.org/handle/20.500.12657/56372 eng Proceedings e report application/pdf Attribution 4.0 International 26262.pdf https://books.fupress.com/doi/capitoli/978-88-5518-461-8_44 Firenze University Press 10.36253/978-88-5518-461-8.44 10.36253/978-88-5518-461-8.44 bf65d21a-78e5-4ba2-983a-dbfa90962870 9788855184618 132 4 Florence open access
institution OAPEN
collection DSpace
language English
description When dealing with a wine, it is of interest to be able to predict its quality from chemical and/or sensory variables. There is no agreement on what wine quality means or how it should be assessed, and it is often viewed in intrinsic (physicochemical, sensory) or extrinsic (price, prestige, context) terms (Jackson, 2017). In this paper, wine quality was evaluated by experienced judges who scored the wine on a 0-10 scale, with 0 meaning very bad and 10 excellent, so the resulting variable was categorical. The models applied to predict this variable provide the predicted probability of occurrence of each of its categories. Alongside this vector of probabilities, however, practitioners need the predicted value (category) of the variable, so the statistical problem addressed here is how to transform the vector of probabilities into a single value. In this paper we compare the predictive performance of the default method (Bayes Classifier, BC), which assigns a unit to the most likely category, with that of two other methods (Maximum Difference Classifier and Maximum Ratio Classifier). The BC is the optimal criterion if one is interested in the accuracy of the classification but, because it favors the most prevalent category, it is not necessarily the best choice when there is no category of interest. The data under study concern the quality of the red variant of the Portuguese "Vinho Verde" wine (Cortez et al., 2009), measured on a 0-10 scale. However, only 6 scores were used, 2 of them with very few observations, so this is a suitable context for comparing predictive performance. In the study, we investigated different ways of merging categories and used 11 explanatory variables to estimate the vector of probabilities of the wine quality variable.
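The step the description refers to, turning a vector of predicted category probabilities into a single predicted category, can be sketched as follows. The Bayes Classifier rule (pick the most likely category) is stated in the abstract. The abstract does not define the Maximum Difference and Maximum Ratio classifiers, so the rules below are an assumption: they compare each predicted probability with the category's prevalence, by difference and by ratio respectively. The function names, the three-category setup, and all numbers are hypothetical illustrations, not data or code from the chapter.

```python
def argmax(values):
    """Index of the largest value (first one wins on ties)."""
    return max(range(len(values)), key=lambda k: values[k])

def bayes_classifier(probs):
    # Rule stated in the abstract: assign the most likely category.
    return argmax(probs)

def max_difference_classifier(probs, prevalence):
    # ASSUMED rule: largest gap between predicted probability and prevalence.
    return argmax([p - q for p, q in zip(probs, prevalence)])

def max_ratio_classifier(probs, prevalence):
    # ASSUMED rule: largest ratio of predicted probability to prevalence.
    return argmax([p / q for p, q in zip(probs, prevalence)])

# Hypothetical prevalence of three quality categories (category 1 dominates).
prevalence = [0.05, 0.75, 0.20]

# Hypothetical predicted probability vectors for three wines.
wines = [
    [0.10, 0.60, 0.30],
    [0.15, 0.70, 0.15],
    [0.02, 0.58, 0.40],
]

bc = [bayes_classifier(p) for p in wines]
mdc = [max_difference_classifier(p, prevalence) for p in wines]
mrc = [max_ratio_classifier(p, prevalence) for p in wines]

print("BC :", bc)
print("MDC:", mdc)
print("MRC:", mrc)
```

In this toy example the Bayes Classifier always returns the prevalent category, while the two prevalence-adjusted rules can select rarer categories, which is the behavior the description attributes to the BC.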
title 26262.pdf
spellingShingle 26262.pdf
title_short 26262.pdf
title_full 26262.pdf
title_fullStr 26262.pdf
title_full_unstemmed 26262.pdf
title_sort 26262.pdf
publisher Firenze University Press
publishDate 2022
url https://books.fupress.com/doi/capitoli/978-88-5518-461-8_44
_version_ 1771297601206878208