16978.pdf

Mixed or multilevel models exploit random effects to deal with hierarchical data, where statistical units are clustered in groups and cannot be assumed as independent. Sometimes, the assumption of linear dependence of a response on a set of explanatory variables is not plausible, and model specifica...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Γλώσσα:English
Έκδοση: Firenze University Press 2022
Διαθέσιμο Online:https://books.fupress.com/doi/capitoli/978-88-5518-304-8_7
id oapen-20.500.12657-56339
record_format dspace
spelling oapen-20.500.12657-563392022-06-02T03:25:45Z Chapter Random effects regression trees for the analysis of INVALSI data VANNUCCI, GIULIA GOTTARD, ANNA Grilli, Leonardo Rampichini, Carla Regression trees Multilevel models Random effects Hierarchical data Mixed or multilevel models exploit random effects to deal with hierarchical data, where statistical units are clustered in groups and cannot be assumed as independent. Sometimes, the assumption of linear dependence of a response on a set of explanatory variables is not plausible, and model specification becomes a challenging task. Regression trees can be helpful to capture non-linear effects of the predictors. This method was extended to clustered data by modelling the fixed effects with a decision tree while accounting for the random effects with a linear mixed model in a separate step (Hajjem & Larocque, 2011; Sela & Simonoff, 2012). Random effect regression trees are shown to be less sensitive to parametric assumptions and provide improved predictive power compared to linear models with random effects and regression trees without random effects. We propose a new random effect model, called Tree embedded linear mixed model, where the regression function is piecewise-linear, consisting in the sum of a tree component and a linear component. This model can deal with both non-linear and interaction effects and cluster mean dependencies. The proposal is the mixed effect version of the semi-linear regression trees (Vannucci, 2019; Vannucci & Gottard, 2019). Model fitting is obtained by an iterative two-stage estimation procedure, where both the fixed and the random effects are jointly estimated. The proposed model allows a decomposition of the effect of a given predictor within and between clusters. We will show via a simulation study and an application to INVALSI data that these extensions improve the predictive performance of the model in the presence of quasi-linear relationships, avoiding overfitting, and facilitating interpretability. 2022-06-01T12:19:54Z 2022-06-01T12:19:54Z 2021 chapter ONIX_20220601_9788855183048_524 2704-5846 9788855183048 https://library.oapen.org/handle/20.500.12657/56339 eng Proceedings e report application/pdf Attribution 4.0 International 16978.pdf https://books.fupress.com/doi/capitoli/978-88-5518-304-8_7 Firenze University Press 10.36253/978-88-5518-304-8.07 10.36253/978-88-5518-304-8.07 bf65d21a-78e5-4ba2-983a-dbfa90962870 9788855183048 127 6 Florence open access
institution OAPEN
collection DSpace
language English
description Mixed or multilevel models exploit random effects to deal with hierarchical data, where statistical units are clustered in groups and cannot be assumed as independent. Sometimes, the assumption of linear dependence of a response on a set of explanatory variables is not plausible, and model specification becomes a challenging task. Regression trees can be helpful to capture non-linear effects of the predictors. This method was extended to clustered data by modelling the fixed effects with a decision tree while accounting for the random effects with a linear mixed model in a separate step (Hajjem & Larocque, 2011; Sela & Simonoff, 2012). Random effect regression trees are shown to be less sensitive to parametric assumptions and provide improved predictive power compared to linear models with random effects and regression trees without random effects. We propose a new random effect model, called Tree embedded linear mixed model, where the regression function is piecewise-linear, consisting in the sum of a tree component and a linear component. This model can deal with both non-linear and interaction effects and cluster mean dependencies. The proposal is the mixed effect version of the semi-linear regression trees (Vannucci, 2019; Vannucci & Gottard, 2019). Model fitting is obtained by an iterative two-stage estimation procedure, where both the fixed and the random effects are jointly estimated. The proposed model allows a decomposition of the effect of a given predictor within and between clusters. We will show via a simulation study and an application to INVALSI data that these extensions improve the predictive performance of the model in the presence of quasi-linear relationships, avoiding overfitting, and facilitating interpretability.
title 16978.pdf
spellingShingle 16978.pdf
title_short 16978.pdf
title_full 16978.pdf
title_fullStr 16978.pdf
title_full_unstemmed 16978.pdf
title_sort 16978.pdf
publisher Firenze University Press
publishDate 2022
url https://books.fupress.com/doi/capitoli/978-88-5518-304-8_7
_version_ 1771297452394020864