Leverage and influential observations on the Liu type estimator in the linear regression model with the severe collinearity
- PMID: 34458624
- PMCID: PMC8379455
- DOI: 10.1016/j.heliyon.2021.e07792
Leverage and influential observations on the Liu type estimator in the linear regression model with the severe collinearity
Abstract
In the process of building a linear regression model, the essential part is to identify influential observations. Various influence measures involving Cook's distance and DFFITS are designed to detect the linear regression's influential observations using the Least Squares (LS). The existence of influential observations in the data is complicated by the presence of severe collinearity and affects the efficiency of the detection measures. This paper proposes new diagnostic methods based on the Liu type estimator (LTE) defined by Liu [1]. The Cook's distance and DFFITS for the LTE are introduced. Moreover, approximate formulas for Cook's distance and DFFITS are also proposed for LTE. Two real data sets with a high level of multicollinearity among the explanatory variables as well as the simulation study are used to illustrate and evaluate performance of the methodologies presented in this paper.
Keywords: Cook's distance; DFFITS; Leverage; Multicollinearity; Ridge regression.
© 2021 The Author(s).
Conflict of interest statement
The authors declare no conflict of interest.
Figures
Similar articles
-
PERTURBATION AND SCALED COOK'S DISTANCE.Ann Stat. 2012;40(2):785-811. doi: 10.1214/12-AOS978. Ann Stat. 2012. PMID: 23152646 Free PMC article.
-
Cook's Distance Measures for Varying Coefficient Models with Functional Responses.Technometrics. 2015 May;57(2):268-280. doi: 10.1080/00401706.2014.914978. Technometrics. 2015. PMID: 26257438 Free PMC article.
-
Regression Diagnostic under Model Misspecification.J Appl Stat. 2007 Jun 29;34(5):563-575. doi: 10.1080/02664760701235014. eCollection 2007. J Appl Stat. 2007. PMID: 38817915
-
Multivariate Outliers: A Conceptual and Practical Overview for the Nurse and Health Researcher.Can J Nurs Res. 2021 Sep;53(3):316-321. doi: 10.1177/0844562120932054. Epub 2020 Jun 10. Can J Nurs Res. 2021. PMID: 32522115 Review.
-
A comprehensive review of group level model performance in the presence of heteroscedasticity: Can a single model control Type I errors in the presence of outliers?Neuroimage. 2017 Feb 15;147:658-668. doi: 10.1016/j.neuroimage.2016.12.058. Epub 2016 Dec 25. Neuroimage. 2017. PMID: 28030782 Free PMC article. Review.
Cited by
-
Genetic associations between circulating immune cells and periodontitis highlight the prospect of systemic immunoregulation in periodontal care.Elife. 2024 Mar 27;12:RP92895. doi: 10.7554/eLife.92895. Elife. 2024. PMID: 38536078 Free PMC article.
-
Cross-sectional reference values of cerebral ventricle for Chinese neonates born at 25-41 weeks of gestation.Eur J Pediatr. 2022 Oct;181(10):3645-3654. doi: 10.1007/s00431-022-04547-z. Epub 2022 Aug 17. Eur J Pediatr. 2022. PMID: 35978254
References
-
- Liu K. Using Liu-type estimator to combat collinearity. Commun. Stat., Theory Methods. 2003;32(5):1009–1020.
-
- Hoaglin D.C., Welsch R.E. The hat matrix in regression and ANOVA. Am. Stat. 1978;32(1):17–22.
-
- Velleman P.F., Welsch R.E. Efficient computing of regression diagnostics. Am. Stat. 1981;35(4):234–242.
-
- Cook R.D. Detection of influential observation in linear regression. Technometrics. 1977;19(1):15–18.
-
- Belsley D.A., Kuh E., Welsch R.E. John Wiley & Sons; 1980. Regression Diagnostics: Identifying Influential Data and Sources of Collinearity.
LinkOut - more resources
Full Text Sources
