Heliyon. 2021 Aug 17;7(8):e07792.
doi: 10.1016/j.heliyon.2021.e07792. eCollection 2021 Aug.

Leverage and influential observations on the Liu type estimator in the linear regression model with the severe collinearity

Hussein Eledum. Heliyon.

Abstract

Identifying influential observations is an essential part of building a linear regression model. Influence measures such as Cook's distance and DFFITS were designed to detect influential observations under Least Squares (LS) estimation. Severe collinearity complicates the detection of influential observations and reduces the efficiency of these measures. This paper proposes new diagnostic methods based on the Liu type estimator (LTE) defined by Liu [1]. Cook's distance and DFFITS are introduced for the LTE, together with approximate formulas for both measures. Two real data sets with a high level of multicollinearity among the explanatory variables, as well as a simulation study, are used to illustrate and evaluate the performance of the proposed methods.
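
For orientation, the classical LS versions of the diagnostics named in the abstract can be sketched as follows. This is a minimal illustration using the standard textbook formulas for leverage, Cook's distance and DFFITS. It is not the LTE-based measures proposed in the paper, which the abstract does not spell out. The function name `ls_influence` and the synthetic data are assumptions made for the example.

```python
import numpy as np

def ls_influence(X, y):
    """Leverage, Cook's distance and DFFITS for an ordinary LS fit.

    X is an (n, p) design matrix that already includes an intercept
    column if one is wanted; y is a length-n response vector.
    (Hypothetical helper, written only for this illustration.)
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = X.shape

    # Thin QR: the hat-matrix diagonal is the row-wise sum of squares of Q.
    Q, R = np.linalg.qr(X)
    leverage = np.sum(Q**2, axis=1)

    # LS coefficients, residuals and residual variance.
    beta = np.linalg.solve(R, Q.T @ y)
    resid = y - X @ beta
    s2 = resid @ resid / (n - p)

    # Cook's distance: D_i = e_i^2 h_ii / (p s^2 (1 - h_ii)^2).
    cooks_d = resid**2 * leverage / (p * s2 * (1 - leverage) ** 2)

    # Leave-one-out variance, externally studentized residual, DFFITS.
    s2_loo = ((n - p) * s2 - resid**2 / (1 - leverage)) / (n - p - 1)
    t = resid / np.sqrt(s2_loo * (1 - leverage))
    dffits = t * np.sqrt(leverage / (1 - leverage))
    return leverage, cooks_d, dffits

# Usage on synthetic, deliberately collinear data (not from the paper).
rng = np.random.default_rng(0)
x1 = rng.normal(size=30)
x2 = x1 + 0.01 * rng.normal(size=30)   # nearly a copy of x1
X = np.column_stack([np.ones(30), x1, x2])
y = 1.0 + x1 + x2 + rng.normal(size=30)
leverage, cooks_d, dffits = ls_influence(X, y)
print(np.argmax(cooks_d), cooks_d.max())
```

All three quantities are computed from a single fit, because the leave-one-out effects have closed forms in terms of the residuals and the hat-matrix diagonal.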

Keywords: Cook's distance; DFFITS; Leverage; Multicollinearity; Ridge regression.

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1. Leverage values for the Longley data set.
Figure 2. The two versions of Cook's D for the LTE in the Longley data for the two values of d̂.
Figure 3. DFFITS statistic for the LTE in the Longley data for the two values of d̂.
Figure 4. Leverage values for the Hald data set.

References

    1. Liu K. Using Liu-type estimator to combat collinearity. Commun. Stat., Theory Methods. 2003;32(5):1009–1020.
    2. Hoaglin D.C., Welsch R.E. The hat matrix in regression and ANOVA. Am. Stat. 1978;32(1):17–22.
    3. Velleman P.F., Welsch R.E. Efficient computing of regression diagnostics. Am. Stat. 1981;35(4):234–242.
    4. Cook R.D. Detection of influential observation in linear regression. Technometrics. 1977;19(1):15–18.
    5. Belsley D.A., Kuh E., Welsch R.E. Regression Diagnostics: Identifying Influential Data and Sources of Collinearity. John Wiley & Sons; 1980.