Main Article Content

Abstract

Outliers in regression analysis can cause large residuals, the diversity of the data becomes greater, causing the data to be heterogenous. If an outlier is caused by an error in recording observations or an error in preparing equipment, the outlier can be ignored or discarded before data analysis is carried out. However, if outliers exist not because of the researcher's error, but are indeed information that cannot be provided by other data, then the outlier data cannot be ignored and must be included in data analysis. There are several methods to deal with outliers. The Weight Least Square method produces good results and is quite resistive to outliers. The WLS method is used to overcome the regression model with non-constant error variance, because WLS has the ability to neutralize the consequences of violating the normality assumption caused by the presence of outliers and can eliminate the nature of unusualness and consistency of the OLS estimate. To compare the level of estimator accuracy between regression models, the mean absolute percentage error (MAPE) is used. Based on the results of this study, it was concluded that the WLS method produced a smaller Mean Absolute Percentage Error value so that the use of this method was more appropriate because it was not susceptible to the effect of outliers.

Keywords

Outlier Weight Least Square Mean Absolute Percentage Error

Article Details

How to Cite
Prasetya, R. P. (2023). Unpacking Outlier with Weight Least Square (Implemented on Pepper Plantations Data). Parameter: Journal of Statistics, 2(3), 24-31. https://doi.org/10.22487/27765660.2022.v2.i3.16138

References

  1. Aguinis, H., Gottfredson, R. K., & Joo, H. (2013). “Best-Practice Recommendations for Defining, Identifying, and Handling Outliers.” Organizational Research Methods 16(2): 270–301.
  2. Baltagi, Badi H. (2008). Econometrics. Edited by 4th. Berlin: Springer.
  3. BPS. (2021). Buku Pedoman Pencacahan Survei Komoditas Strategis Tanaman Perkebunan. Jakarta: Badan Pusat Statistik.
  4. Khair, Ummul, Hasanul Fahmi, Sarudin Al Hakim, and Robbi Rahim. (2017). “Forecasting Error Calculation with Mean Absolute Deviation and Mean Absolute Percentage Error.” Journal of Physics: Conference Series 930 (1). https://doi.org/10.1088/1742-6596/930/1/012002.
  5. Myers, Raymond H. (1986). Classical and Modern Regression with Applications. Classical and Modern Regression with Applications. Boston, Mass: Duxbury Press.
  6. Sanford, Weisberg. (2005). Applied Linear Regression. 3rd ed. John Wiley and Sons Inc.
  7. Soemartini. (2007). Pencilan (Outlier). Bandung: Universitas Padjadjaran.
  8. Strutz, Tilo. 2011. Data Fitting and Uncertainty A Practical Introduction to Weighted Least Squares and Beyond. 2nd ed. Springer.
  9. Williamson, D F, R A Parker, and J S Kendrick. (1989). “The Box Plot: A Simple Visual Method to Interpret Data.” Annals of Internal Medicine 110 (11): 916–21. https://doi.org/10.7326/0003-4819-110-11-916.
  10. Yuhan, Dely. (2017). “Analisis Faktor-Faktor Produktivitas Tanaman Dan Kelayakan Ekonomi Lada (Piper Nigrum L) Di Kabupaten Belitung Timur.” Universitas Muhamadiyah Yogyakarta.
  11. Zahara, Marliana S Rangkuti, and Robert Asnawi. (2014). “Analisis Komparasi Usahatani Lada Dan Faktorfaktor Yang Mempengaruhi Produksi Lada Hitam Di Lampung,” 765–72.

DB Error: Unknown column 'Array' in 'where clause'