[PDF][PDF] Accounting for Heteroscedasticity in Big Data

FN Antunes, A O'Sullivan, F Rodrigues… - Proc. NSF Workshop on …, 2015 - researchgate.net
Proc. NSF Workshop on Big Data and Urban Informatics, 2015researchgate.net
For regression problems, the general practice is to consider a constant variance of the error
term across all data. This aims to simplify an often complicated model and relies on the
assumption that this error is independent of the input variables. This property is known as
homoscedasticity. On the other hand, in the real world, this is often a naive assumption, as
we are rarely able to exhaustively include all true explanatory variables for a regression.
While Big Data is bringing new opportunities for regression applications, ignoring this …
Abstract
For regression problems, the general practice is to consider a constant variance of the error term across all data. This aims to simplify an often complicated model and relies on the assumption that this error is independent of the input variables. This property is known as homoscedasticity. On the other hand, in the real world, this is often a naive assumption, as we are rarely able to exhaustively include all true explanatory variables for a regression. While Big Data is bringing new opportunities for regression applications, ignoring this limitation may lead to biased estimators and inaccurate confidence and prediction intervals.
researchgate.net
以上显示的是最相近的搜索结果。 查看全部搜索结果