Faster generalised linear models in largeish data
Read OriginalThis technical article discusses an optimization for fitting generalized linear models (GLMs) on large datasets. It proposes using a starting estimator from a subsample, followed by a single Newton-Raphson iteration computed via a single database query, to achieve asymptotic efficiency. This approach aims to be faster than iterative methods like `bigglm` in R, especially when data resides in a database, and includes a practical example with a logistic regression on a vehicle dataset.
Comments
No comments yet
Be the first to share your thoughts!
Browser Extension
Get instant access to AllDevBlogs from your browser
Top of the Week
No top articles yet