উত্তর:
ইন পরিসংখ্যান শেখার উপাদানসমূহ বই, Hastie এট অল। এই সঙ্কুচিত কৌশলগুলির একটি খুব অন্তর্দৃষ্টিপূর্ণ এবং পুঙ্খানুপুঙ্খ তুলনা সরবরাহ করুন। বইটি অনলাইনে উপলব্ধ ( পিডিএফ)). The comparison is done in section 3.4.3, page 69.
The main difference between Lasso and Ridge is the penalty term they use. Ridge uses penalty term which limits the size of the coefficient vector. Lasso uses penalty which imposes sparsity among the coefficients and thus, makes the fitted model more interpretable. Elasticnet is introduced as a compromise between these two techniques, and has a penalty which is a mix of and norms.
To summarize, here are some salient differences between Lasso, Ridge and Elastic-net:
I highly recommended you to have a look at An introduction to statistical learning book (Tibshirani et. al, 2013).
The reason for this is that Elements of statistical learning book is intended for individuals with advanced training in the mathematical sciences. In the foreword to ISL, the authors write:
An Introduction to Statistical Learning arose from the perceived need for a broader and less technical treatment of these topics. [...]
An Introduction to Statistical Learning is appropriate for advanced undergraduates or master’s students in statistics or related quantitative fields or for individuals in other disciplines who wish to use statistical learning tools to analyze their data.
The above answers are very clear and informative. I would like to add one minor point from the statistic perspective. Take the ridge regression as an example. It is an extension of the ordinal least square regression to solve the multicollinearity problems when there are many correlated features. If the linear regression is
Y=Xb+e
The normal equation solution for the multiple linear regression
b=inv(X.T*X)*X.T*Y
The normal equation solution for the ridge regression is
b=inv(X.T*X+k*I)*X.T*Y.
It is a biased estimator for b and we can always find a penalty term k which will make the mean square error of Ridge regression smaller than that of OLS regression.
For LASSO and Elastic-Net, we could not find such a analytic solution.