Observations
– All of the above variations of regularization work fairly well here, since this is simple data with a single predictor. They would show significant differences with many predictors and large data.
- Through the scikit-learn Library
L1 Regularization
from sklearn import linear_model
import numpy as np
import pandas as pd

alphas = np.logspace(-4,-1,4) # alphas / penalty strengths, i.e., 10^-4, 10^-3, 10^-2, 10^-1
print("alphas: ", alphas)
# alphas: [0.0001 0.001 0.01 0.1 ]
# Lasso / L1 Regularization
lasso_cv = linear_model.LassoCV(alphas=alphas) # cross validation # , normalize=True
lasso_model = lasso_cv.fit(X_train, Y_train) # Model fit
lasso_preds = lasso_model.predict(X_test) # Prediction
lasso_preds = [round(x,2) for x in lasso_preds]
df_lasso_coeffs = pd.DataFrame({"variable": X_train.columns.tolist(),
"Coeff":lasso_cv.coef_})
# Variable significance i.e., abs(coeff) != 0, which are significant for the model fit
df_lasso_coeffs[df_lasso_coeffs["Coeff"]!=0]
# variable Coeff
# Product_Sell 95.97024
print("lasso_cv.alpha_: ", lasso_cv.alpha_)
# lasso_cv.alpha_: 0.0001
def mse(true, pred):
    '''
    Calculate MSE value
    '''
    return np.mean((pred - true)**2)
print("Loss: ", round(mse(Y_test, lasso_preds), 2))
# Loss: 21452.96
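As a quick check, a plain Lasso refit at the CV-selected alpha should reproduce the LassoCV result (a minimal sketch, reusing X_train, Y_train, X_test, Y_test and the mse helper from above):
# Refit a plain Lasso at the alpha chosen by LassoCV (illustrative sketch)
lasso_fixed = linear_model.Lasso(alpha=lasso_cv.alpha_)
lasso_fixed.fit(X_train, Y_train)
print("Coeff: ", lasso_fixed.coef_) # expected to be close to lasso_cv.coef_
print("Loss: ", round(mse(Y_test, lasso_fixed.predict(X_test)), 2))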
L2 Regularization
alphas = np.logspace(-4,-1,4) # alphas / penalty strengths, i.e., 10^-4, 10^-3, 10^-2, 10^-1
print("alphas: ", alphas)
# alphas: [0.0001 0.001 0.01 0.1 ]

# Ridge / L2 Regularization
ridge_cv = linear_model.RidgeCV(alphas=alphas) # cross validation # , normalize=True
ridge_model = ridge_cv.fit(X_train, Y_train) # Model fit
ridge_preds = ridge_model.predict(X_test) # Prediction
ridge_preds = [round(x,2) for x in ridge_preds]
df_ridge_coeffs = pd.DataFrame({"variable": X_train.columns.tolist(),
"Coeff":ridge_cv.coef_})
# Variable significance i.e., abs(coeff) >= 0.001, which are significant for the model fit
df_ridge_coeffs[df_ridge_coeffs["Coeff"]>=0.001]
# variable Coeff
# Product_Sell 95.934556
print("ridge_cv.alpha_: ", ridge_cv.alpha_)
# ridge_cv.alpha_: 0.1
print("Loss: ", spherical(mse(Y_test,ridge_preds),2))
# Loss: 21327.3
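Since both coefficient tables were stored above, the L1 and L2 estimates can be placed side by side (a minimal sketch, reusing df_lasso_coeffs and df_ridge_coeffs from above):
# Side-by-side view of Lasso vs Ridge coefficients (illustrative sketch)
df_compare = df_lasso_coeffs.merge(df_ridge_coeffs, on="variable",
                                   suffixes=("_lasso", "_ridge"))
print(df_compare)
# variable Coeff_lasso Coeff_ridge
# Product_Sell 95.97024 95.934556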
Elastic Net Regularization
alphas = np.logspace(-4,-1,4) # alphas / penalty strengths, i.e., 10^-4, 10^-3, 10^-2, 10^-1
print("alphas: ", alphas)
# alphas: [0.0001 0.001 0.01 0.1 ]

# Elastic Net
elastic_net_cv = linear_model.ElasticNetCV(alphas=alphas) # cross validation # , normalize=True
elastic_net_model = elastic_net_cv.fit(X_train, Y_train) # Model fit
elastic_net_preds = elastic_net_model.predict(X_test) # Prediction
elastic_net_preds = [round(x,2) for x in elastic_net_preds]
df_elastic_net_coeffs = pd.DataFrame({"variable": X_train.columns.tolist(),
"Coeff":elastic_net_cv.coef_})
# Variable significance i.e., abs(coeff) >= 0.001 (i.e., != 0), which are significant for the elastic net model fit
df_elastic_net_coeffs[df_elastic_net_coeffs["Coeff"]>=0.001]
# variable Coeff
# Product_Sell 95.970134
print("elastic_net_cv.alpha_: ", elastic_net_cv.alpha_)
# elastic_net_cv.alpha_: 0.0001
print("Loss: ", spherical(mse(Y_test,elastic_net_preds),2))
# Loss: 21452.96
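To summarize the three fits, the selected alphas and test losses can be collected in one table (a minimal sketch, reusing the fitted CV objects, predictions, and the mse helper from above; note that ElasticNetCV also exposes an l1_ratio parameter, default 0.5, controlling the L1/L2 mix):
# Summary of selected alphas and test losses across the three regularized fits
summary = pd.DataFrame({
    "model": ["Lasso (L1)", "Ridge (L2)", "ElasticNet (L1+L2)"],
    "alpha": [lasso_cv.alpha_, ridge_cv.alpha_, elastic_net_cv.alpha_],
    "loss": [round(mse(Y_test, lasso_preds), 2),
             round(mse(Y_test, ridge_preds), 2),
             round(mse(Y_test, elastic_net_preds), 2)],
})
print(summary)
# model alpha loss
# Lasso (L1) 0.0001 21452.96
# Ridge (L2) 0.1000 21327.30
# ElasticNet (L1+L2) 0.0001 21452.96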
Keypoints
- The above hyperparameters, i.e., the penalty strength, can be found using cross validation via hyperparameter tuning, i.e., grid or random search (see the sketch after this list).
- All of the above variations of regularization work fairly well here, since this is simple data with a single predictor. They would show significant differences with many predictors and large data.
- In general, regularization helps in the following ways. Prevents overfitting: regularization discourages complex models that would fit the training data too well, thus helping the model generalize better to new data.
- Simplifies models: by penalizing large coefficients, regularization can lead to simpler models that are easier to interpret and use.
- Improves stability: regularization leads to more stable models whose predictions are less sensitive to small changes in the training data.
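As a sketch of the grid-search route mentioned above, the penalty strength can be tuned with GridSearchCV (a minimal sketch, assuming the same X_train and Y_train as above; the candidate alphas and cv=5 are illustrative choices, not values from the original run):
# Hyperparameter tuning of the penalty strength via grid search (illustrative sketch)
from sklearn.model_selection import GridSearchCV
param_grid = {"alpha": np.logspace(-4,-1,4)} # same candidate penalties as above
grid = GridSearchCV(linear_model.Lasso(), param_grid,
                    scoring="neg_mean_squared_error", cv=5)
grid.fit(X_train, Y_train) # cross-validated search over the alpha grid
print("Best alpha: ", grid.best_params_["alpha"])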