Hyperparameter tuning#

Previous notebooks showed how model parameters impact statistical performance. We want to optimize these parameters to achieve the best possible model performance. This optimization process is called hyperparameter tuning.

This notebook demonstrates several methods to tune model hyperparameters.

Introductory example#

We revisit an example from the linear models notebook about the impact of the \(\alpha\) parameter in a Ridge model. The \(\alpha\) parameter controls model regularization strength. No general rule exists for selecting a good \(\alpha\) value - it depends on the specific dataset.
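
As a reminder, the regularization strength is set when constructing the model. A minimal sketch (the alpha values below are illustrative, not tuned):

from sklearn.linear_model import Ridge

# alpha controls the regularization strength: larger values shrink the
# coefficients more aggressively. These values are arbitrary examples.
weakly_regularized = Ridge(alpha=0.01)
strongly_regularized = Ridge(alpha=100)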

Let’s load a dataset for regression:

# When using JupyterLite, uncomment and install the `skrub` and `pyodide-http` packages.
# %pip install skrub
# %pip install pyodide-http
import matplotlib.pyplot as plt
import skrub

# import pyodide_http
# pyodide_http.patch_all()

skrub.patch_display()  # makes nice display for pandas tables
from sklearn.datasets import fetch_california_housing

X, y = fetch_california_housing(return_X_y=True, as_frame=True)
X

y
0        4.526
1        3.585
2        3.521
3        3.413
4        3.422
         ...  
20635    0.781
20636    0.771
20637    0.923
20638    0.847
20639    0.894
Name: MedHouseVal, Length: 20640, dtype: float64

Now we define a Ridge model that processes data by adding feature interactions using a PolynomialFeatures transformer.

from sklearn.linear_model import Ridge
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler

model = Pipeline(
    [
        ("poly", PolynomialFeatures()),
        ("scaler", StandardScaler()),
        ("ridge", Ridge()),
    ]
)
model
Pipeline(steps=[('poly', PolynomialFeatures()), ('scaler', StandardScaler()),
                ('ridge', Ridge())])

We start with scikit-learn’s default parameters. Let’s evaluate this basic model:

import pandas as pd
from sklearn.model_selection import KFold, cross_validate

cv = KFold(n_splits=10, shuffle=True, random_state=42)
cv_results = cross_validate(model, X, y, cv=cv)
cv_results = pd.DataFrame(cv_results)
cv_results

cv_results.aggregate(["mean", "std"])

Nothing indicates that our pipeline achieves optimal performance. The PolynomialFeatures degree might need adjustment, or the Ridge regressor might need a different regularization strength. Let’s examine which parameters we could tune:

for params in model.get_params():
    print(params)
memory
steps
verbose
poly
scaler
ridge
poly__degree
poly__include_bias
poly__interaction_only
poly__order
scaler__copy
scaler__with_mean
scaler__with_std
ridge__alpha
ridge__copy_X
ridge__fit_intercept
ridge__max_iter
ridge__positive
ridge__random_state
ridge__solver
ridge__tol

Two key parameters are poly__degree and ridge__alpha. We will find their optimal values for this dataset.
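
As a baseline, we can search manually: loop over candidate values and cross-validate each combination. Below is a minimal sketch; the candidate values are illustrative choices, not recommendations:

from sklearn.base import clone

best_score, best_params = -float("inf"), None
for degree in [1, 2, 3]:
    for alpha in [0.01, 0.1, 1, 10]:
        # Clone the pipeline so that the original `model` is left untouched.
        candidate = clone(model).set_params(poly__degree=degree, ridge__alpha=alpha)
        scores = cross_validate(candidate, X, y, cv=cv)["test_score"]
        if scores.mean() > best_score:
            best_score = scores.mean()
            best_params = {"poly__degree": degree, "ridge__alpha": alpha}
print(best_params, best_score)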

Hyperparameter search using a grid#

The manual search above implements a grid search: it tries every possible parameter combination. Scikit-learn provides GridSearchCV to automate this process. During fitting, it performs an internal cross-validation and selects the optimal hyperparameters.

from sklearn.model_selection import GridSearchCV, train_test_split

# Hold out a test set for the final evaluation (the split parameters are an arbitrary choice).
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
parameter_grid = {"poly__degree": [1, 2, 3], "ridge__alpha": [0.01, 0.1, 1, 10]}
search_cv = GridSearchCV(model, param_grid=parameter_grid)
search_cv.fit(X_train, y_train)
GridSearchCV(estimator=Pipeline(steps=[('poly', PolynomialFeatures()),
                                       ('scaler', StandardScaler()),
                                       ('ridge', Ridge())]),
             param_grid={'poly__degree': [1, 2, 3],
                         'ridge__alpha': [0.01, 0.1, 1, 10]})

The best_params_ attribute shows the optimal parameters found:

search_cv.best_params_
{'poly__degree': 1, 'ridge__alpha': 0.01}

The cv_results_ attribute provides details about all hyperparameter combinations tried during fitting:

cv_results = pd.DataFrame(search_cv.cv_results_)
cv_results
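
For instance, we can rank the combinations by their mean cross-validated test score; GridSearchCV stores each hyperparameter in a param_<name> column:

columns = ["param_poly__degree", "param_ridge__alpha", "mean_test_score"]
cv_results[columns].sort_values("mean_test_score", ascending=False)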

When refit=True (default), the search trains a final model using the best parameters. Access this model through best_estimator_:

search_cv.best_estimator_
Pipeline(steps=[('poly', PolynomialFeatures(degree=1)),
                ('scaler', StandardScaler()), ('ridge', Ridge(alpha=0.01))])

Calls to predict and score on the GridSearchCV instance are delegated to best_estimator_:

search_cv.score(X_test, y_test)
0.5910512173880501

EXERCISE:

GridSearchCV behaves like any classifier or regressor. Use cross_validate to evaluate the grid-search model we created.

# Write your code here.

QUESTION:

What limitations does the grid-search approach have?

Model with internal hyperparameter tuning#

Some estimators perform internal hyperparameter selection that is more efficient than a grid search. Their names typically end with CV (e.g. RidgeCV).
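
As a minimal illustration of the API (the alpha grid is an arbitrary choice), RidgeCV selects alpha internally using an efficient leave-one-out cross-validation:

import numpy as np
from sklearn.linear_model import RidgeCV

ridge_cv = RidgeCV(alphas=np.logspace(-2, 2, num=50)).fit(X, y)
ridge_cv.alpha_  # the regularization strength selected internally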

EXERCISE:

  1. Create a pipeline with PolynomialFeatures, StandardScaler, and Ridge

  2. Create a grid-search with this pipeline and tune alpha using np.logspace(-2, 2, num=50)

  3. Fit the grid-search on the training set and time it

  4. Repeat using RidgeCV instead of Ridge and remove GridSearchCV

  5. Compare computational performance between approaches

# Write your code here.

Inspection of hyperparameters in cross-validation#

When a hyperparameter search runs inside an outer evaluation cross-validation (nested cross-validation), different hyperparameter values may be selected on each outer split. Let’s examine this with GridSearchCV:

import numpy as np

inner_model = Pipeline(
    [
        ("poly", PolynomialFeatures()),
        ("scaler", StandardScaler()),
        ("ridge", Ridge()),
    ]
)
param_grid = {"poly__degree": [1, 2], "ridge__alpha": np.logspace(-2, 2, num=10)}
model = GridSearchCV(inner_model, param_grid=param_grid, n_jobs=-1)
model
GridSearchCV(estimator=Pipeline(steps=[('poly', PolynomialFeatures()),
                                       ('scaler', StandardScaler()),
                                       ('ridge', Ridge())]),
             n_jobs=-1,
             param_grid={'poly__degree': [1, 2],
                         'ridge__alpha': array([1.00000000e-02, 2.78255940e-02, 7.74263683e-02, 2.15443469e-01,
       5.99484250e-01, 1.66810054e+00, 4.64158883e+00, 1.29154967e+01,
       3.59381366e+01, 1.00000000e+02])})
In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

We run cross-validation and store models from each split by setting return_estimator=True:

cv_results = cross_validate(model, X, y, cv=cv, return_estimator=True)
cv_results = pd.DataFrame(cv_results)
cv_results

The estimator column contains the fitted GridSearchCV instances. We examine the best_params_ selected on each fold:

for estimator_cv_fold in cv_results["estimator"]:
    print(estimator_cv_fold.best_params_)
{'poly__degree': 1, 'ridge__alpha': np.float64(12.915496650148826)}
{'poly__degree': 1, 'ridge__alpha': np.float64(0.01)}
{'poly__degree': 2, 'ridge__alpha': np.float64(0.027825594022071243)}
{'poly__degree': 1, 'ridge__alpha': np.float64(35.93813663804626)}
{'poly__degree': 1, 'ridge__alpha': np.float64(12.915496650148826)}
{'poly__degree': 1, 'ridge__alpha': np.float64(12.915496650148826)}
{'poly__degree': 2, 'ridge__alpha': np.float64(0.01)}
{'poly__degree': 1, 'ridge__alpha': np.float64(35.93813663804626)}
{'poly__degree': 1, 'ridge__alpha': np.float64(12.915496650148826)}
{'poly__degree': 1, 'ridge__alpha': np.float64(12.915496650148826)}

This inspection reveals the stability of hyperparameter values across folds.
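
To quantify this stability, we can count how often each combination was selected across folds (a small sketch using the standard library):

from collections import Counter

selected_params = Counter(
    (est.best_params_["poly__degree"], est.best_params_["ridge__alpha"])
    for est in cv_results["estimator"]
)
selected_params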

Note regarding the scoring metric to optimize during tuning#

The GridSearchCV and RandomizedSearchCV classes use the scoring parameter to define the metric to optimize during tuning. If not specified, it defaults to accuracy for classification and the R² score (r2_score) for regression.

These defaults are not optimal for hyperparameter tuning: it is now recognized that proper scoring rules are a better choice, because optimizing them yields calibrated models.

Therefore, we recommend using brier_score_loss or log_loss for classification and mean_squared_error for regression.
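
For example, reusing the grid-search defined above, we can optimize the mean squared error instead; scikit-learn scorers follow a "greater is better" convention, hence the neg_ prefix:

model_mse = GridSearchCV(
    inner_model,
    param_grid=param_grid,
    scoring="neg_mean_squared_error",  # proper scoring rule for regression
    n_jobs=-1,
)
# For a classifier, one would pass scoring="neg_log_loss" or
# scoring="neg_brier_score" instead.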