Support Vector Machines (SVM) Regressor parameters

Overview:

Support Vector Machines (SVM) for regression, known as Support Vector Regression (SVR), is a robust and flexible machine learning algorithm used for various regression tasks. OtasML provides extensive configuration options for fine-tuning SVR models to achieve optimal performance.

Configurations page:

The Configurations page allows users to adjust various parameters of the SVR model. Here are the detailed configuration options:

Kernel

Default Value: rbf
Description: Specifies the kernel function, which determines how similarity between samples is measured and, for non-linear kernels, implicitly maps the input features into a higher-dimensional space. The choice of kernel has a significant impact on the performance of the support vector machine for regression tasks.
- linear: This option specifies a linear kernel, which is suitable for linear regression-like problems. It works directly in the original feature space, using the plain dot product of the inputs, and does not map them into a higher-dimensional space.
- poly: This option specifies a polynomial kernel, which can capture polynomial relationships between the input features and the target variable. You can specify the degree of the polynomial using the degree parameter.
- rbf (Radial Basis Function): The RBF kernel is also known as the Gaussian kernel. It is a common choice and is capable of capturing non-linear relationships in the data. It maps the input features into an infinite-dimensional space.
- sigmoid: The sigmoid kernel applies a hyperbolic tangent (tanh) to a scaled dot product of the input features, shifted by coef0. It is suitable for problems where the relationship between features and the target variable resembles an S-shaped, logistic-like curve.
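
The four kernel options can be written out as plain functions. The sketch below uses the standard formulas found in libsvm and scikit-learn; whether OtasML computes them in exactly this way is an assumption, and the helper names are illustrative only.

```python
import math

def dot(x, y):
    """Plain dot product of two equal-length feature vectors."""
    return sum(a * b for a, b in zip(x, y))

def linear_kernel(x, y):
    # No mapping: similarity is the dot product in the original space.
    return dot(x, y)

def poly_kernel(x, y, gamma, coef0=0.0, degree=3):
    # (gamma * <x, y> + coef0) ** degree
    return (gamma * dot(x, y) + coef0) ** degree

def rbf_kernel(x, y, gamma):
    # exp(-gamma * ||x - y||^2); equals 1.0 when x and y coincide.
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

def sigmoid_kernel(x, y, gamma, coef0=0.0):
    # tanh(gamma * <x, y> + coef0)
    return math.tanh(gamma * dot(x, y) + coef0)

x, y = [1.0, 2.0], [3.0, 4.0]
print(linear_kernel(x, y))
print(poly_kernel(x, y, gamma=0.5, coef0=1.0, degree=2))
print(rbf_kernel(x, y, gamma=0.5))
print(sigmoid_kernel(x, y, gamma=0.5))
```

Note that gamma appears in the poly, rbf, and sigmoid formulas, coef0 only in poly and sigmoid, and degree only in poly.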

Degree

Default Value: 3
Description: Applies only when the polynomial kernel (kernel=poly) is selected; all other kernels ignore it. The degree parameter specifies the degree of the polynomial used in the kernel function.

Gamma

Default Value: scale
Description: This parameter is a crucial hyperparameter that affects the shape and flexibility of the kernel function, particularly for the Radial Basis Function (RBF) kernel. The gamma parameter controls how smooth or wiggly the regression function learned by the SVR model is: larger values let each training point influence only its close neighborhood, giving a more flexible fit, while smaller values give a smoother one.
- scale: If you set gamma=scale, the gamma value is automatically calculated based on the scale of the input features. It is computed as 1 / (n_features * X.var()), where n_features is the number of features in your data, and X.var() is the variance computed over all values in the input data X.
- auto: If you set gamma=auto, the gamma value is computed as 1 / n_features. Unlike the scale option, it does not take the variance of the data into account.
Warning: Gamma is the kernel coefficient for rbf, poly and sigmoid. The linear kernel does not use it.
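
To make the two gamma options concrete, here is a minimal sketch in plain Python. It follows the scikit-learn definitions (scale = 1 / (n_features * X.var()), auto = 1 / n_features); the function names are hypothetical, and it is assumed rather than confirmed that OtasML computes the values the same way.

```python
def gamma_scale(X):
    """1 / (n_features * variance), with the variance taken over all values in X."""
    n_features = len(X[0])
    values = [v for row in X for v in row]
    mean = sum(values) / len(values)
    variance = sum((v - mean) ** 2 for v in values) / len(values)
    # Constant data (variance 0) would divide by zero in this sketch.
    return 1.0 / (n_features * variance)

def gamma_auto(X):
    """1 / n_features, independent of the spread of the data."""
    return 1.0 / len(X[0])

X = [[0.0, 2.0], [2.0, 4.0]]   # 2 samples, 2 features
print(gamma_scale(X))
print(gamma_auto(X))
```

Because scale depends on the variance of the data, rescaling the features changes the resulting gamma, whereas auto stays fixed for a given number of features.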

Coef0

Default Value: 0.0
Description: The coef0 parameter is the independent (constant) term added inside the kernel function. In the polynomial kernel it shifts the balance between higher-order and lower-order terms; in the sigmoid kernel it shifts the input of the sigmoid function.
Warning: It only has an effect with the poly and sigmoid kernels.

Tol

Default Value: 1e-3
Description: The tol parameter, short for tolerance, specifies the tolerance of the stopping criterion used during training. The solver stops once its optimality conditions are satisfied to within this tolerance, so smaller values give a more precise solution at the cost of longer training.

C

Default Value: 1.0
Description: The C parameter controls the regularization of the SVR model. The strength of the regularization is inversely proportional to C: a small C favors a flatter, simpler regression function and tolerates more training error, while a large C penalizes errors outside the epsilon tube more heavily and fits the training data more closely.
Warning: Must be strictly positive. The penalty is a squared l2 penalty.
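
The trade-off this parameter controls can be illustrated with the standard primal objective of a linear epsilon-SVR: a flatness term plus C times the total error outside the tube. This is a textbook sketch, not OtasML's solver; the function name and the toy data are made up for illustration, and the parameter is called C here as in scikit-learn.

```python
def svr_objective(w, b, X, y, C=1.0, epsilon=0.1):
    """0.5 * ||w||^2 + C * sum of errors that exceed the epsilon tube."""
    flatness = 0.5 * sum(wi * wi for wi in w)
    error = 0.0
    for xi, yi in zip(X, y):
        pred = sum(wi * v for wi, v in zip(w, xi)) + b
        # Deviations smaller than epsilon cost nothing.
        error += max(0.0, abs(yi - pred) - epsilon)
    return flatness + C * error

X = [[1.0], [2.0]]
y = [1.0, 3.0]
w, b = [1.0], 0.0

# The same model is judged more harshly on its errors as C grows.
print(svr_objective(w, b, X, y, C=1.0))
print(svr_objective(w, b, X, y, C=10.0))
```

With a small C the flatness term dominates, so the optimizer prefers simpler functions; with a large C the error term dominates and the fit follows the training data more closely.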

Epsilon

Default Value: 0.1
Description: The epsilon parameter, often denoted as ε, is a hyperparameter that controls the margin of tolerance in the SVR model. It specifies the size of the epsilon-insensitive tube around the regression line or hyperplane: training points whose prediction lies within ε of the true value incur no penalty. A larger epsilon tolerates bigger errors and typically yields fewer support vectors, while a smaller epsilon fits the training data more closely.
Warning: Must be non-negative.
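
The epsilon-insensitive tube corresponds to a simple loss function, sketched below in plain Python. The function name and sample values are illustrative only.

```python
def epsilon_insensitive_loss(y_true, y_pred, epsilon=0.1):
    """Zero inside the tube, linear in the excess error outside it."""
    return max(0.0, abs(y_true - y_pred) - epsilon)

targets = [1.0, 1.0, 1.0]
predictions = [1.05, 0.92, 1.5]

for t, p in zip(targets, predictions):
    loss = epsilon_insensitive_loss(t, p, epsilon=0.1)
    status = "inside tube" if loss == 0.0 else "outside tube"
    print(f"target={t} prediction={p} loss={loss:.2f} ({status})")
```

Setting epsilon to 0 removes the tube entirely, so every deviation from the target is penalized.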

Shrinking

Default Value: True
Description: Enables the shrinking heuristic, a technique used to accelerate training and reduce the computational cost of solving the SVM optimization problem. During optimization, it temporarily removes from the working set the variables that are unlikely to change any further, so the solver works on a smaller problem.

Cache Size

Default Value: 200
Description: It specifies the size of the kernel cache used during training. The kernel cache is a memory buffer that stores intermediate results of kernel computations to speed up the training process.
Warning: In MB

Verbose

Default Value: False
Description: It is a parameter that controls the level of verbosity or the amount of information that the SVR model outputs during the training process. It can be helpful for monitoring the progress of training or for debugging purposes.

Max Iter

Default Value: -1
Description: It is a parameter that determines the maximum number of iterations for the optimization solver to converge during the training process. The optimization solver is responsible for finding the optimal hyperplane that minimizes the loss function while respecting the margin constraints.
Warning: -1 for no limit.
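
The model parameters described so far can be collected in one place together with the constraints stated in their warnings. The sketch below is a hypothetical helper, not part of OtasML: the key names follow the scikit-learn SVR arguments that these options appear to mirror, and "C" is the name assumed for the regularization parameter with default 1.0.

```python
# Documented defaults from this page (key names are an assumption).
SVR_DEFAULTS = {
    "kernel": "rbf",
    "degree": 3,
    "gamma": "scale",
    "coef0": 0.0,
    "tol": 1e-3,
    "C": 1.0,
    "epsilon": 0.1,
    "shrinking": True,
    "cache_size": 200,   # in MB
    "verbose": False,
    "max_iter": -1,      # -1 means no limit
}

VALID_KERNELS = {"linear", "poly", "rbf", "sigmoid"}

def check_svr_params(params):
    """Return a list of violated constraints (empty if the settings are valid)."""
    problems = []
    if params["kernel"] not in VALID_KERNELS:
        problems.append("kernel must be one of linear, poly, rbf, sigmoid")
    if params["C"] <= 0:
        problems.append("C must be strictly positive")
    if params["epsilon"] < 0:
        problems.append("epsilon must be non-negative")
    return problems

print(check_svr_params(SVR_DEFAULTS))
print(check_svr_params({**SVR_DEFAULTS, "C": 0.0, "epsilon": -0.1}))
```

Checking settings like this before training gives a clearer message than a failure inside the solver.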

Test Size

Default Value: 0.2
Description: The test size parameter is used when splitting the dataset into training and test subsets. It specifies the proportion of the data (for example, 0.2 for 20%) that will be held out for testing.

Train Size

Default Value: 0.8
Description: The train size parameter is used when splitting the dataset into training and test subsets. It specifies the proportion of the data (for example, 0.8 for 80%) that will be used for model training.
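
As an illustration of how the two sizes partition a dataset, the following sketch shuffles sample indices and splits them with the default proportions. The function name, the fixed seed, and the rounding rule are assumptions made for this example; OtasML's own splitting logic may differ.

```python
import random

def split_indices(n_samples, test_size=0.2, seed=0):
    """Shuffle the indices 0..n_samples-1 and split them into (train, test)."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)      # reproducible shuffle
    n_test = round(test_size * n_samples)     # rounding rule is an assumption
    return indices[n_test:], indices[:n_test]

train_idx, test_idx = split_indices(10, test_size=0.2)
print(len(train_idx), len(test_idx))
```

With the defaults, 80% of the samples are used for training and the remaining 20% for testing, and no sample appears in both subsets.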

Conclusion

OtasML’s configuration options for SVR models enable users to tailor their models to specific datasets and prediction tasks. Adjusting parameters such as kernel, degree, and gamma can significantly impact the model’s performance and flexibility. Use this guide to fine-tune your SVR models and achieve superior predictive accuracy with OtasML.
