The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Problem with generalized linear model (lambda seach)
scottchung64
Member Posts: 1 Learner I
Hi all,
I'm trying to do classification using generalized linear model.
In default setting, the lambda value is chosen by H2O (described in documentation).
However, I found that if I use lambda search, the performance is much better.
I don't understand what is the difference between this two method.
Is the better performance from doing lambda search comes from overfitting?
Thanks!
Best,
Scott
Tagged:
0
Comments
Hi @scottchung64,
You are correct. The lambda search is used for controlling the regularization to avoid overfitting. When performing regularization, penalties are introduced to the model buidling process to avoid overfitting. GLM needs to find the optimal values of the regularization parameters alpha and lambda. The lambda parameter controls the amount of regularization applied to the model.
When you activate the labmda search in GLM operator, it will take longer time to find the best value of parameters.
YY
is it possible to initiate an Alpha search?
i see this: "Providing multiple alpha values via the advanced parameters triggers a search."
but how do i actualy provide multiple values...what is the format?
Hi @staskhalitov,
Good point. You will need to edit the "expert parameters" list
Hope it helps.
YY
so I tried your xml, but it seems like the model just uses what ever value of Alpha you have in the initial settings, .6 in your example.
It doesnt look like it considered the additional Alphas, .2 & .1, in the expert parameters.
How do i actualy initiate a search for an Alpha per this description?
alpha
Description: The alpha parameter controls the distribution between the L1 (Lasso) and L2 (Ridge regression) penalties. A value of 1.0 for alpha represents Lasso, and an alpha value of 0.0 produces Ridge regression. Providing multiple alpha values via the advanced parameters triggers a search. Default is 0.0 for the L-BFGS solver, else 0.5.
Range: real; 0.0-1.0
Optional: true
If i leave the initial Alpha .6 blank, and have additional Alphas in expert parameters i get an error.
Hi @staskhalitov,
Thanks for the followup! Great catch. I double checked the model descriptions and unfortunately the additional alpha values are not used for alpha search. We are investigating the bug. @phellinger
At the same time, you can manually do a grid search by loop. Here is an example:
Best,
YY