Skip to content

Hyperparameter

Medium — good to knowAI & ML

ELI5 — The Vibe Check

Hyperparameters are the settings you configure BEFORE training starts — as opposed to parameters (weights) which the model learns ON ITS OWN. Learning rate, batch size, number of layers: these are hyperparameters. Getting them right is an art form, and getting them wrong is why your model trains for a week and produces garbage.

Real Talk

Hyperparameters are configuration values set before training that govern the learning process itself, rather than being learned from data. Examples include learning rate, batch size, number of epochs, layer count, hidden dimension, dropout rate, and weight decay. Hyperparameter tuning (e.g., via grid search or Bayesian optimization) is a critical step in ML pipelines.

When You'll Hear This

"Tune the hyperparameters on the validation set." / "Learning rate is the most important hyperparameter."

Made with passive-aggressive love by manoga.digital. Powered by Claude.