100 terms
Machine Learning
Fundamentals

A field of study focused on algorithms that learn patterns from data to make predictions or decisions.

Supervised Learning
Fundamentals

Learning a mapping from inputs to outputs using labeled examples (e.g., regression, classification).

Unsupervised Learning
Fundamentals

Finding structure in unlabeled data, such as grouping similar points or discovering latent factors.

Reinforcement Learning
Fundamentals

Learning to act via trial and error to maximize cumulative reward in an environment.

Regression
Fundamentals

A supervised learning approach for predicting continuous numeric targets.

Classification
Fundamentals

A supervised learning task that assigns inputs to discrete categories.

Clustering
Fundamentals

An unsupervised learning task that groups similar instances without using labels.

Dimensionality Reduction
Fundamentals

A term in fundamentals used in machine learning practice.

Feature Engineering
Data & Features

A term in data & features used in machine learning practice.

Feature Selection
Data & Features

A term in data & features used in machine learning practice.

Model Selection
Fundamentals

A term in fundamentals used in machine learning practice.

Hyperparameter Tuning
Optimization & Training

Methods to tune hyperparameters to improve model performance.

Cross-validation
Optimization & Training

A resampling procedure for estimating generalization performance by training/validating on different splits.

K-Fold Cross-Validation
Optimization & Training

A procedure to assess generalization by rotating train/validation splits across folds.

Stratified Sampling
Data & Features

A term in data & features used in machine learning practice.

Train-Test Split
Data & Features

A term in data & features used in machine learning practice.

Overfitting
Fundamentals

A term in fundamentals used in machine learning practice.

Underfitting
Fundamentals

A term in fundamentals used in machine learning practice.

Bias-Variance Tradeoff
Fundamentals

A term in fundamentals used in machine learning practice.

Regularization
Optimization & Training

Techniques that constrain model complexity to reduce overfitting (e.g., L1, L2, dropout, weight decay).

L1 Regularization
Optimization & Training

A technique to control model complexity and reduce overfitting by penalizing large parameters.

L2 Regularization
Optimization & Training

A technique to control model complexity and reduce overfitting by penalizing large parameters.

Elastic Net
Optimization & Training

A term in optimization & training used in machine learning practice.

Logistic Regression
Classical Models

A linear classifier that models the log-odds of a binary label with a sigmoid link function.

Linear Regression
Classical Models

A supervised learning approach for predicting continuous numeric targets.

Decision Tree
Classical Models

A term in classical models used in machine learning practice.

Random Forest
Classical Models

An ensemble of decision trees that reduces variance by averaging many randomized trees.

Gradient Boosting
Classical Models

An additive model that fits new learners to the residuals of current predictions to reduce errors.

Xgboost
Classical Models

A term in classical models used in machine learning practice.

Lightgbm
Classical Models

A term in classical models used in machine learning practice.

Support Vector Machine
Classical Models

A margin-based classifier that finds a separating hyperplane; can use kernels for nonlinearity.

Naive Bayes
Classical Models

A term in classical models used in machine learning practice.

K-Nearest Neighbors
Classical Models

A term in classical models used in machine learning practice.

K-means
Clustering & Dimensionality Reduction

A clustering algorithm that partitions data into k clusters by minimizing within-cluster variance.

Hierarchical Clustering
Clustering & Dimensionality Reduction

An unsupervised learning task that groups similar instances without using labels.

Dbscan
Clustering & Dimensionality Reduction

A term in clustering & dimensionality reduction used in machine learning practice.

Principal Component Analysis
Clustering & Dimensionality Reduction

A technique for dimensionality reduction that finds orthogonal directions of maximum variance.

Linear Discriminant Analysis
Clustering & Dimensionality Reduction

A term in clustering & dimensionality reduction used in machine learning practice.

quadratic discriminant analysis
Clustering & Dimensionality Reduction

A term in clustering & dimensionality reduction used in machine learning practice.

scaling
Data & Features

Feature scaling transforms that normalize ranges or distributions to aid model training.

standardization
Data & Features

Feature scaling transforms that normalize ranges or distributions to aid model training.

normalization
Data & Features

Feature scaling transforms that normalize ranges or distributions to aid model training.

z-score
Data & Features

Feature scaling transforms that normalize ranges or distributions to aid model training.

min-max scaling
Data & Features

Feature scaling transforms that normalize ranges or distributions to aid model training.

pipeline
Data & Features

A term in data & features used in machine learning practice.

grid search
Optimization & Training

Methods to tune hyperparameters to improve model performance.

random search
Optimization & Training

Methods to tune hyperparameters to improve model performance.

bayesian optimization
Optimization & Training

Methods to tune hyperparameters to improve model performance.

evaluation metrics
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

accuracy
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

precision
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

recall
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

f1 score
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

roc auc
Metrics & Evaluation

A threshold-agnostic metric summarizing the tradeoff between true positive rate and false positive rate.

precision-recall auc
Metrics & Evaluation

A metric suited for imbalanced data, summarizing precision-recall tradeoffs across thresholds.

confusion matrix
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

mean squared error
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

mean absolute error
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

root mean squared error
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

r-squared
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

log loss
Metrics & Evaluation

An objective function that quantifies prediction error during training.

calibration
Metrics & Evaluation

A way to quantify model performance for validation, selection, and comparison.

data preprocessing
Data & Features

A term in data & features used in machine learning practice.

data augmentation
Data & Features

A term in data & features used in machine learning practice.

imbalanced data
Data & Features

A term in data & features used in machine learning practice.

resampling
Data & Features

A term in data & features used in machine learning practice.

anomaly detection
Data & Features

A term in data & features used in machine learning practice.

outlier detection
Data & Features

A term in data & features used in machine learning practice.

missing data
Data & Features

A term in data & features used in machine learning practice.

one-hot encoding
Data & Features

A term in data & features used in machine learning practice.

target encoding
Data & Features

A term in data & features used in machine learning practice.

embeddings
Deep Learning

A term in deep learning used in machine learning practice.

neural network
Deep Learning

A function approximator composed of layers of linear operations and nonlinear activations.

deep learning
Deep Learning

A term in deep learning used in machine learning practice.

convolutional neural network
Deep Learning

A deep network that uses convolutions to capture local spatial patterns in images.

recurrent neural network
Deep Learning

A recurrent neural network architecture designed to model sequences and long-range dependencies.

lstm
Deep Learning

A recurrent neural network architecture designed to model sequences and long-range dependencies.

gru
Deep Learning

A recurrent neural network architecture designed to model sequences and long-range dependencies.

transformer
Deep Learning

A neural architecture based on attention mechanisms that models relationships across sequence positions.

attention
Deep Learning

A mechanism that lets models focus on the most relevant parts of the input when computing representations.

self-attention
Deep Learning

An attention-based deep learning architecture effective for sequence modeling and long contexts.

encoder-decoder
Deep Learning

An attention-based deep learning architecture effective for sequence modeling and long contexts.

positional encoding
Deep Learning

A term in deep learning used in machine learning practice.

activation function
Deep Learning

A term in deep learning used in machine learning practice.

relu
Deep Learning

A term in deep learning used in machine learning practice.

sigmoid
Deep Learning

A term in deep learning used in machine learning practice.

tanh
Deep Learning

A term in deep learning used in machine learning practice.

softmax
Deep Learning

A term in deep learning used in machine learning practice.

loss function
Deep Learning

An objective function that quantifies prediction error during training.

cross-entropy
Deep Learning

A term in deep learning used in machine learning practice.

optimization
Optimization & Training

A term in optimization & training used in machine learning practice.

gradient descent
Optimization & Training

An optimization method that updates parameters in the direction that reduces the loss.

stochastic gradient descent
Optimization & Training

An optimization method that updates parameters in the direction that reduces the loss.

momentum
Optimization & Training

A term in optimization & training used in machine learning practice.

adam
Optimization & Training

A term in optimization & training used in machine learning practice.

rmsprop
Optimization & Training

A term in optimization & training used in machine learning practice.

learning rate
Optimization & Training

A term in optimization & training used in machine learning practice.

weight decay
Optimization & Training

A technique to control model complexity and reduce overfitting by penalizing large parameters.

initialization
Optimization & Training

A term in optimization & training used in machine learning practice.

backpropagation
Deep Learning

An algorithm to compute gradients of parameters efficiently via the chain rule for training neural networks.