Training
What is labeled data?
Training inputs, each paired with a reliable description of what it is (the ground truth).
Training
What are the top-level classifications of ML algorithms?
Supervised learning, unsupervised learning, and reinforcement learning
Training
How does Supervised Learning work?
Algorithms are trained on labeled data
Training
What is the goal of supervised learning?
Learn a mapping function that can predict the output for new, unseen input data
Training
How does unsupervised learning work?
Trained on unlabeled data
Training
Goal of unsupervised learning?
Discover inherent patterns, structures, or relationships within the input data
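Example sketch: one way to discover structure in unlabeled data is clustering. The snippet below is a minimal 1-D k-means, chosen here only as an illustration; the values and starting centers are made up, and `kmeans_1d` is a hypothetical helper name.

```python
def kmeans_1d(values, centers, iterations=10):
    """Minimal 1-D k-means: alternate assignment and center update."""
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: each value joins its nearest center.
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Update step: move each center to the mean of its cluster.
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


# Unlabeled, invented data: no one tells the algorithm there are two groups.
values = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
centers, clusters = kmeans_1d(values, centers=[0.0, 5.0])
print(centers)   # [2.0, 11.0]
print(clusters)  # [[1.0, 2.0, 3.0], [10.0, 11.0, 12.0]]
```

The number of clusters and the starting centers are still chosen by hand here; only the grouping itself is learned.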
Reinforcement Learning
What is reinforcement learning?
The algorithm (agent) is given rewards or penalties for its actions and learns from this feedback to improve its decision-making over time. It is a separate category from semi-supervised learning.
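Example sketch: the reward-and-penalty loop can be illustrated with a two-armed bandit and an epsilon-greedy agent. This is one simple setting, assumed for illustration; the reward values, `run_bandit`, and its parameters are hypothetical.

```python
import random


def run_bandit(rewards, steps=200, epsilon=0.1, seed=0):
    """Epsilon-greedy agent learning the average reward of each action."""
    rng = random.Random(seed)
    estimates = [0.0] * len(rewards)  # learned value of each action
    counts = [0] * len(rewards)       # how often each action was tried
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: try a random action.
            action = rng.randrange(len(rewards))
        else:
            # Exploit: pick the action that currently looks best.
            action = max(range(len(rewards)), key=lambda a: estimates[a])
        reward = rewards[action]  # feedback from the environment
        counts[action] += 1
        # Incremental running mean of the rewards seen for this action.
        estimates[action] += (reward - estimates[action]) / counts[action]
    return estimates


# Action 0 is penalized (-1.0), action 1 is rewarded (+1.0).
estimates = run_bandit([-1.0, 1.0])
best_action = max(range(len(estimates)), key=lambda a: estimates[a])
print(best_action)
```

No labels are supplied anywhere: the agent only sees the reward that follows each action it tries.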
Reinforcement Learning
What data is used for Semi-Supervised learning?
A small amount of labeled data plus a larger amount of unlabeled data
Reinforcement Learning
What is Reinforcement Learning usually used for?
Teaching AI to play games, and teaching robots to navigate and manipulate objects
Reinforcement Learning
Example of Reinforcement Learning in healthcare?
Optimize treatment plans
Reinforcement Learning
Example of Reinforcement Learning in finance?
Trading strategies
Reinforcement Learning
Are images and videos structured or unstructured data?
Unstructured
RLHF
What is RLHF?
Reinforcement Learning from Human Feedback
RLHF
Why bother with RLHF?
To better align the model's outputs with human values and preferences
RLHF
How do you do RLHF?
Train a separate reward model: ask humans which of two generated answers they prefer, and fit the reward model to those choices. Then use the reward model to fine-tune the actual model.
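Example sketch: a common way to fit a reward model to pairwise human choices is a logistic (Bradley-Terry style) loss on the difference between the two scores. The card does not say which loss is used, so treat this as an assumption; `preference_loss` is a hypothetical name.

```python
import math


def preference_loss(reward_chosen, reward_rejected):
    """Pairwise loss: -log(sigmoid(reward_chosen - reward_rejected)).

    Small when the reward model scores the human-preferred answer
    higher than the rejected one, large when it ranks them the wrong way.
    """
    margin = reward_chosen - reward_rejected
    return math.log1p(math.exp(-margin))


agrees = preference_loss(2.0, 0.0)     # model agrees with the human
disagrees = preference_loss(0.0, 2.0)  # model disagrees with the human
print(agrees < disagrees)  # True
```

Minimizing this loss over many comparisons pushes the reward model to score preferred answers higher; that score is then what the fine-tuning step optimizes.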
Regression and Classification
What is the goal of a regression?
Predict a continuous numeric output value from input values
Regression and Classification
What happens in Regression Training?
Fit a curve so you can predict new values
Regression and Classification
What is the goal of classification?
Assign each input to one of a set of discrete categories (classes)
Regression and Classification
What happens in Classification training?
Learn from labeled examples where the boundaries between classes lie, so that similar inputs receive the same label
Regression and Classification
Example of regression training and inference?
Input the heights and weights of many people. Fit a curve to the data. Then predict how much someone weighs from their height.
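Example sketch: the height-to-weight case with a straight line fitted by ordinary least squares. The data points are invented for illustration, and a real fit might need a curve rather than a line; `fit_line` and `predict_weight` are hypothetical helper names.

```python
# Hypothetical (height in cm, weight in kg) training data.
heights = [150.0, 160.0, 170.0, 180.0, 190.0]
weights = [50.0, 58.0, 66.0, 74.0, 82.0]


def fit_line(xs, ys):
    """Training: ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict_weight(height, slope, intercept):
    """Inference: read the weight off the fitted line."""
    return slope * height + intercept


slope, intercept = fit_line(heights, weights)
predicted = predict_weight(175.0, slope, intercept)
print(round(predicted, 6))  # 70.0
```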
Regression and Classification
Example of a Classification training and inference?
Input the heights and weights of many cats, dogs, and giraffes, each labeled with its species. Then ask which animal it is, given a height and/or weight.
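Example sketch: the animal case with a nearest-centroid classifier, picked here only because it is short; many other classifiers would work. The measurements are invented, and `train_centroids` and `classify` are hypothetical helper names.

```python
import math

# Hypothetical labeled examples: (height_cm, weight_kg) -> species.
training = [
    ((25.0, 4.0), "cat"), ((23.0, 5.0), "cat"),
    ((60.0, 30.0), "dog"), ((55.0, 25.0), "dog"),
    ((500.0, 1000.0), "giraffe"), ((480.0, 900.0), "giraffe"),
]


def train_centroids(examples):
    """Training: average the measurements of each labeled class."""
    sums = {}
    for (height, weight), label in examples:
        total = sums.setdefault(label, [0.0, 0.0, 0])
        total[0] += height
        total[1] += weight
        total[2] += 1
    return {label: (s[0] / s[2], s[1] / s[2]) for label, s in sums.items()}


def classify(point, centroids):
    """Inference: return the class whose centroid is closest to the point."""
    return min(centroids, key=lambda label: math.dist(point, centroids[label]))


centroids = train_centroids(training)
print(classify((24.0, 4.5), centroids))     # cat
print(classify((490.0, 950.0), centroids))  # giraffe
```

The output is always one of the labels seen in training, which is what separates classification from regression.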
Regression and Classification
Classification or Regression for a continuous set of outputs?
Regression (numeric)
Regression and Classification
Classification or Regression for a discrete set of outputs?
Classification
Regression and Classification
Is k-NN used with Classification or Regression?
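Both. It is most often introduced as a classifier (majority vote of the k nearest neighbors), but it can also do regression (average of the neighbors' values).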
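Example sketch: k-NN can serve either task, because the same neighbor search feeds a majority vote (classification) or an average (regression). The data is invented, and `k_nearest`, `knn_classify`, and `knn_regress` are hypothetical helper names.

```python
import math
from collections import Counter


def k_nearest(train, query, k):
    """Return the targets of the k training points closest to the query."""
    ranked = sorted(train, key=lambda item: math.dist(item[0], query))
    return [target for _, target in ranked[:k]]


def knn_classify(train, query, k=3):
    """Classification: majority vote among the k nearest labels."""
    return Counter(k_nearest(train, query, k)).most_common(1)[0][0]


def knn_regress(train, query, k=3):
    """Regression: mean of the k nearest numeric targets."""
    neighbors = k_nearest(train, query, k)
    return sum(neighbors) / len(neighbors)


labeled = [((1.0,), "small"), ((2.0,), "small"), ((3.0,), "small"),
           ((10.0,), "large"), ((11.0,), "large"), ((12.0,), "large")]
numeric = [((1.0,), 10.0), ((2.0,), 20.0), ((3.0,), 30.0), ((10.0,), 100.0)]

print(knn_classify(labeled, (2.5,)))  # small
print(knn_regress(numeric, (2.0,)))   # 20.0
```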