What is the Difference Between Supervised and Unsupervised Learning?

The main difference between supervised and unsupervised learning lies in the type of data used for training. Here is a comparison of the two:

Supervised Learning:

Requires labeled input and output data for training.
The model learns the relationship between the input and output through the provided labels.
Used for classification or prediction tasks, where the goal is to approximate a mapping function that can accurately predict outputs given new inputs.
Examples include classification problems, such as distinguishing between cars and motorcycles, or predicting house prices based on features.

Unsupervised Learning:

Processes unlabeled or raw data.
The model learns from the unlabeled data without the guidance of labels, identifying underlying patterns and structures within the data.
Used for exploratory data analysis, clustering data based on similarities or differences, or identifying underlying patterns within datasets.
Examples include clustering people into groups based on similar features or properties, or finding homogeneous groups in the features of a dataset.

In summary, supervised learning relies on labeled data to learn a mapping function between input and output, while unsupervised learning processes unlabeled data to identify underlying patterns and structures. Supervised learning is typically used for classification or prediction tasks, whereas unsupervised learning is employed for exploratory data analysis.

On this page

What is the Difference Between Supervised and Unsupervised Learning?

Comparative Table: Supervised vs Unsupervised Learning

Here is a table comparing the differences between supervised and unsupervised learning:

Supervised Learning	Unsupervised Learning
Requires labeled input and output data for training	Requires unlabeled input data for training
Models are more predictable and controllable	Models are less predictable and more complex
Used for classification and regression tasks	Used for clustering, association, anomaly detection, and other tasks
Output is more aligned with end-user expectations	Output may not align with specific goals due to lack of expert guidance
Examples: logistic regression, decision trees, neural networks	Examples: k-means clustering, hierarchical clustering, dimensionality reduction algorithms

In summary, supervised learning uses labeled data to train models for classification and regression tasks, making them more predictable and controllable. On the other hand, unsupervised learning relies on unlabeled data to train models for tasks such as clustering, association, and anomaly detection, making them more complex and less predictable.

Guilherme Mazui

Guilherme Mazui is graduated in journalism from the Federal University of Minas Gerais (UFMG) and a master's degree in Communication from the University of São Paulo (USP). In addition, he has experience in advertising writing and has worked as a content editor in several companies.