I am finding it hard to understand how Naive Bayes works, and I was wondering if someone could explain it as a simple step-by-step process in plain English. I understand that it counts how often things occur and treats those counts as probabilities, but I have no idea how the training data is related to the actual dataset.
Feb 22, 2022 96 views

## 1 answer to this question.

Naive Bayes classification uses probability to classify data points.
Naive Bayes is easier to understand once conditional probability and Bayes' rule are clear. It makes the strong ("naive") assumption that the features are independent of each other given the class.
Conditional probability is the probability that an event happens given that another event has already happened.

• Event A is that it rains today, which has a probability of 0.4 (40%).

• Event B is that you play outside, which has a probability of 0.6 (60%).

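The conditional probability P(Play | Rain) is the probability that you will play given that it is raining. Bayes' rule expresses it in terms of the reverse conditional probability:
P(Play | Rain) = P(Rain | Play) * P(Play) / P(Rain)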
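To make Bayes' rule concrete, here is a minimal sketch that plugs numbers into it. P(Rain) = 0.4 and P(Play) = 0.6 come from the example above; the value of P(Rain | Play) is not given there, so the 0.2 used here is purely an assumption for illustration, and `bayes_rule` is just a helper name chosen for this sketch.

```python
# P(Rain) and P(Play) are taken from the example above.
p_rain = 0.4
p_play = 0.6

# Assumption (not part of the original example):
# it is raining on 20% of the occasions when you play.
p_rain_given_play = 0.2


def bayes_rule(likelihood, prior, evidence):
    """Return the posterior P(A | B) = P(B | A) * P(A) / P(B)."""
    return likelihood * prior / evidence


p_play_given_rain = bayes_rule(p_rain_given_play, p_play, p_rain)
print(f"P(Play | Rain) = {p_play_given_rain:.2f}")
```

Under that assumed likelihood, the chance of playing drops from 60% overall to 30% once you know it is raining.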

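Thus, the probabilities are calculated from known past events. In Naive Bayes, the training data plays the role of those past events: it supplies the counts of how often each class occurs and how often each feature value occurs within each class. The probabilities estimated from those counts are then applied to new, unseen data points.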
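As a rough step-by-step illustration of how training data turns into a prediction, the sketch below counts occurrences in a tiny, made-up dataset and then scores a new data point. The dataset, the feature names, and the helper functions `train` and `predict` are all hypothetical, and the add-one (Laplace) smoothing is one common choice rather than the only way to do it.

```python
from collections import Counter, defaultdict

# Hypothetical training data: (weather, temperature) -> did we play?
training_data = [
    (("rainy", "cool"), "no"),
    (("rainy", "mild"), "no"),
    (("sunny", "mild"), "yes"),
    (("sunny", "hot"), "yes"),
    (("sunny", "cool"), "yes"),
    (("rainy", "mild"), "yes"),
]


def train(rows):
    """Step 1: count classes, and feature values within each class."""
    class_counts = Counter(label for _, label in rows)
    feature_counts = defaultdict(Counter)  # (feature index, label) -> value counts
    values_seen = defaultdict(set)         # feature index -> distinct values
    for features, label in rows:
        for i, value in enumerate(features):
            feature_counts[(i, label)][value] += 1
            values_seen[i].add(value)
    return class_counts, feature_counts, values_seen


def predict(model, features):
    """Step 2: score each class as prior * product of per-feature likelihoods."""
    class_counts, feature_counts, values_seen = model
    total = sum(class_counts.values())
    scores = {}
    for label, count in class_counts.items():
        score = count / total  # prior P(class)
        for i, value in enumerate(features):
            # Add-one smoothing so an unseen value does not zero out the score.
            numerator = feature_counts[(i, label)][value] + 1
            denominator = count + len(values_seen[i])
            score *= numerator / denominator  # P(value | class), assumed independent
        scores[label] = score
    # Step 3: normalise so the scores sum to 1 and read as posteriors.
    norm = sum(scores.values())
    return {label: s / norm for label, s in scores.items()}


model = train(training_data)
posterior = predict(model, ("rainy", "cool"))
prediction = max(posterior, key=posterior.get)
print(posterior, prediction)
```

The training rows are only used to build the count tables; the new point ("rainy", "cool") is classified purely from those tables, which is the relationship between training data and the data you want to classify.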
Naive Bayes is suitable for multiclass prediction and can handle both categorical and numeric data. Numeric features are typically modeled with a distribution such as a Gaussian for each class.
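For numeric features, one common approach is to assume each feature follows a Gaussian distribution within each class and to estimate its mean and variance from the training data. The sketch below does this for a single feature; the temperatures, the class labels, and the function names are invented for illustration only.

```python
import math
from statistics import mean, pvariance

# Hypothetical numeric training data: temperature in degrees Celsius per class.
temps_by_class = {
    "yes": [22.0, 25.0, 24.0, 21.0],
    "no": [10.0, 12.0, 9.0],
}


def gaussian_pdf(x, mu, var):
    """Density of a normal distribution with mean mu and variance var at x."""
    return math.exp(-((x - mu) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)


def classify_temperature(x):
    """Pick the class with the highest prior * Gaussian likelihood."""
    total = sum(len(temps) for temps in temps_by_class.values())
    scores = {}
    for label, temps in temps_by_class.items():
        prior = len(temps) / total
        likelihood = gaussian_pdf(x, mean(temps), pvariance(temps))
        scores[label] = prior * likelihood
    return max(scores, key=scores.get)


print(classify_temperature(23.0), classify_temperature(11.0))
```

With several numeric features, the per-feature likelihoods would be multiplied together, exactly as in the categorical case.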
However, this strong independence assumption is also the main drawback of Naive Bayes, because real-world features are rarely completely independent of each other.
Naive Bayes is used in spam filtering, recommender systems, sentiment analysis, and other applications.
