ValueError: Unknown label type: (array([ 7.5, 9.2, 9.2, ..., 5.8, 10. , 9.6]),)

Question

Hi guys, I'm trying to use the Naive Bayes Algorithm on my dataset.

This is my code:

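# imports needed by the code below
import pandas as pd
from nltk.corpus import stopwords
from sklearn import naive_bayes
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split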

data = pd.read_json('/Users/rokayadarai/Desktop/Coding/DataSets/Hotel_Reviews.json')

data.head()

# stop words (a, and, the) carry little information, so remove them

stopset = set(stopwords.words('english'))

vectorizer = TfidfVectorizer(use_idf=True, lowercase=True, strip_accents='ascii', stop_words=stopset)

# merge the Negative_Review and Positive_Review columns into one

data['Reviews'] = data['Negative_Review'] + data['Positive_Review']

y = data.Reviewer_Score

X = vectorizer.fit_transform(data.Reviews)

# 515738 observations and 83941 unique words

print(y.shape)

print(X.shape)

# split the data: test_size=0.2 holds out 20% for testing; random_state=123 makes the split reproducible

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=123)

# train the naive Bayes classifier

classifier = naive_bayes.MultinomialNB()

classifier.fit(X_train, y_train)

But after running it I keep getting the error:

ValueError: Unknown label type: (array([ 7.5, 9.2, 9.2, ..., 5.8, 10. , 9.6]),) for the line classifier.fit(X_train, y_train)

Could somebody please help me out?

MD · Answer 1 · Dec 16, 2020

Hi,

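There is a problem with your steps. Before you fit the model, inspect the dataset: check the format and type of each column, and in particular of X_train and y_train. The error comes from y_train: Reviewer_Score holds continuous float values (7.5, 9.2, ...), while MultinomialNB is a classifier and expects discrete class labels. Either convert the scores into discrete classes before fitting, or use a regression model if you want to predict the score itself.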
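A minimal sketch of that check, run on a tiny made-up DataFrame rather than the real Hotel_Reviews.json. The four rows, the bin edges 0/5/8/10, and the name y_class are assumptions for illustration only. It shows that float scores are treated as a continuous target, which MultinomialNB rejects, and one possible way around it: binning the scores into discrete classes before fitting.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.utils.multiclass import type_of_target

# Hypothetical stand-in for the hotel review data (values are made up).
data = pd.DataFrame({
    "Reviews": [
        "dirty room rude staff",
        "great location friendly staff",
        "noisy and small room",
        "lovely breakfast clean room",
    ],
    "Reviewer_Score": [3.8, 9.2, 5.4, 10.0],
})

X = TfidfVectorizer().fit_transform(data["Reviews"])
y = data["Reviewer_Score"]

# Step 1: inspect the target. Float scores count as a continuous target.
print(y.dtype)            # float64
print(type_of_target(y))  # continuous

# Step 2: reproduce the failure. A classifier refuses continuous labels.
try:
    MultinomialNB().fit(X, y)
    fit_error = None
except ValueError as exc:
    fit_error = exc
print("fit on raw scores failed:", fit_error)

# Step 3: one possible fix (an assumption, not the only option):
# bin the scores into integer classes 0 = low, 1 = medium, 2 = high.
# The bins are (0, 5], (5, 8], (8, 10]; a score outside them becomes NaN.
y_class = pd.cut(y, bins=[0, 5, 8, 10], labels=False)
print(type_of_target(y_class))  # multiclass

# Step 4: the classifier now accepts the discrete labels.
classifier = MultinomialNB().fit(X, y_class)
pred = classifier.predict(X)
print(pred)
```

If the goal is to predict the numeric score itself rather than a class, the other route is a regression model (for example sklearn.linear_model.Ridge), which is a different technique from the Naive Bayes classifier used in the question.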

