ValueError: Found input variables with inconsistent numbers of samples: [2, 515738].

Question

Hi guys, first I wanted to say that this is my first time trying this. Secondly. I'm not sure I'm placing this question at the right forum. If it's not, please excuse me.&#160;I'm trying to use Naive Bayes on my data. This is my code till now:data = pd.read_json('/Users/rokayadarai/Desktop/Coding/DataSets/Hotel_Reviews.json')
data.head()
#stopword are not useful words (like: a, and, the)
stopset = set(stopwords.words('english'))
vectorizer = TfidfVectorizer(use_idf=True, lowercase=True, strip_accents='ascii', stop_words=stopset)
y = data["Reviewer_Score"]
x = vectorizer.fit_transform(['Ngative_Review', 'Positive_Review'])
#515738 observations and 2(?) unique words
print (y.shape)
print (x.shape)
#split the data - 0.2 means 20% of the data. 123 means use same dataset with every test
x_train, x_test, y_train, y_test = train_test_split(x,y,test_size=0.2,random_state=123)
When I try to run this, I get the error:&#160;ValueError: Found input variables with inconsistent numbers of samples: [2, 515738]. Could please somebody helps me out? I'm stuck and can't seem to find&#160;anything on the&#160;Internet to help me.&#160;

MD · Answer

Hi,You are asking your query in the right place. You might get the above error because of the shape of x and y.&#160;So check the shape of&#160;x&#160;and if it is&#160;1D, then convert it from 1D to 2D.

ValueError Found input variables with inconsistent numbers of samples 2 515738

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Machine Learning

ValueError: Found input variables with inconsistent numbers of samples: [11, 3988]

ValueError: Found input variables with inconsistent numbers of samples: [616, 308]

Found input variables with inconsistent numbers of samples:

problem with Found input variables with inconsistent numbers of samples: [1204, 134]

how can i randomly select items from a list?

how can i count the items in a list?

how do i use the enumerate function inside a list?

Purpose of fit method in sklearn module in python?

ValueError: Found input variables with inconsistent numbers of samples: [1, 1000]

valueerror: found input variables with inconsistent numbers of samples: [40, 10]

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES