Logistic Regression with continuous data using sklearn in python

Question

How can I describe these points with a regression? In the example the LinearRegression doesn't fit the logistic distribution of the points. The LogisticRegression() from sklearn just accept binary data. My y-values are continuous from 0 to 1. Do I have to transform the data or how do I get a appropriate model?

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.linear_model import LogisticRegression

a = np.array([1,2,3,4,5,6,7,8,9,10,11,12,13,14])
b = [0,0,0.01,0.08,0.16,00.28,0.5,0.66,0.8,0.9,0.95,0.99,1,1]
data = pd.DataFrame({'x': a, 'y':b})

LM = LinearRegression()
LM.fit(data[["x"]],data[["y"]])

plt.scatter(a,b)
plt.plot([1,14], LM.predict([[1],[14]]), color = "red")
plt.show() 

LogM = LogisticRegression()
LogM.fit(data[["x"]],data[["y"]])  # doesn't work

scatter plot with linear model

Dev · Answer 1 · Mar 23, 2022

Despite the fact that it produces a real between 0 and 1, a logistic regression is commonly used to classify labels. This is why sklearn requires binary data in y: so that the model can be trained
You have a sigmoid function s(x)=1/(1+exp(alpha*x + beta)) and you want to find alpha and beta in your scenario. The simplest method to achieve this, in my opinion, is to first change your data:

ne_a = a[2:-2]
ne_b = np.array(b[2:-2]) # Getting rid of 0 and 1 values
ne_b = np.log((1 / ne_b) - 1)

Because ne_b is now an array with values in the form alpha*ne_a + beta, you can use it to train a LinearRegression model to find alpha:

model = LinearRegression()
model.fit(ne_a.reshape(-1, 1), ne_b.reshape(-1, 1))
alpha = model.coef_[0, 0]
beta = l.predict([[0]])[0, 0]

Finally, you may see if this matches to your expectations:

predicted = 1 / (1 + np.exp(alpha * a + beta))
plt.figure()
plt.plot(a, b)
plt.plot(a, predicted)
plt.show()

enter image description here

Elevate Your Expertise with Our Machine Learning Certification Program!

answered Mar 23, 2022 by Dev
• 6,000 points

Logistic Regression with continuous data using sklearn in python

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Machine Learning

Linear regression with gradient descent to predict house prices using area (one var) in python

Example of Logistic regression with python code

Lear regression using python error - ValueError: operands could not be broadcast together with shapes (1,3) (1,2)

Can we change the sigmoid with tanh in Logistic regression transforms??

How to add outliers to a Linear Regression dataset?

Efficient online linear regression algorithm in python

View onto a numpy array?

Dimension in python numpy

Plotting logistic regression in R with the Smarket dataset

Can someone explain to me the difference between a cost function and the gradient descent equation in logistic regression?

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES