R: Calculate and interpret odds ratio in logistic regression

Question

I am having trouble interpreting the results of a logistic regression. My outcome variable is&#160;Decision&#160;and is binary (0 or 1, not take or take a product, respectively).My predictor variable is&#160;Thoughts&#160;and is continuous, can be positive or negative, and is rounded up to the 2nd decimal point.I want to know how the probability of taking the product changes as&#160;Thoughts&#160;changes.The logistic regression equation is:glm(Decision ~ Thoughts, family = binomial, data = data)
According to this model,&#160;Thoughts has a significant impact on probability of&#160;Decision&#160;(b = .72, p = .02). To determine the odds ratio of&#160;Decision&#160;as a function of&#160;Thoughts:exp(coef(results))
Odds ratio = 2.07.Questions:How do I interpret the odds ratio?Does an odds ratio of 2.07 imply that a .01 increase (or decrease) in&#160;Thoughts&#160;affect the odds of taking (or not taking) the product by 0.07&#160;ORDoes it imply that as&#160;Thoughts&#160;increases (decreases) by .01, the odds of taking (not taking) the product increase (decrease) by approximately 2 units?How do I convert odds ratio of&#160;Thoughts&#160;to an estimated probability of&#160;Decision?Or can I only estimate the probability of&#160;Decision&#160;at a certain&#160;Thoughts&#160;score (i.e. calculate the estimated probability of taking the product when&#160;Thoughts == 1)?

Nandini · Answer

A logit, or the log of the odds, is the coefficient provided by a logistic regression in r. You can use exponentiation to convert logits to odds ratios, as seen above. The function exp(logit)/(1+exp(logit)) can be used to convert logits to probabilities. There are a few things to keep in mind concerning this process.To begin, I'll utilise some data that can be replicated.library('MASS')
data("menarche")
m<-glm(cbind(Menarche, Total-Menarche) ~ Age, family=binomial, data=menarche)
summary(m)
The Output is:Call:
glm(formula = cbind(Menarche, Total - Menarche) ~ Age, family = binomial, 
    data = menarche)

Deviance Residuals: 
    Min       1Q   Median       3Q      Max  
-2.0363  -0.9953  -0.4900   0.7780   1.3675

Coefficients:
             Estimate Std. Error z value Pr(>|z|)    
(Intercept) -21.22639    0.77068  -27.54   <2e-16 ***
Age           1.63197    0.05895   27.68   <2e-16 ***
---
Signif. codes:  0 &#8216;***&#8217; 0.001 &#8216;**&#8217; 0.01 &#8216;*&#8217; 0.05 &#8216;.&#8217; 0.1 &#8216; &#8217; 1

(Dispersion parameter for binomial family taken to be 1)

Null deviance: 3693.884  on 24  degrees of freedom
Residual deviance:   26.703  on 23  degrees of freedom
AIC: 114.76

Number of Fisher Scoring iterations: 4As in your case, the coefficients displayed are for logits. We can see the sigmoidal function that is characteristic of a logistic model fit to binomial data if we plot this data with this model.#predict gives the predicted value in terms of logits
plot.dat <- data.frame(prob = menarche$Menarche/menarche$Total,
                       age = menarche$Age,
                       fit = predict(m, menarche))
#convert those logit values to probabilities
plot.dat$fit_prob <- exp(plot.dat$fit)/(1+exp(plot.dat$fit))

library(ggplot2)
ggplot(plot.dat, aes(x=age, y=prob)) + 
  geom_point() +
  geom_line(aes(x=age, y=fit_prob))
It's worth noting that the rate of change in probability isn't constant; the curve rises slowly at initially, then accelerates in the middle, before levelling off towards the conclusion. The probability difference between 10 and 12 is much smaller than the probability difference between 12 and 14. This indicates that summarizing the link between age and probability with a single number is difficult without altering probabilities.To respond to your specific inquiries:What does it mean to interpret odds ratios?The probabilities of a "success" (with your data, this is the odds of taking the product) when x = 0 is the odds ratio for the value of the intercept (i.e. zero thoughts). The rise in odds above this value of the intercept when you add one entire x value (i.e. x=1; one thought) is the odds ratio for your coefficient. Using the data from menarche:exp(coef(m))

(Intercept)          Age 
6.046358e-10 5.113931e+00 We can deduce that the chances of menarche occurring at age 0 are.00000000006. Or, to put it another way, nearly impossible. The projected increase in the probabilities of menarche for each unit of age is calculated by exponentiating the age coefficient. It's little over a quintupling in this situation. A one-to-one odds ratio shows no change, while a two-to-one odds ratio indicates a doubling, and so on.Your odds ratio of 2.07 means that increasing 'Thoughts' by one unit raises the chances of taking the product by a factor of 2.07.How do you translate thinking odds ratios to a decision probability estimate?Because the change is not consistent over the range of x values, as shown in the plot above, you must do this for selected values of thinking. Get the following response if you want to know the probability of some value for thoughts:exp(intercept + coef*THOUGHT_Value)/(1+(exp(intercept+coef*THOUGHT_Value))hope this helps.Read the Artificial Intelligence tutorial to learn more about Artificial Intelligence and Machine Learning. Also, enrol in&#160;Machine Learning Course&#160;to become proficient.

R Calculate and interpret odds ratio in logistic regression

Your comment on this question:

1 answer to this question.

Your answer

Your comment on this answer:

Related Questions In Machine Learning

Can someone explain to me the difference between a cost function and the gradient descent equation in logistic regression?

different results for Random Forest Regression in R and Python

Plotting logistic regression in R with the Smarket dataset

How to add random and/or fixed effects into cloglog regression in R

Empirical probability in R with x1+x2>2*x3

Calculate Z-Score from Probability Value - R programming

Calculate the probability in R for sum of two dice rolls

Union probability

difference between a cost function and the gradient descent equation in logistic regression?

Plot logistic regression curve in R

Subscribe to our Newsletter, and get personalized recommendations.

TRENDING CERTIFICATION COURSES

TRENDING MASTERS COURSES

COMPANY

WORK WITH US

DOWNLOAD APP

CATEGORIES

CATEGORIES

TRENDING BLOG ARTICLES

TRENDING BLOG ARTICLES