Spark: Can we add a column to a DataFrame?

Question

Can we add a column to a DataFrame? If yes, please share the code.

score +1 · Answer 1 · Aug 9, 2019

Yes, we can add a column using withColumn together with a column expression (here, when/otherwise), as shown below.

import org.apache.spark.sql.SQLContext
import org.apache.spark.sql.functions._ // for `when`

// assumes an existing SparkContext `sc`, e.g. in spark-shell
val sqlContext = new SQLContext(sc)
import sqlContext.implicits._ // for `toDF` and $""

val df = sc.parallelize(Seq((4, "blah", 2), (2, "", 3), (56, "foo", 3), (100, null, 5)))
    .toDF("A", "B", "C")

// D is 0 when B is null or empty, 1 otherwise
val newDf = df.withColumn("D", when($"B".isNull or $"B" === "", 0).otherwise(1))

newDf.show() prints:

+---+----+---+---+
|  A|   B|  C|  D|
+---+----+---+---+
|  4|blah|  2|  1|
|  2|    |  3|  0|
| 56| foo|  3|  1|
|100|null|  5|  0|
+---+----+---+---+

answered Aug 9, 2019 by Shirish

Siva · Answer 2 · Oct 24, 2019

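Yes, we can add columns to an existing data frame. Note that the example below uses a pandas DataFrame, not a Spark one; for a Spark DataFrame, use withColumn instead.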

import pandas as pd

data = {'Name': ['Indis', 'Sachin', 'Rohit', 'Dhoni'],
        'Height': [5.1, 6.2, 5.1, 5.2],
        'Qualification': ['Team', 'Opener', 'Hitman', 'Keeper']}
df = pd.DataFrame(data)

# the list must have one entry per row of df
address = ['India', 'Mumbai', 'Chennai', 'Patna']
df['Address'] = address  # adds the column in place

df
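
The assignment above changes df in place. If you would rather keep the original frame untouched, which is closer to how withColumn behaves in Spark, pandas also offers assign. The following is a minimal sketch reusing part of the data from this answer; the IsTall column and its 6.0 threshold are made up purely for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    "Name": ["Indis", "Sachin", "Rohit", "Dhoni"],
    "Height": [5.1, 6.2, 5.1, 5.2],
})

# assign() returns a new DataFrame and leaves df unchanged.
new_df = df.assign(
    Address=["India", "Mumbai", "Chennai", "Patna"],
    # Hypothetical derived column: computed from an existing one.
    IsTall=lambda d: d["Height"] > 6.0,
)

print(new_df)
```

Passing a callable to assign lets the new column be computed from columns of the frame being built, so no separate intermediate variable is needed.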