How to work with Matrix Multiplication in Apache Spark?

Question

I am trying to perform matrix multiplication using Apache Spark and Java, and I need to create an RDD that can represent a matrix in Apache Spark. Can anyone help me with this?

Gitika · Answer

Hey,

Spark MLlib provides several distributed matrix types that are backed by RDDs. You can use any of the following:

IndexedRowMatrix: Can be created directly from an RDD[IndexedRow], where each IndexedRow consists of a row index and an org.apache.spark.mllib.linalg.Vector.

    import org.apache.spark.mllib.linalg.{Vectors, Matrices}
    import org.apache.spark.mllib.linalg.distributed.{IndexedRowMatrix,
      IndexedRow}

    // sc is an existing SparkContext (for example, the one in spark-shell).
    // Each tuple is (row index, row values); the indices must be distinct.
    val rows = sc.parallelize(Seq(
      (0L, Array(1.0, 0.0, 0.0)),
      (1L, Array(0.0, 1.0, 0.0)),
      (2L, Array(0.0, 0.0, 1.0)))
    ).map { case (i, xs) => IndexedRow(i, Vectors.dense(xs)) }

    val indexedRowMatrix = new IndexedRowMatrix(rows)

RowMatrix: Similar to IndexedRowMatrix but without meaningful row indices. Can be created directly from an RDD[org.apache.spark.mllib.linalg.Vector].

    import org.apache.spark.mllib.linalg.distributed.RowMatrix

    // Drop the row indices and keep only the vectors.
    val rowMatrix = new RowMatrix(rows.map(_.vector))

BlockMatrix: Can be created from an RDD[((Int, Int), Matrix)], where the first element of the tuple contains the coordinates of the block and the second is a local org.apache.spark.mllib.linalg.Matrix.

    import org.apache.spark.mllib.linalg.distributed.BlockMatrix

    // 3 x 3 identity matrix in compressed sparse column (CSC) form.
    val eye = Matrices.sparse(
      3, 3, Array(0, 1, 2, 3), Array(0, 1, 2), Array(1.0, 1.0, 1.0))

    // Place the identity block on the diagonal of a 3 x 3 grid of blocks.
    val blocks = sc.parallelize(Seq(
      ((0, 0), eye), ((1, 1), eye), ((2, 2), eye)))

    // Arguments: blocks, rows per block, columns per block, total rows, total columns.
    val blockMatrix = new BlockMatrix(blocks, 3, 3, 9, 9)

Hope it helps.
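For the multiplication itself, here is a minimal sketch of one way to multiply two distributed matrices, assuming the RDD-based MLlib API (spark-mllib on the classpath) and a local master. The object name MatrixMultiplySketch, the helper toBlockMatrix, and the sample values are hypothetical and only for illustration. It builds two small matrices as IndexedRowMatrix, converts them to BlockMatrix, and calls BlockMatrix.multiply.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.mllib.linalg.{Matrix, Vectors}
import org.apache.spark.mllib.linalg.distributed.{BlockMatrix, IndexedRow, IndexedRowMatrix}

object MatrixMultiplySketch {

  // Hypothetical helper: build a distributed BlockMatrix from local rows.
  def toBlockMatrix(sc: SparkContext, rows: Seq[Array[Double]], blockSize: Int): BlockMatrix = {
    val indexed = sc.parallelize(rows.zipWithIndex).map {
      case (xs, i) => IndexedRow(i.toLong, Vectors.dense(xs))
    }
    new IndexedRowMatrix(indexed).toBlockMatrix(blockSize, blockSize)
  }

  // Multiply two small sample matrices and collect the result to the driver.
  def product(sc: SparkContext): Matrix = {
    val a = toBlockMatrix(sc, Seq(Array(1.0, 2.0), Array(3.0, 4.0)), 2)
    val b = toBlockMatrix(sc, Seq(Array(5.0, 6.0), Array(7.0, 8.0)), 2)
    // validate() throws if the block layout is inconsistent.
    a.validate()
    b.validate()
    // Requires a.numCols == b.numRows and a.colsPerBlock == b.rowsPerBlock.
    a.multiply(b).toLocalMatrix()
  }

  def main(args: Array[String]): Unit = {
    // local[*] is an assumption for trying this out on a single machine.
    val conf = new SparkConf().setAppName("matrix-multiply-sketch").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      // Expected values: [[19, 22], [43, 50]]
      println(product(sc))
    } finally {
      sc.stop()
    }
  }
}
```

Note that toLocalMatrix() collects the whole result to the driver, so it is only suitable for small matrices; for large results, keep working with the returned BlockMatrix. If one operand is small enough to fit in memory, IndexedRowMatrix.multiply and RowMatrix.multiply also accept a local Matrix as the right-hand side. The same classes can be used from Java.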
