Looking for a platform where we can ...READ MORE
You can use filter to do this. ...READ MORE
Hi @akhtar, The PageRank algorithm outputs a probability distribution ...READ MORE
Hi @akhtar, As we know, text files are in ...READ MORE
I am pretty sure createOrReplaceTempView just replaced ...READ MORE
Spark revolves around the concept of a ...READ MORE
Hi @akhtar, To create multiple producers you have to ...READ MORE
How can one parse an S3 XML ...READ MORE
Hi @akhtar, Generally, Spark Streaming is used for real-time ...READ MORE
Hi, Scala provides a helper class, called App, that ...READ MORE
Since the file is in HDFS so ...READ MORE
The following code that I wrote for ...READ MORE
Hi, Actions are RDD operations that return a value ...READ MORE
Hi, I have the input RDD as a ...READ MORE
Hi, Paired RDD is a distributed collection of ...READ MORE
Hi @akhtar, Yes, Spark Streaming uses checkpointing. A checkpoint is ...READ MORE
Hi @akhtar, DStreams are the basic abstraction that is ...READ MORE
Please go through the explanation below: Full ...READ MORE
Hi @akhtar, There is no concept of indexing in ...READ MORE
Hey @Esha, you can use this code. ...READ MORE
Hi everyone, I tried to count individual words ...READ MORE
Hi, I am getting this error when I try ...READ MORE
You can load a DAT file into ...READ MORE
Hi @akhtar, Here you are trying to save csv ...READ MORE
Try to put the kafka client for ...READ MORE
We do not have access to hive ...READ MORE
Function definition: def test(): Unit = { var a = 10; var b = 20; var c = a + b } calling ...READ MORE
Hi @asif, Share with us please the application ...READ MORE
Suppose you have two dataset results( id, ...READ MORE
Hey, In Apache Spark, the data storage model is ...READ MORE
Seems like you have not started the ...READ MORE
There seems to be a problem with ...READ MORE
Instead of splitting on '\n', you should ...READ MORE
Hey, You can follow the solution below for ...READ MORE
df.registerTempTable("airports") This command is used to register ...READ MORE
You can use the following to print ...READ MORE
You can do this by turning off ...READ MORE
Try df.where($"cola".isNotNull && $"cola" =!= "" && !$"colb".isin(2,3)) your ...READ MORE
Give read-write permissions to the C:\tmp\hive folder. cd to the winutils bin folder ...READ MORE
Yes, we can add columns to the ...READ MORE
Hello, From the error I get that the ...READ MORE
import sqlContext.implicits._ import org.apache.spark.sql.Row import org.apache.spark.sql.types.{StructType, StructField, LongType} val df ...READ MORE
Hi, foreach() operation is an action. It does not ...READ MORE
1) First we loaded the data to ...READ MORE
Hey, sortByKey() is a transformation. It returns an RDD sorted ...READ MORE
1. We will check whether master and ...READ MORE
Please check the links mentioned below for ...READ MORE
As far as I understand your intentions ...READ MORE
In Spark 2.0+ it should look something ...READ MORE
Try this, it should work: > from pyspark.sql.functions ...READ MORE