Recent questions tagged developer

0 votes
1 answer

Sqoop Metastore?

Jul 19, 2018 in Big Data Hadoop by shams
• 3,670 points
1,317 views
0 votes
1 answer

How does the NameNode handle DataNode failures?

Jul 11, 2018 in Big Data Hadoop by Shubham
• 13,490 points
6,295 views
0 votes
1 answer

Kafka topic not being deleted

Jul 9, 2018 in Apache Kafka by Shubham
• 13,490 points
3,059 views
+1 vote
8 answers

How to print the contents of RDD in Apache Spark?

Jul 6, 2018 in Apache Spark by Shubham
• 13,490 points
61,793 views
0 votes
2 answers

How to use RDD filter with another function?

Jul 5, 2018 in Apache Spark by Shubham
• 13,490 points
9,673 views
0 votes
1 answer

How to add third-party Java JARs for use in PySpark?

Jul 4, 2018 in Apache Spark by Shubham
• 13,490 points
8,692 views
+1 vote
1 answer

map vs mapValues in Spark

Jun 29, 2018 in Apache Spark by Shubham
• 13,490 points
16,034 views
+1 vote
3 answers

Which cluster type should I choose for Spark?

Jun 27, 2018 in Apache Spark by Shubham
• 13,490 points
1,567 views
0 votes
1 answer

Which is better in terms of speed, Shark or Spark?

Jun 26, 2018 in Apache Spark by Shubham
• 13,490 points
928 views
0 votes
1 answer

Spark Driver roles

Jun 21, 2018 in Apache Spark by shams
• 3,670 points
1,045 views
0 votes
1 answer

Spark standalone client mode

Jun 20, 2018 in Apache Spark by shams
• 3,670 points
998 views
0 votes
1 answer

Ways to create RDD in Apache Spark

Jun 19, 2018 in Apache Spark by Shubham
• 13,490 points
4,057 views
0 votes
3 answers

Lineage Graph in Spark

Jun 19, 2018 in Apache Spark by Data_Nerd
• 2,390 points
11,729 views
0 votes
1 answer

How does an RDD persist data in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
1,398 views
0 votes
1 answer

What do we mean by an RDD in Spark?

Jun 18, 2018 in Apache Spark by kurt_cobain
• 9,350 points
4,045 views
0 votes
1 answer

Different Hadoop Modes

Jun 13, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
13,063 views
0 votes
1 answer

InputSplit vs HDFS Block

Jun 1, 2018 in Big Data Hadoop by shams
• 3,670 points
4,403 views
0 votes
1 answer

How does partitioning work in Spark?

May 31, 2018 in Apache Spark by coldcode
• 2,090 points
1,214 views
0 votes
1 answer

Is there any way to uncache RDD?

May 30, 2018 in Apache Spark by kurt_cobain
• 9,350 points
1,708 views
0 votes
1 answer

Sqoop vs DistCp

May 30, 2018 in Big Data Hadoop by shams
• 3,670 points
1,326 views
0 votes
1 answer

NameNode without any data

May 29, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,291 views
0 votes
1 answer

How to find max value in pair RDD?

May 26, 2018 in Apache Spark by kurt_cobain
• 9,350 points
7,983 views
0 votes
1 answer

Out of Memory Error in Hadoop

May 22, 2018 in Big Data Hadoop by coldcode
• 2,090 points
1,820 views
0 votes
1 answer

Is an HDFS block sequential?

May 21, 2018 in Big Data Hadoop by Shubham
• 13,490 points
1,606 views
0 votes
1 answer

How to install Hadoop in Ubuntu?

May 17, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
650 views
0 votes
1 answer

Visualization Tool in Cloudera CDH

May 16, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
1,187 views
0 votes
10 answers