Trending questions in Big Data Hadoop

0 votes
1 answer

Issue While Running Oozie Sample Project

Hey @supriya.  Seems like you have not set ...READ MORE

Jan 4, 2019 in Big Data Hadoop by Omkar
• 69,220 points
978 views
0 votes
1 answer

Not able to start MySql service

Hello. "The system never lies :-P".  The service ...READ MORE

Dec 26, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,322 views
0 votes
1 answer

Processing image in hdfs

If your block size is 64 MB, ...READ MORE

Dec 27, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,265 views
0 votes
1 answer

Hadoop: Adding multiple inputs with a different mapper using python

hadoop jar hadoop-multiple-streaming.jar \ ...READ MORE

Nov 19, 2018 in Big Data Hadoop by Omkar
• 69,220 points
2,882 views
0 votes
1 answer

Hadoop Hive Hbase: How to insert data into Hbase using Hive (JSON file)?

You can use the get_json_object function to parse the ...READ MORE

Nov 15, 2018 in Big Data Hadoop by Omkar
• 69,220 points
3,049 views
0 votes
1 answer

Extracting pig log messages

You can use the following commands in ...READ MORE

Jan 7, 2019 in Big Data Hadoop by Omkar
• 69,220 points
753 views
0 votes
1 answer

What is -cp command in hadoop? How it works?

/user/cloudera/data1 is not a directory, it is ...READ MORE

Oct 17, 2018 in Big Data Hadoop by Frankie
• 9,830 points
4,303 views
+1 vote
1 answer

Not able to start datanode in Hadoop

Follow these steps: Stop namenode Delete the datanode directory ...READ MORE

Dec 12, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,803 views
0 votes
1 answer

Hadoop backup and recovery tool and guidance

There are a few options for backup ...READ MORE

Jan 4, 2019 in Big Data Hadoop by Frankie
• 9,830 points
873 views
–1 vote
1 answer

How to restrict nodes that talk to namenode?

Step 1: Create includes file in /home/hadoop ...READ MORE

Jan 5, 2019 in Big Data Hadoop by Omkar
• 69,220 points
868 views
0 votes
1 answer

Transfer data between 2 Tables having issues with Date conversion from string to Date datatype.

This can be solved making use of ...READ MORE

Dec 28, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,131 views
0 votes
1 answer

Hadoop Spark: __spark_lib.zip does not exist

To find this file, your HADOOP_CONF_DIR env ...READ MORE

Nov 15, 2018 in Big Data Hadoop by Omkar
• 69,220 points
2,965 views
–1 vote
1 answer

Job History Server edit logs

When the application master fails, each file ...READ MORE

Dec 26, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,217 views
0 votes
1 answer

Small files and HDFS blocks

Multiple files are not stored in a ...READ MORE

Nov 16, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,899 views
0 votes
1 answer

How to securely transfer data from rdms to hdfs using sqoop?

Sqoop stores metadata in a repository and ...READ MORE

Dec 18, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,513 views
0 votes
1 answer

Hive ACID semantics

ACID stands for Atomicity, Consistency, Isolation, and Durability.  Until ...READ MORE

Jan 4, 2019 in Big Data Hadoop by Omkar
• 69,220 points
752 views
0 votes
1 answer

How to use mountable HDFS on Centos?

Hey. You can use the following commands ...READ MORE

Nov 26, 2018 in Big Data Hadoop by Omkar
• 69,220 points
2,435 views
0 votes
1 answer

How do I install cloudera Hadoop without internet connection?

You can do that by selecting the ...READ MORE

Dec 12, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,751 views
–1 vote
1 answer

Serde input and output

Input Processing Hive's execution engine (referred to as ...READ MORE

Dec 27, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,133 views
+1 vote
1 answer

What is difference between Hadoop Namenode HA and HDFS federation?

The main difference between HDFS High Availability ...READ MORE

Oct 31, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,545 views
0 votes
1 answer

How can I calculate exact median with Apache Spark?

You need to sort RDD and take ...READ MORE

Oct 8, 2018 in Big Data Hadoop by Omkar
• 69,220 points
4,550 views
0 votes
1 answer

How to execute wordcount in Hadoop?

Follow these steps: Step 1:  Import all these hadoop ...READ MORE

Dec 19, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,425 views
0 votes
1 answer

Pig CLI mode

Pig can be used in two modes: 1) ...READ MORE

Jan 4, 2019 in Big Data Hadoop by Omkar
• 69,220 points
734 views
–1 vote
1 answer

Copying jar file into hadoop

hdfs dfs -put input_file_name output_location READ MORE

Dec 19, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,461 views
+1 vote
1 answer

Why minimum 3 Journal Nodes are required in Hadoop HA architecture?

Initially in Hadoop 1.x, the NameNode was ...READ MORE

Apr 20, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
11,856 views
–1 vote
1 answer

Issue with MySql Server connection

Try to restart the mysqld server and then login: sudo ...READ MORE

Jan 2, 2019 in Big Data Hadoop by Omkar
• 69,220 points
827 views
0 votes
1 answer

Hadoop regarding input split

Hadoop framework divides a large file into ...READ MORE

Dec 27, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,044 views
–1 vote
1 answer

Where does reduce take place in Hadoop MapReduce?

A MapReduce job usually splits the input data-set into ...READ MORE

Dec 18, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,456 views
–1 vote
1 answer

How to Set cluster Rebalancer threshold

The command you are using is wrong. ...READ MORE

Jan 5, 2019 in Big Data Hadoop by Omkar
• 69,220 points
698 views
–1 vote
1 answer

Spark multiple version without major version

Yes. It is not necessary to set ...READ MORE

Dec 27, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,038 views
–1 vote
1 answer

How to start pyspark?

Follow these steps: First start hadoop daemons: cd $HADOOP_HOME/sbin ./start-all.sh Now ...READ MORE

Jan 3, 2019 in Big Data Hadoop by Omkar
• 69,220 points
736 views
0 votes
1 answer

What is distributed Cache in MapReduce Framework ?

Distributed Cache is an important feature provided ...READ MORE

Dec 7, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,865 views
0 votes
2 answers

How to exit NameNode safe mode?

hadoop dfsadmin -safemode leave READ MORE

Apr 11, 2019 in Big Data Hadoop by anonymous
4,169 views
–1 vote
1 answer

When to use Sqoop?

Sqoop is used to transfer any data ...READ MORE

Dec 31, 2018 in Big Data Hadoop by Omkar
• 69,220 points
833 views
0 votes
1 answer

Can you build “Spark” with any particular Hadoop version?

Yes, one can build “Spark” for a specific ...READ MORE

Dec 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,512 views
–1 vote
1 answer

Can't seem to make hdfs commands to work

Check the ip address mentioned in core-site.xml ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,238 views
0 votes
1 answer

Not able to start jobtracker in hadoop

There is no jobtracker in hadoop 2.2.0 YARN framework. ...READ MORE

Dec 11, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,604 views
0 votes
0 answers

wordcount example not running on edureka vm

Hi, wordcount example failing on edureka vm(VM is ...READ MORE

Dec 25, 2018 in Big Data Hadoop by PRASAD
• 120 points
1,035 views
0 votes
1 answer

How to read Spark elements having multiple lines each?

Try this: val new_records = sc.newAPIHadoopRDD(hadoopConf,classOf[ ...READ MORE

Dec 12, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,525 views
+2 votes
5 answers

Not able to start hadoop dfs

You can re-install openssh-client and openssh-server: $ sudo ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Jino
3,432 views
0 votes
1 answer

Cant find slave on ResourceManager Web UI

Seems like it is running on default ...READ MORE

Dec 5, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,852 views
0 votes
1 answer

Mapreduce shuffle and sort

The mapreduce task happens in the following ...READ MORE

Dec 27, 2018 in Big Data Hadoop by Omkar
• 69,220 points
844 views
–1 vote
1 answer

Mapreduce in Python

mapper.py #!/usr/bin/python import sys #Word Count Example # input comes from ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,177 views
0 votes
1 answer

Apache Spark vs MapReduce

Both Spark and Hadoop MapReduce are used ...READ MORE

Dec 19, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,183 views
0 votes
1 answer

How to fix corrupt HDFS FIles?

You can use hdfs fsck / to ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
3,535 views
0 votes
1 answer

Not able to install hadoop using maven

First make sure you have ant installed ...READ MORE

Dec 6, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,700 views
–1 vote
1 answer

Facing the below error while installing mysql in VM

We would like to say that the ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,081 views
0 votes
1 answer

How is a file written in hdfs?

When you copy a file from the ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,045 views
–1 vote
1 answer

Hadoop cluster is not running in vm

First check if all daemons are running: sudo ...READ MORE

Dec 26, 2018 in Big Data Hadoop by Omkar
• 69,220 points
858 views
0 votes
1 answer

All daemons starting except datanode

There could be more than one reason ...READ MORE

Dec 21, 2018 in Big Data Hadoop by Omkar
• 69,220 points
1,022 views