Trending questions in Big Data Hadoop

0 votes
1 answer

Is it possible to integrate Kdb+ and hadoop?

kdb is a column oriented database plus ...READ MORE

Oct 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,064 views
0 votes
1 answer

How to discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,941 views
0 votes
1 answer

Backup and Checkpoint Node

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
4,081 views
+1 vote
2 answers

Execute Pig Script from Grunt Shell

From your current directory run  pig -x local Then ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Kunal
6,532 views
0 votes
1 answer

What is Modeling data in Hadoop and how to do it?

I suggest spending some time with Apache ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,915 views
0 votes
1 answer

What is Hadoop Speculative task execution?

One problem with the Hadoop system is ...READ MORE

Oct 3, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,312 views
+3 votes
1 answer

Getting Connection Error while loading data into table using cloudera hive

Hey Nafeesa, Itseems that Hive is not able ...READ MORE

Oct 4, 2018 in Big Data Hadoop by Vardhan
• 13,150 points
1,171 views
0 votes
1 answer

When I try to connect to Hadoop, I get a error as -"Connecting to Resource Manager failed".

Make sure you've started Yarn. Use this ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,496 views
0 votes
1 answer

What is the command to check the number of cores in Spark?

Go to your Spark Web UI & ...READ MORE

May 17, 2018 in Big Data Hadoop by Shubham
• 13,490 points
7,302 views
0 votes
1 answer

How to avoid a “split-brain” scenario with NameNodes?

Okay, so let me tell you that ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
4,875 views
0 votes
1 answer

How to pass large records to map/reduce tasks?

Hadoop is not designed for records about ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,586 views
0 votes
1 answer

What are the prerequisites to learn Hadoop in java perspective?

In my day job, I've just spent ...READ MORE

Oct 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
882 views
0 votes
1 answer

Hadoop Cluster Node Setup.

Yes, at least in small clusters those ...READ MORE

Oct 15, 2018 in Big Data Hadoop by Frankie
• 9,830 points
656 views
+1 vote
1 answer

Where to set hadoop.tmp.dir? core-site.xml or hdfs-site.xml?

hadoop.tmp.dir (A base for other temporary directories) is ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
9,368 views
0 votes
1 answer

Is it possible to start a zookeeper server instance in process?

To start ZooKeeper you have to execute ZooKeeperServerMain class. You can use ...READ MORE

Oct 9, 2018 in Big Data Hadoop by Omkar
• 69,220 points
854 views
0 votes
1 answer

Hadoop absolute path - pwd ?

"hdfs dfs -pwd" does not exist because ...READ MORE

Mar 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
9,280 views
0 votes
1 answer

How do I print hadoop properties in command line?

You can dump Hadoop config by running: $ ...READ MORE

Aug 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,859 views
0 votes
1 answer

Hadoop on OSX “Unable to load realm info from SCDynamicStore”

Add the following to your hadoop-env.sh file: export ...READ MORE

Oct 8, 2018 in Big Data Hadoop by Omkar
• 69,220 points
851 views
0 votes
1 answer

What is Custom partitioner in Hadoop? How to write partition function ?

Don't think that in Hadoop the same ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,704 views
0 votes
1 answer

What should be the choice of database and what type of data format is suitable for Spark/hadoop?

Use Parquet. I'm not sure about CSV ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,107 views
0 votes
1 answer

How hadoop command works with javac?

Javac is used for compiling your java ...READ MORE

Aug 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
2,909 views
0 votes
1 answer

What is the difference between Hadoop MapReduce and built-in MapReduce?

Differences are as follows: Hadoop's MR can be ...READ MORE

Sep 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,818 views
0 votes
1 answer

Is Hadoop a good open-source project to join?

Yes, Hadoop has a whole ecosystem of ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,128 views
0 votes
1 answer

After installing Hadoop on Windows7, NameNode failed to start

You need to install Hadoop winutils to ...READ MORE

May 3, 2018 in Big Data Hadoop by Shubham
• 13,490 points
7,351 views
0 votes
1 answer

Will hadoop(sqoop) load oracle faster than SQL loader?

down vote Had a similar task of exporting ...READ MORE

Sep 4, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,965 views
0 votes
1 answer

What are the hardware requirements for installing Hadoop on my Laptop?

You can either install Apache Hadoop on ...READ MORE

Apr 10, 2018 in Big Data Hadoop by Shubham
• 13,490 points
8,296 views
0 votes
1 answer

What is Dumbo MapReduce in Hadoop?

Doc on Hadoop Streaming : http://hadoop.apache.org/docs/r1.2.1/streaming.html Hadoop streaming is ...READ MORE

Sep 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,147 views
0 votes
1 answer

Using Hadoop for Data Analytics.

Yes, your approach is correct - you ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
875 views
0 votes
1 answer

Where can I find older versions of Hadoop?

You can check here. From the archives. In particular, ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
930 views
0 votes
1 answer

Kafka vs Flume

Even though both are used for real-time ...READ MORE

Sep 24, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,019 views
0 votes
1 answer

Not able to find HDFS directory on my system?

I guess you didn't understand it correctly dfs.datanode.data.dir, ...READ MORE

Apr 17, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
7,919 views
0 votes
1 answer

What is the difference between local file system commands touch and touchz?

Actually they both do the same except touchz is ...READ MORE

Aug 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,791 views
0 votes
1 answer

Does map/reduce merge output files after reduce phase?

Hey there, instead of doing the file ...READ MORE

Sep 25, 2018 in Big Data Hadoop by digger
• 26,740 points
948 views
0 votes
1 answer

Hive with JSON: getting EOF exception while querying

You are missing the commas in your ...READ MORE

Aug 30, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,992 views
0 votes
1 answer

How does the HDFS architecture provide redundancy?

Hadoop provides redundancy by storing multiple replicas ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,120 views
0 votes
1 answer

Where are Hadoop's task log files stored?

For each MapReduce job, Hadoop stores the ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,152 views
0 votes
1 answer

Hbase REST API: Timerange scan

Looking at the XSD, it seems that ...READ MORE

Aug 30, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,952 views
0 votes
1 answer

How to install Sqoop with Hadoop 2.2.0?

You can refer the below link to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,129 views
0 votes
1 answer

What additional capability does Ganglia provide to monitor Hadoop?

Ganglia is an open-source, scalable and distributed ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,063 views
0 votes
1 answer

Hadoop: intervals and JOIN

Hey, a solution was given on Biostar: http://biostar.stackexchange.com/questions/8821. Hope ...READ MORE

Sep 24, 2018 in Big Data Hadoop by slayer
• 29,370 points
775 views
0 votes
1 answer

ISR out of Replica

If the replica stays out of the ...READ MORE

Sep 24, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
751 views
0 votes
2 answers

Which framework allows random, realtime read/write access to hundreds of terabytes of data?

For this type of case you can ...READ MORE

Aug 14, 2018 in Big Data Hadoop by anonymous
2,581 views
0 votes
1 answer

Why Hadoop is not implemented using Message Passing Interface (MPI)?

One of the big features of Hadoop/map-reduce ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
891 views
0 votes
1 answer

How to choose between Cassandra, Membase, Hadoop, MongoDB and RDBMS?

Actually it comes in two ways: One ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
938 views
0 votes
1 answer

Difference between a failed task attempt and a killed task attempt

Let me explain you the difference between ...READ MORE

Jul 24, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,339 views
0 votes
1 answer

How to access different directories in a Hadoop cluster?

You need to configure the client to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
904 views
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain you step by step.  Each ...READ MORE

Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,358 views
0 votes
1 answer

Determining how many mappers will run?

Here what happens is, each file would ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,651 views
0 votes
1 answer

submitJob method in JobClient of Hadoop throws null pointer

I had this problem as well. But when ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Neha
• 6,300 points
883 views
0 votes
1 answer

How to achieve maximum memory to disk I/O ratio?

You need to tune io.sort.mb value until ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
824 views