Trending questions in Big Data Hadoop

0 votes
1 answer

Is it possible to integrate Kdb+ and Hadoop?

kdb is a column-oriented database plus ...READ MORE

Oct 10, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,062 views
0 votes
1 answer

How to discover missing or corrupt HDFS data?

HDFS supports fsck command to check for ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,938 views
0 votes
1 answer

Backup and Checkpoint Node

Checkpoint Node keeps track of the latest ...READ MORE

Jul 31, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
4,079 views
+1 vote
2 answers

Execute Pig Script from Grunt Shell

From your current directory, run pig -x local. Then ...READ MORE

Oct 25, 2018 in Big Data Hadoop by Kunal
6,525 views
0 votes
1 answer

What is data modeling in Hadoop and how to do it?

I suggest spending some time with Apache ...READ MORE

Sep 19, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,914 views
0 votes
1 answer

What is Hadoop Speculative task execution?

One problem with the Hadoop system is ...READ MORE

Oct 3, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,311 views
+3 votes
1 answer

Getting Connection Error while loading data into table using cloudera hive

Hey Nafeesa, it seems that Hive is not able ...READ MORE

Oct 4, 2018 in Big Data Hadoop by Vardhan
• 13,150 points
1,169 views
0 votes
1 answer

When I try to connect to Hadoop, I get an error: "Connecting to Resource Manager failed".

Make sure you've started Yarn. Use this ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,495 views
0 votes
1 answer

What is the command to check the number of cores in Spark?

Go to your Spark Web UI & ...READ MORE

May 17, 2018 in Big Data Hadoop by Shubham
• 13,490 points
7,299 views
0 votes
1 answer

How to avoid a “split-brain” scenario with NameNodes?

Okay, so let me tell you that ...READ MORE

Jul 11, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
4,869 views
0 votes
1 answer

How to pass large records to map/reduce tasks?

Hadoop is not designed for records about ...READ MORE

Sep 25, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,585 views
0 votes
1 answer

What are the prerequisites to learn Hadoop from a Java perspective?

In my day job, I've just spent ...READ MORE

Oct 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
881 views
0 votes
1 answer

Hadoop Cluster Node Setup.

Yes, at least in small clusters those ...READ MORE

Oct 15, 2018 in Big Data Hadoop by Frankie
• 9,830 points
656 views
+1 vote
1 answer

Where to set hadoop.tmp.dir? core-site.xml or hdfs-site.xml?

hadoop.tmp.dir (A base for other temporary directories) is ...READ MORE

Mar 26, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
9,365 views
0 votes
1 answer

Is it possible to start a ZooKeeper server instance in-process?

To start ZooKeeper you have to execute ZooKeeperServerMain class. You can use ...READ MORE

Oct 9, 2018 in Big Data Hadoop by Omkar
• 69,220 points
853 views
0 votes
1 answer

Hadoop absolute path - pwd?

"hdfs dfs -pwd" does not exist because ...READ MORE

Mar 27, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
9,274 views
0 votes
1 answer

Hadoop on OSX “Unable to load realm info from SCDynamicStore”

Add the following to your hadoop-env.sh file: export ...READ MORE

Oct 8, 2018 in Big Data Hadoop by Omkar
• 69,220 points
851 views
0 votes
1 answer

How do I print Hadoop properties on the command line?

You can dump Hadoop config by running: $ ...READ MORE

Aug 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,856 views
0 votes
1 answer

What is a custom partitioner in Hadoop? How to write a partition function?

Don't think that in Hadoop the same ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,702 views
0 votes
1 answer

What should be the choice of database, and what type of data format is suitable for Spark/Hadoop?

Use Parquet. I'm not sure about CSV ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,106 views
0 votes
1 answer

How does the hadoop command work with javac?

Javac is used for compiling your java ...READ MORE

Aug 17, 2018 in Big Data Hadoop by Neha
• 6,300 points
2,907 views
0 votes
1 answer

What is the difference between Hadoop MapReduce and built-in MapReduce?

Differences are as follows: Hadoop's MR can be ...READ MORE

Sep 11, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,817 views
0 votes
1 answer

Is Hadoop a good open-source project to join?

Yes, Hadoop has a whole ecosystem of ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,127 views
0 votes
1 answer

After installing Hadoop on Windows 7, NameNode failed to start

You need to install Hadoop winutils to ...READ MORE

May 3, 2018 in Big Data Hadoop by Shubham
• 13,490 points
7,349 views
0 votes
1 answer

Will Hadoop (Sqoop) load Oracle faster than SQL Loader?

Had a similar task of exporting ...READ MORE

Sep 4, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,964 views
0 votes
1 answer

What are the hardware requirements for installing Hadoop on my Laptop?

You can either install Apache Hadoop on ...READ MORE

Apr 10, 2018 in Big Data Hadoop by Shubham
• 13,490 points
8,291 views
0 votes
1 answer

What is Dumbo MapReduce in Hadoop?

Doc on Hadoop Streaming : http://hadoop.apache.org/docs/r1.2.1/streaming.html Hadoop streaming is ...READ MORE

Sep 23, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,146 views
0 votes
1 answer

Using Hadoop for Data Analytics.

Yes, your approach is correct - you ...READ MORE

Sep 28, 2018 in Big Data Hadoop by Frankie
• 9,830 points
873 views
0 votes
1 answer

Where can I find older versions of Hadoop?

You can check here. From the archives. In particular, ...READ MORE

Sep 27, 2018 in Big Data Hadoop by Frankie
• 9,830 points
927 views
0 votes
1 answer

Kafka vs Flume

Even though both are used for real-time ...READ MORE

Sep 24, 2018 in Big Data Hadoop by Data_Nerd
• 2,390 points
1,016 views
0 votes
1 answer

What is the difference between local file system commands touch and touchz?

Actually they both do the same except touchz is ...READ MORE

Aug 14, 2018 in Big Data Hadoop by Frankie
• 9,830 points
2,788 views
0 votes
1 answer

Not able to find HDFS directory on my system?

I guess you didn't understand it correctly: dfs.datanode.data.dir, ...READ MORE

Apr 17, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
7,914 views
0 votes
1 answer

Does map/reduce merge output files after reduce phase?

Hey there, instead of doing the file ...READ MORE

Sep 25, 2018 in Big Data Hadoop by digger
• 26,740 points
946 views
0 votes
1 answer

Hive with JSON: getting EOF exception while querying

You are missing the commas in your ...READ MORE

Aug 30, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,990 views
0 votes
1 answer

How does the HDFS architecture provide redundancy?

Hadoop provides redundancy by storing multiple replicas ...READ MORE

Sep 19, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,118 views
0 votes
1 answer

Where are Hadoop's task log files stored?

For each MapReduce job, Hadoop stores the ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,150 views
0 votes
1 answer

HBase REST API: Timerange scan

Looking at the XSD, it seems that ...READ MORE

Aug 30, 2018 in Big Data Hadoop by Neha
• 6,300 points
1,950 views
0 votes
1 answer

How to install Sqoop with Hadoop 2.2.0?

You can refer to the link below to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
1,127 views
0 votes
1 answer

What additional capability does Ganglia provide to monitor Hadoop?

Ganglia is an open-source, scalable and distributed ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
1,061 views
0 votes
1 answer

Hadoop: intervals and JOIN

Hey, a solution was given on Biostar: http://biostar.stackexchange.com/questions/8821. Hope ...READ MORE

Sep 24, 2018 in Big Data Hadoop by slayer
• 29,370 points
774 views
0 votes
1 answer

ISR out of Replica

If the replica stays out of the ...READ MORE

Sep 24, 2018 in Big Data Hadoop by kurt_cobain
• 9,350 points
748 views
0 votes
2 answers

Which framework allows random, realtime read/write access to hundreds of terabytes of data?

For this type of case you can ...READ MORE

Aug 14, 2018 in Big Data Hadoop by anonymous
2,579 views
0 votes
1 answer

Why is Hadoop not implemented using Message Passing Interface (MPI)?

One of the big features of Hadoop/map-reduce ...READ MORE

Sep 21, 2018 in Big Data Hadoop by Frankie
• 9,830 points
889 views
0 votes
1 answer

How to choose between Cassandra, Membase, Hadoop, MongoDB and RDBMS?

Actually it comes in two ways: One ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Frankie
• 9,830 points
935 views
0 votes
1 answer

Difference between a failed task attempt and a killed task attempt

Let me explain the difference between ...READ MORE

Jul 24, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,335 views
0 votes
1 answer

How to access different directories in a Hadoop cluster?

You need to configure the client to ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
902 views
0 votes
1 answer

What metadata is stored on a DataNode when a block is written to it?

Let me explain step by step. Each ...READ MORE

Jul 23, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
3,356 views
0 votes
1 answer

Determining how many mappers will run?

Here, what happens is that each file would ...READ MORE

Aug 9, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
2,649 views
0 votes
1 answer

submitJob method in JobClient of Hadoop throws null pointer

I had this problem as well. But when ...READ MORE

Sep 18, 2018 in Big Data Hadoop by Neha
• 6,300 points
882 views
0 votes
1 answer

How to achieve maximum memory to disk I/O ratio?

You need to tune io.sort.mb value until ...READ MORE

Sep 18, 2018 in Big Data Hadoop by nitinrawat895
• 11,380 points
823 views