Edureka has one of the most detailed and comprehensive courses on Apache Spark and Hadoop available online. But before signing up for any online training, go through this overview to get a basic grasp of the technology and its fundamentals.
To learn Spark and Hadoop, you need to start with the basics, i.e. Big Data and the emergence of Hadoop.
Moving forward, you need to focus on the main reason Hadoop became popular: HDFS (Hadoop Distributed File System).
Then take a deep dive into the Hadoop ecosystem and learn the various tools inside it along with their functionalities, so that you know how to create a solution tailored to your requirements.
The main components of HDFS are NameNode and DataNode.
NameNode
It is the master daemon that maintains and manages the DataNodes (slave nodes). It records the metadata of all the files stored in the cluster, e.g. the location of stored blocks, the size of the files, permissions, hierarchy, etc. It records every change that takes place to the file system metadata.
For example, if a file is deleted in HDFS, the NameNode will immediately record this in the EditLog. It regularly receives a Heartbeat and a block report from all the DataNodes in the cluster to ensure that the DataNodes are live. It keeps a record of all the blocks in HDFS and the nodes on which those blocks are stored.
DataNode
These are the slave daemons that run on each slave machine. The actual data is stored on the DataNodes. They are responsible for serving read and write requests from clients. They are also responsible for creating, deleting, and replicating blocks based on the decisions taken by the NameNode.
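To make this division of labour concrete, here is a minimal Scala sketch against Hadoop's FileSystem API. Asking for a file's block locations is a metadata query (answered by the NameNode), while the hosts returned are the DataNodes that actually store the replicas. The path is hypothetical, and the code assumes an HDFS configuration is on the classpath.

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, Path}

object HdfsMetadataDemo {
  def main(args: Array[String]): Unit = {
    // Picks up core-site.xml / hdfs-site.xml from the classpath
    val conf = new Configuration()
    val fs = FileSystem.get(conf)

    // "/data/sample.txt" is a hypothetical path; substitute a real file
    val status = fs.getFileStatus(new Path("/data/sample.txt"))

    // Block locations are metadata, so this query is answered by the NameNode...
    val blocks = fs.getFileBlockLocations(status, 0, status.getLen)
    blocks.foreach { b =>
      // ...while each host listed here is a DataNode holding a replica of that block
      println(s"offset=${b.getOffset} length=${b.getLength} hosts=${b.getHosts.mkString(",")}")
    }
    fs.close()
  }
}
```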
For processing, we use YARN (Yet Another Resource Negotiator). The components of YARN are the ResourceManager and the NodeManager.
ResourceManager
It is a cluster-level component (one per cluster) that runs on the master machine. It manages resources and schedules applications running on top of YARN.
NodeManager
It is a node-level component (one per node) that runs on each slave machine. It is responsible for managing containers and monitoring resource utilization in each container. It also keeps track of node health and log management. It continuously communicates with the ResourceManager to remain up to date.
So, with HDFS storing the data and YARN managing resources, you can perform parallel processing on HDFS using MapReduce.
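As an illustration, here is a minimal sketch of the classic MapReduce word count, written in Scala against the Hadoop MapReduce API (the same program the Hadoop documentation shows in Java), with input and output HDFS paths taken from the command line:

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.Path
import org.apache.hadoop.io.{IntWritable, Text}
import org.apache.hadoop.mapreduce.{Job, Mapper, Reducer}
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat

// Map phase: emit (word, 1) for every token in a line
class TokenizerMapper extends Mapper[Object, Text, Text, IntWritable] {
  private val one = new IntWritable(1)
  private val word = new Text()
  override def map(key: Object, value: Text,
                   context: Mapper[Object, Text, Text, IntWritable]#Context): Unit =
    value.toString.split("\\s+").filter(_.nonEmpty).foreach { token =>
      word.set(token)
      context.write(word, one)
    }
}

// Reduce phase: sum the counts for each word
class IntSumReducer extends Reducer[Text, IntWritable, Text, IntWritable] {
  override def reduce(key: Text, values: java.lang.Iterable[IntWritable],
                      context: Reducer[Text, IntWritable, Text, IntWritable]#Context): Unit = {
    var sum = 0
    val it = values.iterator()
    while (it.hasNext) sum += it.next().get()
    context.write(key, new IntWritable(sum))
  }
}

object WordCount {
  def main(args: Array[String]): Unit = {
    val job = Job.getInstance(new Configuration(), "word count")
    job.setJarByClass(classOf[TokenizerMapper])
    job.setMapperClass(classOf[TokenizerMapper])
    job.setCombinerClass(classOf[IntSumReducer])
    job.setReducerClass(classOf[IntSumReducer])
    job.setOutputKeyClass(classOf[Text])
    job.setOutputValueClass(classOf[IntWritable])
    // Input and output are HDFS paths passed on the command line
    FileInputFormat.addInputPath(job, new Path(args(0)))
    FileOutputFormat.setOutputPath(job, new Path(args(1)))
    System.exit(if (job.waitForCompletion(true)) 0 else 1)
  }
}
```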
Next come the concepts of Pig, Hive and HBase.
Moving on to Spark, you need to learn Scala, as the Spark shell runs on Scala by default.
- Scala is a general-purpose programming language designed to express common programming patterns in a concise, elegant, and type-safe way.
- It supports both object-oriented and functional programming styles, thus helping programmers be more productive (see the short sketch after this list).
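Here is a tiny sketch of what that mix of styles looks like in practice; the Employee class and the numbers are made up purely for illustration:

```scala
// A made-up Employee type: case classes give the object-oriented side,
// while map/filter on immutable collections give the functional side.
case class Employee(name: String, salary: Double)

object ScalaStylesDemo {
  def main(args: Array[String]): Unit = {
    val team = List(Employee("Asha", 90000), Employee("Ben", 75000))

    // Functional style: transform the collection instead of mutating it
    val afterRaise = team.map(e => e.copy(salary = e.salary * 1.1))

    afterRaise
      .filter(_.salary > 85000)
      .foreach(e => println(f"${e.name}: ${e.salary}%.2f"))
  }
}
```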
Moving further forward, you need to learn about RDDs, which are the basic building blocks of any Spark code.
- RDD (Resilient Distributed Dataset) is a distributed memory abstraction that lets programmers perform in-memory computations on large clusters in a fault-tolerant manner.
- RDDs are read-only collections of objects partitioned across a set of machines that can be rebuilt if a partition is lost.
- RDDs can be created from multiple data sources, e.g. Scala collections, the local file system, Hadoop (HDFS), Amazon S3, HBase tables, etc., as the sketch below shows.
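A minimal sketch of those ideas: an RDD built from a Scala collection, lazy transformations, and an action that triggers the computation. It assumes a local-mode Spark setup, and the file path in the comment is just a placeholder.

```scala
import org.apache.spark.{SparkConf, SparkContext}

object RddBasics {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("RddBasics").setMaster("local[*]")
    val sc = new SparkContext(conf)

    // An RDD built from a plain Scala collection, split into 4 partitions
    val numbers = sc.parallelize(1 to 100, 4)

    // Transformations (filter/map) are lazy; the reduce action triggers the job
    val evenSum = numbers.filter(_ % 2 == 0).map(_ * 2).reduce(_ + _)
    println(s"Twice the sum of evens: $evenSum")

    // The same API reads files; hdfs:// and s3a:// URIs work as well
    // (the path below is a placeholder):
    // val lines = sc.textFile("hdfs:///data/input.txt")

    sc.stop()
  }
}
```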
Spark SQL is another main component of Spark, which is very important for processing structured data in a SQL-style format.
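A short sketch of that SQL style, assuming a local SparkSession; the data here is an in-memory Seq just for illustration, but a DataFrame read from JSON, CSV, or Parquet is queried the same way:

```scala
import org.apache.spark.sql.SparkSession

object SparkSqlDemo {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("SparkSqlDemo").master("local[*]").getOrCreate()
    import spark.implicits._

    // Build a small DataFrame from an in-memory Seq (made-up names and ages)
    val people = Seq(("Alice", 34), ("Bob", 28), ("Cara", 45)).toDF("name", "age")

    // Register it as a temporary view and query it in plain SQL
    people.createOrReplaceTempView("people")
    spark.sql("SELECT name FROM people WHERE age > 30").show()

    spark.stop()
  }
}
```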
Next comes Spark's machine learning library, i.e. MLlib, and how it is used to run various ML algorithms, such as regression and K-means clustering, through Spark.
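As a taste of MLlib, here is a minimal K-means sketch using the DataFrame-based API; the four 2-D points are made up, and k=2 is chosen simply to split them into two obvious clusters:

```scala
import org.apache.spark.ml.clustering.KMeans
import org.apache.spark.ml.linalg.Vectors
import org.apache.spark.sql.SparkSession

object KMeansSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("KMeansSketch").master("local[*]").getOrCreate()
    import spark.implicits._

    // Tiny made-up 2-D points; a real job would load a dataset from HDFS/S3
    val data = Seq(
      Vectors.dense(0.0, 0.0), Vectors.dense(1.0, 1.0),
      Vectors.dense(9.0, 8.0), Vectors.dense(8.0, 9.0)
    ).map(Tuple1.apply).toDF("features")

    // Fit K-means with k = 2 and print the learned cluster centers
    val model = new KMeans().setK(2).setSeed(1L).fit(data)
    model.clusterCenters.foreach(println)

    spark.stop()
  }
}
```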
Flume also plays an important role in ingesting streaming data, and so does Kafka.
Spark itself has the ability to process streaming data, which is done through Spark Streaming using DStreams.
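Here is a minimal DStream sketch, assuming a plain text source on localhost:9999 (e.g. `nc -lk 9999` while testing) and 5-second micro-batches:

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

object DStreamWordCount {
  def main(args: Array[String]): Unit = {
    // local[2]: streaming needs at least one thread for the receiver and one for processing
    val conf = new SparkConf().setAppName("DStreamWordCount").setMaster("local[2]")
    val ssc = new StreamingContext(conf, Seconds(5)) // 5-second micro-batches

    // Assumes a text source on localhost:9999 (a placeholder for a real feed)
    val lines = ssc.socketTextStream("localhost", 9999)
    val counts = lines.flatMap(_.split(" ")).map((_, 1)).reduceByKey(_ + _)
    counts.print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```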
Edureka's Apache Spark and Scala Certification Training offers a detailed course specifically designed for the CCA175 exam, covering all of the above-mentioned topics.
Edureka also provides a good list of Spark videos. I would recommend going through the Edureka Spark Playlist as well as the Spark Tutorial.
There are a lot of Hadoop videos too.
Hope this helps.