Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Cloudera CCA175 Exam

Certification Provider: Cloudera
Exam Name: CCA Spark and Hadoop Developer
Number of questions in our database: 96
Exam Version: Apr. 06, 2024
CCA175 Exam Official Topics:
  • Topic 1: Understand the fundamentals of querying datasets in Spark/ Write the results back into HDFS using Spark
  • Topic 2: Write queries that calculate aggregate statistics/ Load data from HDFS for use in Spark applications
  • Topic 3: Use meta store tables as an input source or an output sink for Spark applications/ Filter data using Spark
  • Topic 4: Generate reports by using queries against loaded data/ Produce ranked or sorted data
  • Topic 5: Perform standard extract, transform, load (ETL) processes on data using the Spark API/ Join disparate datasets using Spark
  • Topic 6: Use Spark SQL to interact with the meta store programmatically in your applications/ Read and write files in a variety of file formats
Disscuss Cloudera CCA175 Topics, Questions or Ask Anything Related

Currently there are no comments in this discussion, be the first to comment!

Free Cloudera CCA175 Exam Actual Questions

The questions for CCA175 were last updated On Apr. 06, 2024

Question #1

Problem Scenario 92 : You have been given a spark scala application, which is bundled in jar named hadoopexam.jar.

Your application class name is com.hadoopexam.MyTask

You want that while submitting your application should launch a driver on one of the cluster node.

Please complete the following command to submit the application.

spark-submit XXX -master yarn \

YYY SSPARK HOME/lib/hadoopexam.jar 10

Reveal Solution Hide Solution
Correct Answer: B

Question #2

Problem Scenario 75 : You have been given MySQL DB with following details.

user=retail_dba

password=cloudera

database=retail_db

table=retail_db.orders

table=retail_db.order_items

jdbc URL = jdbc:mysql://quickstart:3306/retail_db

Please accomplish following activities.

1. Copy "retail_db.order_items" table to hdfs in respective directory p90_order_items .

2. Do the summation of entire revenue in this table using pyspark.

3. Find the maximum and minimum revenue as well.

4. Calculate average revenue

Columns of ordeMtems table : (order_item_id , order_item_order_id , order_item_product_id, order_item_quantity,order_item_subtotal,order_item_subtotal,order_item_product_price)

Reveal Solution Hide Solution
Correct Answer: B

Question #3

Problem Scenario 94 : You have to run your Spark application on yarn with each executor 20GB and number of executors should be 50.Please replace XXX, YYY, ZZZ

export HADOOP_CONF_DIR=XXX

./bin/spark-submit \

-class com.hadoopexam.MyTask \

xxx\

-deploy-mode cluster \ # can be client for client mode

YYY\

222 \

/path/to/hadoopexam.jar \

1000

Reveal Solution Hide Solution
Correct Answer: A

Question #4

Problem Scenario 69 : Write down a Spark Application using Python,

In which it read a file "Content.txt" (On hdfs) with following content.

And filter out the word which is less than 2 characters and ignore all empty lines.

Once doen store the filtered data in a directory called "problem84" (On hdfs)

Content.txt

Hello this is ABCTECH.com

This is ABYTECH.com

Apache Spark Training

This is Spark Learning Session

Spark is faster than MapReduce

Reveal Solution Hide Solution
Correct Answer: A

Question #5

Problem Scenario 91 : You have been given data in json format as below.

{"first_name":"Ankit", "last_name":"Jain"}

{"first_name":"Amir", "last_name":"Khan"}

{"first_name":"Rajesh", "last_name":"Khanna"}

{"first_name":"Priynka", "last_name":"Chopra"}

{"first_name":"Kareena", "last_name":"Kapoor"}

{"first_name":"Lokesh", "last_name":"Yadav"}

Do the following activity

1. create employee.json tile locally.

2. Load this tile on hdfs

3. Register this data as a temp table in Spark using Python.

4. Write select query and print this data.

5. Now save back this selected data in json format.

Reveal Solution Hide Solution
Correct Answer: B


Unlock all CCA175 Exam Questions with Advanced Practice Test Features:
  • Select Question Types you want
  • Set your Desired Pass Percentage
  • Allocate Time (Hours : Minutes)
  • Create Multiple Practice tests with Limited Questions
  • Customer Support
Get Full Access Now

Save Cancel