Pre-Summer Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: validbest

Exam Professional-Data-Engineer All Questions
Exam Professional-Data-Engineer All Questions

View all questions & answers for the Professional-Data-Engineer exam

Google Cloud Certified Professional-Data-Engineer Question # 32 Topic 4 Discussion

Professional-Data-Engineer Exam Topic 4 Question 32 Discussion:
Question #: 32
Topic #: 4

You created an analytics environment on Google Cloud so that your data scientist team can explore data without impacting the on-premises Apache Hadoop solution. The data in the on-premises Hadoop Distributed File System (HDFS) cluster is in Optimized Row Columnar (ORC) formatted files with multiple columns of Hive partitioning. The data scientist team needs to be able to explore the data in a similar way as they used the on-premises HDFS cluster with SQL on the Hive query engine. You need to choose the most cost-effective storage and processing solution. What should you do?


A.

Import the ORC files lo Bigtable tables for the data scientist team.


B.

Import the ORC files to BigOuery tables for the data scientist team.


C.

Copy the ORC files on Cloud Storage, then deploy a Dataproc cluster for the data scientist team.


D.

Copy the ORC files on Cloud Storage, then create external BigQuery tables for the data scientist team.


Get Premium Professional-Data-Engineer Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.