Summer Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: pass65

Exam Professional-Machine-Learning-Engineer All Questions
Exam Professional-Machine-Learning-Engineer All Questions

View all questions & answers for the Professional-Machine-Learning-Engineer exam

Google Machine Learning Engineer Professional-Machine-Learning-Engineer Question # 48 Topic 6 Discussion

Professional-Machine-Learning-Engineer Exam Topic 6 Question 48 Discussion:
Question #: 48
Topic #: 6

You have deployed a scikit-learn model to a Vertex Al endpoint using a custom model server. You enabled auto scaling; however, the deployed model fails to scale beyond one replica, which led to dropped requests. You notice that CPU utilization remains low even during periods of high load. What should you do?


A.

Attach a GPU to the prediction nodes.


B.

Increase the number of workers in your model server.


C.

Schedule scaling of the nodes to match expected demand.


D.

Increase the minReplicaCount in your DeployedModel configuration.


Get Premium Professional-Machine-Learning-Engineer Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.