
Exam Databricks-Generative-AI-Engineer-Associate All Questions


Databricks Generative AI Engineer Databricks-Generative-AI-Engineer-Associate Question # 16 Topic 2 Discussion

Databricks-Generative-AI-Engineer-Associate Exam Topic 2 Question 16 Discussion:
Question #: 16
Topic #: 2

A Generative AI Engineer has built an LLM-based system that will automatically translate user text between two languages. They now want to benchmark multiple LLMs on this task and pick the best one. They have an evaluation set with known high-quality translation examples. They want to evaluate each LLM using the evaluation set with a performant metric.

Which metric should they choose for this evaluation?


A. ROUGE metric
B. BLEU metric
C. NDCG metric
D. RECALL metric
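
For context on the scenario in the question, where each LLM's output is scored against known reference translations, here is a minimal sketch of how an n-gram overlap metric such as BLEU can be computed. It is a simplified, sentence-level version with a single reference, whitespace tokenization, uniform n-gram weights, and no smoothing. The function names `ngrams` and `simple_bleu` and the sample sentences are hypothetical, chosen for illustration only, and are not taken from any library or from the exam material.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def simple_bleu(candidate, reference, max_n=4):
    """Simplified sentence-level BLEU against a single reference.

    Assumptions: whitespace tokenization, uniform weights, no smoothing.
    """
    cand = candidate.split()
    ref = reference.split()
    if not cand:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(cand, n)
        ref_counts = ngrams(ref, n)
        total = sum(cand_counts.values())
        # Clipped matches: a candidate n-gram is credited at most as many
        # times as it occurs in the reference.
        matched = sum(min(count, ref_counts[gram])
                      for gram, count in cand_counts.items())
        if total == 0 or matched == 0:
            # Without smoothing, any zero n-gram precision makes the score 0.
            return 0.0
        log_precisions.append(math.log(matched / total))

    # Brevity penalty: penalize candidates shorter than the reference.
    if len(cand) > len(ref):
        bp = 1.0
    else:
        bp = math.exp(1 - len(ref) / len(cand))

    # Geometric mean of the n-gram precisions, scaled by the brevity penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


# Hypothetical evaluation example: one reference, two model outputs.
reference = "the cat is on the mat"
outputs = {
    "model_a": "the cat is on the mat",
    "model_b": "the cat sat on the mat",
}
for name, text in outputs.items():
    # max_n=2 keeps this short example from collapsing to 0 without smoothing.
    print(name, simple_bleu(text, reference, max_n=2))
```

In a real benchmark the score would normally be computed over the whole evaluation set with an established implementation (for example the sacreBLEU package or NLTK's `nltk.translate.bleu_score` module) rather than a hand-rolled function like this one.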

