Will there be an evaluation framework for Google AI models?

VBe · April 24, 2024, 8:30pm

Hi!
Going into production with AI models requires careful assessment of the expected performance. Will there be a framework we can leverage to evaluate the Google LLMs?
I am sure this will be useful for researchers as well.

sps · April 24, 2024, 8:36pm

This sounds interesting. I wonder if we can port the existing evals.

grandell1234 · April 24, 2024, 8:38pm

In Vertex AI on Google Cloud, under the Model Development section, there is an Experiments page that lets you run experiments and then track, visualize, and compare them. Is this roughly what you are looking for?
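For anyone who wants to try this from code, here is a rough sketch of how an evaluation score could be computed locally and then logged as an experiment run with the Vertex AI SDK for Python (`google-cloud-aiplatform`). Everything specific here is an assumption for illustration: the stub model, the tiny eval set, the exact-match metric, and the project, region, experiment, and run names are all placeholders, and this is not an official evaluation framework.

```python
from typing import Callable, Dict, List, Tuple


def exact_match_accuracy(
    model: Callable[[str], str], dataset: List[Tuple[str, str]]
) -> float:
    """Fraction of prompts whose output equals the reference (case-insensitive)."""
    if not dataset:
        raise ValueError("dataset must not be empty")
    hits = sum(
        model(prompt).strip().lower() == expected.strip().lower()
        for prompt, expected in dataset
    )
    return hits / len(dataset)


def log_run_to_vertex(
    run_name: str,
    params: Dict[str, str],
    metrics: Dict[str, float],
    project: str,
    location: str,
    experiment: str,
) -> None:
    """Record one evaluation run in Vertex AI Experiments.

    Needs `pip install google-cloud-aiplatform` and Google Cloud credentials.
    The import is done lazily so the scoring code above runs without them.
    """
    from google.cloud import aiplatform

    aiplatform.init(project=project, location=location, experiment=experiment)
    aiplatform.start_run(run=run_name)
    try:
        aiplatform.log_params(params)
        aiplatform.log_metrics(metrics)
    finally:
        aiplatform.end_run()


def stub_model(prompt: str) -> str:
    """Hypothetical stand-in for a real model call (e.g. a Gemini request)."""
    answers = {"2 + 2 = ?": "4", "Capital of France?": "Paris"}
    return answers.get(prompt, "unknown")


# Made-up eval set of (prompt, expected answer) pairs.
EVAL_SET = [
    ("2 + 2 = ?", "4"),
    ("Capital of France?", "paris"),
    ("Largest planet?", "Jupiter"),
]

if __name__ == "__main__":
    accuracy = exact_match_accuracy(stub_model, EVAL_SET)
    print(f"exact-match accuracy: {accuracy:.2f}")
    # Uncomment and fill in your own values to log the run to Vertex AI:
    # log_run_to_vertex(
    #     run_name="stub-model-run-1",        # placeholder
    #     params={"model": "stub"},
    #     metrics={"exact_match": accuracy},
    #     project="my-project-id",            # placeholder
    #     location="us-central1",             # placeholder
    #     experiment="llm-eval-demo",         # placeholder
    # )
```

Runs logged this way should then show up under the experiment for side-by-side comparison, and `aiplatform.get_experiment_df()` can pull the logged parameters and metrics into a pandas DataFrame.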

VBe · April 24, 2024, 8:59pm

This is very interesting, thanks for sharing the link.
From a first quick look at the extensive documentation, this seems well suited to fine-tuning and other model training jobs. Looking forward to learning more!
