

Why do you say that? I have had no reason to doubt their reporting
I have been pretty impressed by Gemini 2.0 Flash.
It's slightly worse than the very best on the benchmarks I have seen, but it is pretty much instant and incredibly cheap. Maybe a loss leader?
Anyway, which of the commercial models do you consider to be good?
FYI: Kagi uses the Russian search engine Yandex as well and has no plans of changing that.
I mention it because, to some, indirectly financially supporting a Russian company might be an issue.
Just search for Kagi Yandex and you will get plenty of official sources.
So there aren't any trustworthy benchmarks I can currently use to evaluate them? Benchmarks combined with my personal anecdotes are how I have been evaluating them.
I was pretty impressed with Deepseek R1. I used their app, but not for anything sensitive.
I don’t like that OpenAI defaults to a model I can’t pick. I have to select it each time; even when I use a special URL, it changes after the first request.
I am having a hard time deciding which models to use besides a random mix between o3-mini-high, o1, Sonnet 3.5 and Gemini 2 Flash