Matt Low
dfe43179c0
And calculate the tokens/chunk for gemini responses, fixing the tok/s meter for gemini models. Further, only consider the first candidate of streamed gemini responses. |
||
---|---|---|
.. | ||
anthropic | ||
ollama | ||
openai |
Matt Low
dfe43179c0
And calculate the tokens/chunk for gemini responses, fixing the tok/s meter for gemini models. Further, only consider the first candidate of streamed gemini responses. |
||
---|---|---|
.. | ||
anthropic | ||
ollama | ||
openai |