Are audio output tokens equal to text output tokens?
|
|
1
|
16
|
May 22, 2025
|
New Live API Features Don't Work through the API (Proactive audio and Affective dialong)
|
|
0
|
31
|
May 21, 2025
|
Audio has output in ai studio
|
|
1
|
90
|
May 19, 2025
|
Gemini Live Caching
|
|
4
|
73
|
May 16, 2025
|
Timestamp generation (Forced Alignment) on 2.0 production models is still broken
|
|
9
|
201
|
May 14, 2025
|
Gemini flash 1.5 8B having an error with not generating content in audio file
|
|
2
|
48
|
May 14, 2025
|
Capturing emotions with Chirp3 like Multispeaker
|
|
0
|
35
|
May 9, 2025
|
Gemini 2.5 timestamp references for start and end in the prompt are being ignored
|
|
2
|
58
|
May 8, 2025
|
Static Audio Output from Gemini Live API (google-genai SDK) on iOS with AVAudioEngine
|
|
7
|
146
|
May 7, 2025
|
Live API Audio Talk Worse After the Update
|
|
2
|
104
|
April 29, 2025
|
Please add audio-only mode to Youtube link
|
|
2
|
127
|
April 14, 2025
|
Is it possible to get audio output from Gemini in the NON-live API?
|
|
1
|
40
|
April 11, 2025
|
Elevated error rate (>95%) with `gemini-1.5-flash` when processing audo
|
|
4
|
67
|
April 10, 2025
|
Inconsistent Previews and Audio Playback Issues During Scrolling
|
|
0
|
20
|
April 10, 2025
|
Gemini-1.5-flash is no longer processing audio files (500 Exception) - retry does not help
|
|
4
|
83
|
April 9, 2025
|
More audio file type support in (openai-compatible) api?
|
|
3
|
81
|
April 3, 2025
|
Reducing latency for gemini audio prompt requests?
|
|
0
|
33
|
April 2, 2025
|
Audio+Text Stream Modus
|
|
0
|
52
|
March 30, 2025
|
Gemini Flash 2.0 audio transcription timestamps incorrect
|
|
4
|
453
|
March 27, 2025
|
Why it takes 20s to answer in audio, for the Gemini Flash 2.0 exp model?
|
|
0
|
32
|
March 26, 2025
|
Latency problems API gemini 2.0 flash multimodal life
|
|
2
|
86
|
March 25, 2025
|
Is it possible to manually set end of turns for the live api when inputing audio?
|
|
0
|
25
|
March 21, 2025
|
About gemini audio input
|
|
2
|
94
|
March 19, 2025
|
Live API Hangs When Using System Prompt with Audio-Only Response Modality
|
|
0
|
60
|
March 15, 2025
|
How to request access to gemini flash 2.0 experimental Audio API?
|
|
0
|
114
|
March 17, 2025
|
Issue with websocket on react native app
|
|
3
|
91
|
March 10, 2025
|
Timestamp Generation (Forced Alignment) on 2.0-Pro-Exp
|
|
5
|
251
|
March 3, 2025
|
Why not support more voices or support synthesized voices in multimodal live api?
|
|
4
|
68
|
February 20, 2025
|
When is Stream Realtime going to enable voice and screen share at the same time?
|
|
0
|
85
|
February 17, 2025
|
Unexpected global shape for user_input.visi
|
|
1
|
41
|
January 19, 2025
|