The whole stack uses Google tools: Gemini models, Google Text-to-Speech, Speech-to-Text, an attention mechanism, and partial vision, as we send single camera frames. Code will soon be available on Patreon.
Tools will be easy to integrate.
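
Until the code is out, here is a rough sketch of how one turn of such a stack could be wired: speech in, one camera frame plus text to the model, speech out, with a simple tool registry. Everything here is an assumption for illustration: the `Assistant` class, `register_tool`, `turn`, and the `tool:<name> <arg>` reply convention are hypothetical, and the three callables are stand-ins for the real Speech-to-Text, Gemini, and Text-to-Speech calls, not the actual Google APIs or the project's code.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class Assistant:
    # Stand-ins for the real services (hypothetical signatures).
    transcribe: Callable[[bytes], str]               # speech-to-text
    generate: Callable[[str, Optional[bytes]], str]  # model: text + one frame
    synthesize: Callable[[str], bytes]               # text-to-speech
    tools: Dict[str, Callable[[str], str]] = field(default_factory=dict)

    def register_tool(self, name: str, fn: Callable[[str], str]) -> None:
        """Add a tool the model can ask for by name."""
        self.tools[name] = fn

    def turn(self, audio: bytes, frame: Optional[bytes]) -> bytes:
        """Run one turn: audio in, single camera frame (or None), audio out."""
        text = self.transcribe(audio)
        reply = self.generate(text, frame)
        # Assumed convention: a reply of "tool:<name> <arg>" requests a tool.
        if reply.startswith("tool:"):
            name, _, arg = reply[len("tool:"):].partition(" ")
            if name in self.tools:
                reply = self.tools[name](arg)
        return self.synthesize(reply)
```

With this shape, adding a tool is one `register_tool` call, and swapping a stand-in for a real client only means passing a different callable.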