Integrating MedGemma with LangChain: Challenges and Solutions

Hi everyone, this is Zeerak from Spryt.

We’re building a multi-agent healthcare system for appointment scheduling and patient support using LangChain and LangGraph. Specialized agents (scheduling, rescheduling, cancellation, FAQ) collaborate to handle patient interactions for cervical screening appointments.

Our Setup:

  • Multi-agent system using LangChain/LangGraph
  • Currently supports Anthropic (Claude), Google (Gemini), and Anthropic Vertex models
  • Deployed on Google Cloud using dedicated Vertex AI endpoints
  • Production system handling real patient interactions

The Challenge:

We want to integrate MedGemma into our system, but we’re facing several technical hurdles:

1. Dedicated Endpoint Issue

We’ve discovered that LangChain’s existing Gemma integration (GemmaChatVertexAIModelGarden) doesn’t work with dedicated Vertex AI endpoints. The current implementation is designed for the Model Garden but not for custom DNS endpoints like mg-endpoint-xxxx.europe-west4-xxxx.prediction.vertexai.goog.
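For reference, the request shape we’re experimenting with looks roughly like this. All identifiers below are placeholders, and the `/chat/completions` suffix assumes the serving container exposes an OpenAI-compatible route (some deployments only expose `:rawPredict`), so treat the path as an assumption to verify against your endpoint:

```python
from urllib.parse import urlunsplit

def dedicated_endpoint_url(endpoint_dns: str, project: str,
                           region: str, endpoint_id: str) -> str:
    """Compose a chat-completions URL for a dedicated Vertex AI endpoint.

    Assumes an OpenAI-compatible /chat/completions route on the serving
    container; verify this against your deployment before relying on it.
    """
    path = (f"/v1/projects/{project}/locations/{region}"
            f"/endpoints/{endpoint_id}/chat/completions")
    return urlunsplit(("https", endpoint_dns, path, "", ""))

# Placeholder project/endpoint values -- substitute your own.
url = dedicated_endpoint_url(
    "mg-endpoint-1234.europe-west4-5678.prediction.vertexai.goog",
    "my-project", "europe-west4", "1234")
```

The point is that the client must target the endpoint’s own DNS name rather than the regional `aiplatform.googleapis.com` host that the Model Garden integration assumes.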

2. System Message Support

We’ve confirmed that Gemma models don’t support system messages; they only accept a chat format with alternating user/model turns. Our agents rely heavily on system prompts to define behavior, context, and response formats.
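One workaround we’re considering (a common pattern for Gemma-family models, sketched here with plain role/content dicts rather than LangChain message classes) is to fold the system prompt into the first user turn:

```python
def fold_system_message(messages: list[dict]) -> list[dict]:
    """Merge a leading 'system' message into the first 'user' turn,
    since Gemma chat templates only accept alternating user/model roles.

    Messages are plain {'role': ..., 'content': ...} dicts for illustration.
    """
    if not messages or messages[0]["role"] != "system":
        return messages
    system, rest = messages[0], messages[1:]
    if rest and rest[0]["role"] == "user":
        merged = {"role": "user",
                  "content": f"{system['content']}\n\n{rest[0]['content']}"}
        return [merged] + rest[1:]
    # No user turn to merge into: demote the system prompt to a user turn.
    return [{"role": "user", "content": system["content"]}] + rest

msgs = fold_system_message([
    {"role": "system", "content": "You are a scheduling assistant."},
    {"role": "user", "content": "Book me a screening slot."},
])
# msgs is now a single user turn whose content starts with the system text.
```

How well the model keeps following folded instructions over long conversations is exactly the coherence question we’re asking about below.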

3. Tool Calling

Our agents use LangChain’s tool calling framework extensively. Since MedGemma doesn’t have native tool/function calling support, we need to implement a text-based workaround where:

  • Tool definitions are injected into prompts
  • Tool calls are wrapped in special markers (e.g., ```tool_code```)
  • Responses are parsed to extract tool calls and convert them to LangChain’s expected format
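As a sketch of the parsing step, here is one way to extract `tool_code` blocks from model output. The marker name and the Python-style call syntax (e.g. `get_slots(date="2025-07-01")`) are our own conventions from the prompt, not anything MedGemma emits natively:

```python
import ast
import re

TOOL_BLOCK = re.compile(r"```tool_code\s*(.*?)```", re.DOTALL)

def extract_tool_calls(text: str) -> list[dict]:
    """Find ```tool_code``` fenced blocks and parse each Python-style call
    like get_slots(date="2025-07-01") into {'name': ..., 'args': {...}}."""
    calls = []
    for block in TOOL_BLOCK.findall(text):
        for line in block.strip().splitlines():
            line = line.strip()
            if not line:
                continue
            node = ast.parse(line, mode="eval").body
            if isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
                calls.append({
                    "name": node.func.id,
                    "args": {kw.arg: ast.literal_eval(kw.value)
                             for kw in node.keywords},
                })
    return calls

reply = 'Let me check.\n```tool_code\nget_slots(date="2025-07-01")\n```'
print(extract_tool_calls(reply))
# → [{'name': 'get_slots', 'args': {'date': '2025-07-01'}}]
```

Using `ast` rather than a second regex keeps the argument parsing robust to quoting and nesting; the resulting dicts would still need to be wrapped into LangChain `ToolCall`/`AIMessage` objects by the model wrapper.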

What We’re Looking For:

  1. Has anyone successfully integrated MedGemma with LangChain using dedicated Vertex AI endpoints?
  2. Are there existing patterns or wrappers for handling tool calling with MedGemma in a LangChain context?
  3. Any best practices for converting system messages + tool definitions into MedGemma’s chat format while maintaining conversation coherence?

We’re planning to build a custom wrapper that:

  • Connects to our dedicated endpoint using the OpenAI client approach
  • Converts LangChain messages (including system and tool messages) to MedGemma format
  • Implements text-based tool calling with reliable parsing
  • Maintains compatibility with LangChain’s agent framework
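As a starting point for the message-conversion step, here is a dependency-free sketch that renders role/content dicts into Gemma’s chat template, merging consecutive same-role turns so the alternation rule holds. The `<start_of_turn>` markers follow the published Gemma chat template, but the exact template should be verified against the deployed MedGemma tokenizer:

```python
def to_gemma_prompt(messages: list[dict]) -> str:
    """Render {'role', 'content'} dicts into Gemma's chat template.

    'assistant' maps to 'model'; 'system' and 'tool' are demoted to 'user'
    turns, and consecutive turns with the same role are merged so the
    user/model alternation that Gemma expects is preserved.
    """
    role_map = {"user": "user", "assistant": "model",
                "system": "user", "tool": "user"}
    turns: list[list[str]] = []  # each entry is [role, content]
    for msg in messages:
        role = role_map[msg["role"]]
        if turns and turns[-1][0] == role:
            turns[-1][1] += "\n" + msg["content"]  # merge same-role turns
        else:
            turns.append([role, msg["content"]])
    rendered = "".join(
        f"<start_of_turn>{role}\n{content}<end_of_turn>\n"
        for role, content in turns)
    return rendered + "<start_of_turn>model\n"  # cue the model's reply

prompt = to_gemma_prompt([
    {"role": "system", "content": "Be concise."},
    {"role": "user", "content": "Hi"},
])
```

In the full wrapper this rendering would sit behind a LangChain chat-model subclass, with the tool-definition injection and response parsing layered on either side of it.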

Any insights, code examples, or similar experiences would be greatly appreciated! Happy to share our solution once we get it working.

Thanks!