Optimizing Gemini Agent Loops with Deterministic Token Compression (Skillware)

Hi everyone,

I’m working on Skillware, an open-source framework designed to standardize how we package and deploy agentic capabilities across models.

One of the challenges with long-running Gemini agent loops is managing context-window growth while keeping token costs down. We just merged a new skill: Prompt Token Rewriter (optimization/prompt_rewriter).

This skill acts as a deterministic heuristic middleware that compresses bloated prompts and history by 50-80% before they ever hit the Gemini API.

Because it’s deterministic (uses regex-based heuristics), it doesn’t add another stochastic LLM call or extra billing to your loop—it just strips the conversational “slop” and redundant structures so Gemini can focus on the signal.
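To make the idea concrete, here's a minimal sketch of what deterministic, regex-based prompt compression can look like. This is illustrative only, not the actual prompt_rewriter implementation — the filler patterns and the `compress_prompt` function are hypothetical:

```python
import re

# Hypothetical filler patterns; the real skill's heuristics live in the repo.
FILLER_PATTERNS = [
    r"\b(?:please|kindly|basically|actually|just|really|very)\b",
    r"\b(?:I think|I believe|it seems that|as you can see)\b",
]

def compress_prompt(text: str) -> str:
    """Strip conversational filler, then collapse the whitespace left behind."""
    for pattern in FILLER_PATTERNS:
        text = re.sub(pattern, "", text, flags=re.IGNORECASE)
    text = re.sub(r"[ \t]{2,}", " ", text)   # collapse runs of spaces/tabs
    text = re.sub(r"\n{3,}", "\n\n", text)   # collapse runs of blank lines
    return text.strip()

original = "Could you please just summarize this very long report?"
compressed = compress_prompt(original)
# compressed == "Could you summarize this long report?"
```

Because it's pure regex, the same input always produces the same output — no extra model call, no extra latency beyond a few string passes.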

We’ve built native support for the Gemini SDK with a to_gemini_tool adapter, making it easy to plug these modular skills directly into your GenerativeModel tools.
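As a rough sketch of what an adapter like to_gemini_tool does (this is a hypothetical reconstruction, not the actual Skillware code), it can introspect a skill's signature and docstring and emit a function-declaration dict in the shape Gemini's function-calling API accepts:

```python
import inspect
from typing import Callable

# Minimal mapping from Python annotations to JSON-schema type names.
_TYPE_MAP = {str: "string", int: "integer", float: "number", bool: "boolean"}

def to_gemini_tool(skill: Callable) -> dict:
    """Build a function declaration from a skill's signature and docstring.

    Hypothetical sketch; see the Skillware repo for the real adapter.
    """
    sig = inspect.signature(skill)
    properties = {
        name: {"type": _TYPE_MAP.get(param.annotation, "string")}
        for name, param in sig.parameters.items()
    }
    return {
        "name": skill.__name__,
        "description": (skill.__doc__ or "").strip(),
        "parameters": {
            "type": "object",
            "properties": properties,
            "required": [n for n, p in sig.parameters.items()
                         if p.default is inspect.Parameter.empty],
        },
    }

def rewrite_prompt(prompt: str, max_tokens: int = 2048) -> str:
    """Deterministically compress a prompt before it reaches the model."""
    return prompt  # placeholder skill body

declaration = to_gemini_tool(rewrite_prompt)
```

The resulting declaration can then be passed via the `tools` argument when constructing a GenerativeModel, so the model can call the skill like any other function tool.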

Registry & Integration:
We’re building a community-driven “App Store” for Agentic Skills (Logic + Cognition + Governance). If you’ve been building custom tools for Gemini agents or want to see how we standardize skill delivery, we’d love your feedback or a PR!

Check out the repo and the Gemini implementation guide:

[Repo Link] GitHub - ARPAHLS/skillware: A Python framework for modular, self-contained skill management for machines.