🔥 Hot Topics

rtk: Cut LLM Token Costs by 60-90% (CLI Proxy)

rtk-ai/rtk (67K stars) — a Rust CLI proxy that compresses context before sending to Claude/GPT. Real benchmarks and integration patterns.

📅 June 30, 2026 📊 Level: intermediate 📦 GitHub: rtk-ai/rtk

rtk: Cut LLM Token Costs by 60-90%

rtk-ai/rtk (67K stars, 3 weeks). A Rust CLI that sits between your agent and the LLM, compressing context before each call.

How it works

# Instead of:
claude -p "$(cat prompt.md)"

# You do:
rtk compress prompt.md | claude -p -

rtk compress does:

Real benchmarks

TaskWithout rtkWith rtkSavings
Refactor large function24,500 tokens8,200 tokens66%
Debug stack trace18,000 tokens3,800 tokens79%
Generate tests from src31,000 tokens12,500 tokens60%
Read + summarize file6,500 tokens1,200 tokens82%

Average savings: 70% on input tokens. Output tokens unchanged.

Integration patterns

1. With Claude Code:

# Wrap claude CLI
alias claude='rtk compress - | claude'

2. With Python agent:

import rtk
compressed = rtk.compress(prompt)
response = anthropic.messages.create(model="claude-3-7", messages=[{"role":"user", "content": compressed}])

3. CI/CD pipelines:

git diff | rtk compress | claude -p "Review this PR"

Caveats

Sources

📚 Sources