◊ 03_OPEN_SOURCE
Open Source
Building blocks for multi-agent systems.
A zero-dependency SQLite memory layer for AI agents that learns what works. Most memory tools store what you tell them; metabrain closes the loop—a pattern recorded enough times graduates into a hypothesis, every outcome becomes an experiment for or against it, and proven hypotheses graduate into preferences the agent runs on.
Deterministic by construction: using the API correctly populates every table as a side effect. Seven tables, one file, stdlib sqlite—no vector database, no server, no API key. General across self-learning content engines, lead capture, and self-improving job applications.
Most agent memory remembers what you told it. The memory that compounds is the kind that proves which lessons actually work—and promotes them.
status
Published · PyPI + GitHub
stack
Python · SQLite · zero-dependency
install
pip install metabrain
license
MIT
Self-evolving Claude Code plugin built from failure paths, not theory. Every pattern earned by breaking something first, then encoding the fix so it never breaks the same way twice.
AgentDB-first methodology where every artifact reads on start, writes on end. Multi-agent orchestration with contracts, checkpoints, and verdicts. An experiment engine that proves which workflows actually hold up. Active development, open source.
Representation is the bottleneck. Markdown is optimized for human eyes—terrible for agent coordination. SQLite is optimized for structured retrieval.
status
Active Development · Production Validated
stack
Claude Code · SQLite · Shell
methodology
AgentDB · Contracts · Orchestration
license
MIT
Practical benchmark for local and open-source LLMs—21 programmatically-verified tests that mirror real work: extracting structured data, finding bugs, resisting prompt injection, reasoning through dependency chains. Not another MMLU wrapper.
Every test has a deterministic verifier—no LLM-as-judge, no vibes. Benchmarks any local model against Claude haiku/sonnet/opus tiers through headless Claude Code, no API key required. Collaborative: run it on your hardware, submit results, build a shared performance map.
Academic scores don't predict real workflow performance. The only benchmark that matters is whether the model can do the actual job—verified, not judged.
status
Active Development · Open Source
stack
Python · Ollama · Claude Code
tests
21 verified · standard + adversarial
license
MIT
A2A communication protocol with 3x semantic density. Natural language is inefficient for agent coordination—ambiguity, wasted tokens, latency.
Symbols trigger pre-trained statistical patterns from math, programming, and config files.
A self-evolving agent framework—AI agents that get better every time they run. Normally you deploy an agent, then burn weeks hand-tuning prompts, swapping models, and adjusting temperatures, and redo it all whenever user behavior shifts. Armature wraps the agent in a reinforcement-learning loop so it improves automatically from its own outcomes.
Reinforcement learning + evolutionary algorithms + self-improving policies: weak behaviors die off, strong ones survive and propagate. Published to PyPI as armature-ai and integrated into HeyContext.
Manual prompt-tuning doesn't scale and breaks every time user behavior shifts. The agent should learn from its own results instead of waiting for you to tune it.
method
Reinforcement learning · Evolutionary selection
validation
Hackathon winner (Weavehacks RL Track) · Production deployed
distribution
pip install armature-ai · GitHub
license
MIT