Create an account, copy your ick_ key, and set your client's base URL to the
gateway. Memory, security, and caching switch on for every call. No SDK, no rewrite.
Works in any client that reads a custom Anthropic base URL. Some IDE agents pin their own endpoint and won’t honor ANTHROPIC_BASE_URL — if yours does, reach out and we’ll help you wire it up.
Prefer to keep billing on your own provider account? Register your provider key and it becomes your Iconia identity. Iconia injects memory and screens the request, then forwards through your credentials. Your key stays yours.
Every response carries receipts — what happened, and how fast.
| x-iconia-augmented | 1 if memory was injected, else 0 |
| x-iconia-items-injected | how many memories were added to the call |
| x-iconia-retrieval-ms | retrieval time in milliseconds |
| x-iconia-cache | hit-semantic · miss · off |
| x-iconia-cache-latency-ms | total time when served from cache |
Repeated meaning shouldn't cost full price twice. Opt in per request with
x-iconia-cache: semantic — same-meaning questions answer from cache without a
model call. Opposite meanings and different questions always miss and go to the model.
When your memories change, affected answers invalidate automatically.