Most tools I’ve seen focus on observability (logs, traces, dashboards), but not actual enforcement at runtime.
Curious how people here are handling this in production:
- Are you enforcing hard limits (budget, rate, etc.) or just monitoring?
- Do you handle this at the app level or via some middleware/proxy?
- Have you built something in-house for this?
Feels like an unsolved problem, especially with agents.
Would love to hear how others are dealing with it.
4 comments