    Onur Solmaz

    A throttling protocol for model providers

    @onusoz · 2026/04/04
    A more reasonable long-term option for Anthropic is to create a throttling protocol: a standardized, harness-agnostic protocol that lets model providers send warnings and throttle usage in real time.

    Harnesses would implement the protocol. A client can first be warned. If it doesn't listen, it can be temporarily blocked on the server side, or banned permanently if it breaks the rules too many times.

    Needless to say, throttling could easily be done on the server side first. That would fix the load issue for them in the short run without banning the user, at the cost of a worse, delayed UX. They probably already do this to prevent abuse.

    The suggested protocol would then also save the user from abuse-related delays, and inform the harness developer when they do something wrong.
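
To make the warn, block, ban escalation concrete, here is a rough sketch of what the provider-side policy could look like. Everything in it is hypothetical: the `Throttler` class, the `Action` names, the thresholds, and the `over_limit` flag (how a provider decides a request breaks the rules is out of scope here). It is an illustration of the idea, not a description of how any provider actually does it.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    ALLOW = "allow"  # serve the request normally
    WARN = "warn"    # serve the request, attach a warning for the harness
    BLOCK = "block"  # reject temporarily on the server side
    BAN = "ban"      # reject permanently


@dataclass
class ClientState:
    violations: int = 0
    blocked_until: float = 0.0
    banned: bool = False


class Throttler:
    """Hypothetical escalation policy: warn first, then block, then ban."""

    def __init__(self, warn_limit=1, ban_limit=3, block_seconds=60.0):
        self.warn_limit = warn_limit        # violations that only trigger a warning
        self.ban_limit = ban_limit          # violations that trigger a permanent ban
        self.block_seconds = block_seconds  # length of a temporary block
        self.clients = {}

    def check(self, client_id: str, over_limit: bool, now: float) -> Action:
        state = self.clients.setdefault(client_id, ClientState())
        if state.banned:
            return Action.BAN
        if now < state.blocked_until:
            # Still inside a temporary block; do not count this as a new violation.
            return Action.BLOCK
        if not over_limit:
            return Action.ALLOW
        state.violations += 1
        if state.violations >= self.ban_limit:
            state.banned = True
            return Action.BAN
        if state.violations > self.warn_limit:
            state.blocked_until = now + self.block_seconds
            return Action.BLOCK
        return Action.WARN
```

With the default thresholds, a harness that keeps ignoring the signal gets a warning on its first violation, a 60-second block on its second, and a ban on its third. A harness that implements the protocol would react to `WARN` by backing off, so its user never reaches the block.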