Automatic Request Retries

Background

The upstream LLM APIs, such as OpenAI’s API, can be unstable, rate-limited, or otherwise return occasional errors. A good production practice is to retry failed requests in case the upstream error was transient. Usage Panda can automatically retry requests on your behalf, without having to write retry logic into your application code.

Enabling the Setting

To enable automatic retries:

Navigate to the API Keys page
Click the gear (settings) icon on the API key you wish to modify
Scroll down to the “Retry Count” setting and enter a number between 0 (disabled) and 5
Click “Save”

Setting via Headers

You can optionally override this setting on a per-request basis by passing the x-usagepanda-retry-count header, like so:

response = openai.Completion.create(
  model="text-davinci-003",
  prompt="Hello there",
  headers={ # Usage Panda Auth
    "x-usagepanda-api-key": USAGE_PANDA_KEY,
    "x-usagepanda-retry-count": 3
  }
)
output = response.choices[0].text