Prompt recipe - controlling request rate?

TrevorHall · ‎11-15-2023

Is it possible to control the rate at which a prompt recipe issues requests? I ran my first one yesterday and very quickly hit the TPM rate cap on our OpenAI model deployment.

AdrienL · ‎11-16-2023

Hi,
On the connection, there are some settings to control the rate of queries. There is no direct limit for TPM, but reducing parallelism will usually fix it.

View solution in original post

AdrienL · ‎11-16-2023

Hi,
On the connection, there are some settings to control the rate of queries. There is no direct limit for TPM, but reducing parallelism will usually fix it.

TrevorHall · ‎11-16-2023

Thanks! I set "Max parallelism" to 1 and that seems to be working. I also upped the retry delay to 2 seconds.

Sign up to take part

Prompt recipe - controlling request rate?

Prompt recipe - controlling request rate?