Troubleshooting latency issues across all characters

Today I am getting 8 to 10 second response times when speaking to characters. The issue seems to be across the board with all my characters and the STT service is also cutting out.
I’m using Unreal Engine, but talking to my agents on the website shows the same latency issues.
Here is one of my character IDs for testing:
f8155bf2-c0eb-11f0-a761-42010a7be025

Please try using a different LLM. You can try Gemini 2.5.

My latest test on this character were ~13 second delays as well as longer time out errors in the chat dialogue about “Response take long to load. Plead hold on tight”.

The LLM change got it to down around 6 seconds but that is still at the max of what we need. Gemini 2.5 has also given us issues with simple actions so now we are bouncing around trying to find the LLM that can both respond with actions reliably and does not have a long latency delay.

The major concern is the inconsistency of the response times. Yesterday we had ~5-6 second delays while today they shot up to over twice that and were often timing out. This was the case on both our business and test accounts.

Which version of the Convai plugin are you using?

I’m using 3.6.8-beta.3 but it’s the same latency on the website as it is in the app.

The free tier seems to get the exact same latency as the Business plan, is this the case?

No, the plan does not affect response time.

Latency is generally influenced by the selected LLM and voice. In some cases, the issue may also come from the LLM provider or voice provider. Your Knowledge Bank content and Narrative Design setup can also affect response time.

I’d recommend trying our new beta package. If you want to test on convai.com, you can also switch the LLM to Gemini 2.5 Live Beta, which offers newer features such as lower latency and hands-free support.