Experiencing response latency

Samuel_Ionesi · May 12, 2026, 4:58pm

Hi,

I’m building a VR therapy application on Meta Quest 2 using Convai SDK v4.1.0
with Unity 6 and URP. The app is running standalone on Quest 2 (Android build).

I’m experiencing response latency of around 4-6 seconds from when the patient
speaks to when Delia (my AI character) starts responding. This is noticeable
and affects the therapy experience.

My current setup:

Model: gpt-4o-mini
Core Description: ~600 words
TTS: default Convai voice
Platform: Meta Quest 2 standalone, WiFi connection
SDK: Convai v4.1.0, Unity 6000.4.2f1

Questions:

Which LLM model gives the fastest response time for short conversational
replies in Romanian?
Which TTS voice/engine has the lowest latency?
Does Core Description length significantly impact response time?
Are there any SDK-level settings to reduce latency
(streaming, buffer size, etc.)?
Is there a recommended architecture for minimizing latency on
standalone VR headsets?

Thank you!

K3 · May 12, 2026, 5:33pm

Hello,

Could you please try using GPT-Realtime 1.5 and let us know if the issue still occurs?

Samuel_Ionesi · May 12, 2026, 7:02pm

Hi, I tried with GPT-Realtime 1.5 and it’s much better.

Topic		Replies	Views
Balancing latency and safety in NPCs Character Intelligence unity , conversation-issues , question	4	182	November 9, 2025
Very slow response Character Intelligence conversation-issues , question	36	939	June 3, 2025
How can I reduce response times? Unity SDK unity , question	10	371	January 20, 2025
Troubleshooting latency issues across all characters Character Intelligence conversation-issues , question	29	234	May 7, 2026
Is there any way of accelerating the latency between question and anwser ? thanks Unreal Engine Plugin unreal-engine , question	31	201	July 31, 2024

Experiencing response latency

Related topics