Hi,
I’m building a VR therapy application on Meta Quest 2 using Convai SDK v4.1.0
with Unity 6 and URP. The app is running standalone on Quest 2 (Android build).
I’m experiencing response latency of around 4-6 seconds from when the patient
speaks to when Delia (my AI character) starts responding. This is noticeable
and affects the therapy experience.
My current setup:
- Model: gpt-4o-mini
- Core Description: ~600 words
- TTS: default Convai voice
- Platform: Meta Quest 2 standalone, WiFi connection
- SDK: Convai v4.1.0, Unity 6000.4.2f1
Questions:
- Which LLM model gives the fastest response time for short conversational
replies in Romanian?
- Which TTS voice/engine has the lowest latency?
- Does Core Description length significantly impact response time?
- Are there any SDK-level settings to reduce latency
(streaming, buffer size, etc.)?
- Is there a recommended architecture for minimizing latency on
standalone VR headsets?
Thank you!