Hello @Robert_Ivanov,
Welcome to the Convai Developer Forum!
Yes, you can absolutely do this, and there are no restrictions on it.
Convai is visually independent, which means you’re not required to use a 3D model. You can build your own interface in Unity WebGL with just voice and text chat, and call your AI character directly through the ConvaiGRPCWEBAPI.cs script.