Multimodal data for Knowledge Bank beyond just text?

Hi there,

I’m wondering if there are plans to widen the scope of the data the Knowledge Bank can ingest beyond text, to include multimodal data such as images, video, and audio?

I’m currently experimenting with the live audio beta package (using Gemini 2.5 Flash Live as the foundation model), which processes live audio/video streams, and I feel the Knowledge Bank ought to be able to accept visual and audio data as well. Besides, much of the information in the real world is simply too complex or subtle to be meaningfully reduced to text.

So any thoughts on this?

Cheers,

Melvin Eng.

Hi again,

It would be great if Convai could share some thoughts on this :slightly_smiling_face:

Additionally, I read somewhere on the forum that PDF uploads are accepted, which raises the question of whether Convai will actually understand the images contained in them.

Cheers,

Melvin Eng.

At the moment, we do not have plans for that.

Hi Kaan,

Thanks for sharing :grinning_face:

What about PDF uploads? I understand that PDF uploads to the Knowledge Bank are available for enterprise users, and I also read somewhere on the forum that this feature will be rolled out soon to users on other tiers (though it’s still unavailable now).

So I’m most curious whether images in the PDFs will be processed within the same context as the text that references them, forming a coherent multimodal image-text memory where the two are correlated.

Regarding the rollout of PDF support in the Knowledge Bank to users on other tiers, may I know if this will be coming soon?

Cheers!

Melvin Eng.