Audio Parsing (Whisper)

Xio supports audio transcription through Whisper. This is useful for bringing content from voice notes, meeting recordings, or other audio and video files into your workspace.

Whisper can be used with local audio files or automatically triggered when a YouTube link is submitted. The audio is converted to text, cleaned, and then embedded alongside your documents.
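The "cleaned" step is not specified here, so the following is only a minimal sketch of what transcript cleanup could look like. The function name `clean_transcript` and both rules (dropping bracketed annotations such as "[Music]" and collapsing whitespace) are assumptions for illustration, not Xio's actual behavior.

```python
import re


def clean_transcript(raw: str) -> str:
    """Hypothetical cleanup pass applied to a raw transcript before embedding."""
    # Assumption: bracketed annotations like "[Music]" are noise, not content.
    text = re.sub(r"\[[^\]]*\]", " ", raw)
    # Collapse runs of spaces, tabs, and newlines into single spaces.
    return re.sub(r"\s+", " ", text).strip()
```

For example, clean_transcript("  Hello   [Music]\n world. ") returns "Hello world.".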

Enable Whisper support by configuring the following in .env:

ENABLE_WHISPER_TRANSCRIPTION=true
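As an illustration of how a boolean flag like this is typically read, here is a small sketch. The helper `whisper_enabled` and the set of accepted spellings are assumptions; only the variable name ENABLE_WHISPER_TRANSCRIPTION and the value true come from this page, so stick to true in your own .env.

```python
import os

# Assumption: these spellings count as "on"; Xio's real parsing may differ.
TRUTHY = {"1", "true", "yes", "on"}


def whisper_enabled(env=None) -> bool:
    """Return True if ENABLE_WHISPER_TRANSCRIPTION is set to a truthy value."""
    env = os.environ if env is None else env
    raw = env.get("ENABLE_WHISPER_TRANSCRIPTION", "")
    return raw.strip().lower() in TRUTHY
```

Passing a plain dict instead of the real environment makes the check easy to try out: whisper_enabled({"ENABLE_WHISPER_TRANSCRIPTION": "true"}) is True, while an empty dict gives False.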

Once enabled, dropping in an audio file or YouTube URL will automatically generate a transcript ready for chat.
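To make the two entry points concrete, the sketch below shows one way an incoming item could be classified as a local audio file or a YouTube link before transcription. The function `classify_source`, the extension list, and the host list are hypothetical; the formats and URL shapes Xio actually accepts are not documented on this page.

```python
from pathlib import Path
from urllib.parse import urlparse

# Assumptions for illustration only; Xio's supported formats may differ.
AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg"}
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def classify_source(source: str) -> str:
    """Return "youtube", "audio_file", or "other" for a submitted item."""
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https"):
        host = (parsed.hostname or "").lower()
        return "youtube" if host in YOUTUBE_HOSTS else "other"
    # Not a web URL: treat it as a path and check the file extension.
    if Path(source).suffix.lower() in AUDIO_EXTENSIONS:
        return "audio_file"
    return "other"
```

Anything classified as "other" would follow the normal document path rather than being sent to Whisper.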

Xio handles the transcription in the background and stores it like any other text source, ready for question-answering and summarization.
