Audio Parsing (Whisper)
Xio includes support for audio transcription through Whisper. This is useful for adding content from voice notes, meetings, or transcribed video/audio files into your workspace.
Whisper can be used with local audio files or automatically triggered when a YouTube link is submitted. The audio is converted to text, cleaned, and then embedded alongside your documents.
Enable Whisper support by configuring the following in .env
:
envCopyEditENABLE_WHISPER_TRANSCRIPTION=true
Once enabled, dropping in an audio file or YouTube URL will automatically generate a transcript ready for chat.
Xio handles the transcription in the background and stores it like any other text source, ready for question-answering and summarization.
Last updated