Skip to Content
MCP ServersSpeech MCP Server

Speech MCP Server

Local audio transcription via OpenAI Whisper, exposed as an MCP tool. Supports mp3, mp4, m4a, wav, ogg, flac, and webm formats.

Install

ark install marketplace/mcps/speech-mcp-server

Or with Helm:

cd mcps/speech-mcp-server helm install speech-mcp-server ./chart -n default --create-namespace

Tools

ToolDescription
transcribe_audioTranscribe an audio file using Whisper

Configuration

VariableDescriptionDefault
PORTServer port8080
BASE_DATA_DIRBase directory for audio files/data
WHISPER_MODELWhisper model size (tiny, base, small, medium, large)small
CACHE_DIRTranscription cache directory/data/whisper-cache

Usage

Used by the COBOL modernization bundle for transcribing interview recordings.

Additional Resources

Last updated on