The Gemini Audio MCP server brings enterprise-grade generative audio directly to your AI assistant. Built in high-performance Rust, it leverages Google's state-of-the-art models to provide a unified bridge for environmental sound design, expressive narration, and professional music production.

✨ Key Capabilities

* 🎙️ Infinite Soundscapes: Generate complex, immersive environmental audio using the Gemini 2.0 Multimodal Live API.
* 🎵 Music & SFX: Create high-fidelity rhythmic loops, full songs, and discrete foley cues via Google's Lyria 3 Pro and Clip models.
* 🗣️ Expressive Voice: Convert text to speech with natural voice direction and emotional nuances.
* 🎲 Seamless Looping: Features a proprietary 100ms micro-crossfade algorithm to ensure click-free, non-repeating background audio.
* 🎭 Cinematic Transitions: Smoothly blend and crossfade between two distinct audio prompts for dynamic environment changes.
* 🎛️ Universal Encoding: Direct Stdin-to-FFmpeg piping allows for zero-latency transcoding into 10+ formats (MP3, OGG, FLAC, OPUS, WAV, etc.).

🎮 Use Cases

* Game Developers (UE5, Godot, Blender): Instantly generate procedural soundscapes and NPC dialogue lines during development.
* Content Creators: Automate foley and background texture generation for video projects.
* Productivity: Enhance your AI workspace with high-quality narration and focus-oriented ambient audio.

---

🛠️ Requirements

* FFmpeg: Must be installed on the system path for audio transcoding.
* API Key: A valid Google AI Studio (Gemini) API Key.
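
The Seamless Looping capability above mentions a 100ms micro-crossfade. The server's own algorithm is proprietary and not described here, so the following is only a minimal sketch of the general idea, assuming a plain linear crossfade over mono float samples. The function name `make_seamless_loop` and its parameters are hypothetical and are not part of the server's interface.

```python
def make_seamless_loop(samples, sample_rate, fade_ms=100):
    """Blend the tail of a clip into its head so the loop point has no jump.

    Generic linear crossfade (an assumption, not the server's algorithm).
    `samples` is a list of mono float samples.
    """
    fade_len = int(sample_rate * fade_ms / 1000)
    if fade_len <= 0 or len(samples) < 2 * fade_len:
        raise ValueError("clip is too short for the requested crossfade")

    body_end = len(samples) - fade_len
    tail = samples[body_end:]  # last `fade_len` samples, folded into the start

    blended = []
    for i in range(fade_len):
        w = i / fade_len  # weight of the head ramps from 0.0 towards 1.0
        blended.append(samples[i] * w + tail[i] * (1.0 - w))

    # The result is shorter by `fade_len` samples. Its last sample is the one
    # that originally preceded the tail, and its first sample equals the first
    # tail sample, so playback wraps around without a discontinuity.
    return blended + samples[fade_len:body_end]
```

With a 1000 Hz sample rate and a 1000-sample clip, the 100 ms fade covers 100 samples and the returned loop holds 900 samples. A production version would likely use an equal-power curve and handle multiple channels.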
Run in your terminal:
claude mcp add gemini-audio-mcp -- npx -y @smithery/cli run jxoesneon/gemini-audio-mcp
Yes, Gemini Audio MCP is free, with one-click install via Unyly at no cost.
Yes, Gemini Audio MCP needs a valid Google AI Studio (Gemini) API key, and FFmpeg must be installed on the system path.
Self-hosted: the server runs locally on your machine via the install command above.
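
Because the server runs locally, the FFmpeg requirement applies to your own machine. The sketch below illustrates the stdin-to-FFmpeg piping idea from the capabilities list in general terms; it is not the server's Rust implementation. The helper names, the 24000 Hz sample rate, and the signed 16-bit mono input format are assumptions chosen for illustration.

```python
import shutil
import subprocess


def build_ffmpeg_cmd(out_format, sample_rate=24000, channels=1):
    """Build an FFmpeg command that reads raw PCM on stdin and writes to stdout."""
    return [
        "ffmpeg", "-hide_banner", "-loglevel", "error",
        "-f", "s16le",            # input: raw signed 16-bit little-endian PCM (assumed)
        "-ar", str(sample_rate),  # input sample rate (assumed default)
        "-ac", str(channels),     # input channel count
        "-i", "pipe:0",           # read the input from stdin
        "-f", out_format,         # output container, e.g. "mp3", "ogg", "flac"
        "pipe:1",                 # write the encoded audio to stdout
    ]


def transcode(pcm_bytes, out_format="mp3", sample_rate=24000, channels=1):
    """Pipe raw PCM bytes through FFmpeg and return the encoded bytes."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("FFmpeg was not found on the system path")
    result = subprocess.run(
        build_ffmpeg_cmd(out_format, sample_rate, channels),
        input=pcm_bytes,
        stdout=subprocess.PIPE,
        check=True,  # raise CalledProcessError if FFmpeg exits with an error
    )
    return result.stdout
```

Piping through stdin and stdout avoids writing temporary files, which is the usual motivation for this pattern. Calling `transcode(pcm, "ogg")` would return Ogg-encoded bytes when FFmpeg is installed.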
Open Gemini Audio MCP on unyly.org, pick your client tab (Claude Desktop, Claude Code, Cursor) and press Install. The config is generated automatically, with no JSON editing required.