loading…
Search for a command to run...
loading…
Audio AI API for stem separation (vocal, drum, bass, guitar, piano), DME separation (dialogue, music, effects), and AI lyrics sync. 7 tools, 11 models, supports
Audio AI API for stem separation (vocal, drum, bass, guitar, piano), DME separation (dialogue, music, effects), and AI lyrics sync. 7 tools, 11 models, supports WAV/FLAC/MP3/M4A/MOV/MP4.
npm version Glama score License: MIT
MCP server for Gaudio Lab Audio AI API. Separate vocals, instruments, dialogue, music, effects from any audio/video — or sync lyrics to timestamps — all through natural language in your AI tools.
Works with Claude, ChatGPT, Cursor, VS Code, GitHub Copilot, and any MCP-compatible client.
Add to your MCP client config:
{
"mcpServers": {
"gaudio": {
"command": "npx",
"args": ["-y", "@gaudiolab/mcp-developers"],
"env": {
"GAUDIO_API_KEY": "your-api-key-here"
}
}
}
}
Then just ask in natural language:
| Tool | Description |
|---|---|
gaudio_get_key_info |
Get API key info: credits, project, permitted models |
gaudio_list_models |
List available AI models by category |
gaudio_upload_file |
Upload audio/video/text file (multipart, auto-chunked) |
gaudio_create_job |
Create a processing job |
gaudio_get_job |
Check job status and get download URLs |
gaudio_separate_audio |
All-in-one: upload → process → download URLs |
gaudio_sync_lyrics |
All-in-one lyrics sync with timestamps |
| Model | Description | Type Options |
|---|---|---|
gsep_music_hq_v1 |
Multi-instrument separation | vocal, drum, bass, electric_guitar, acoustic_piano |
gsep_music_shq_v1 |
Super HQ vocal + accompaniment | vocal |
gsep_speech_hq_v1 |
Speech / noise removal | speech |
Max: 1GB / 20 min per file. Types can be combined (e.g. vocal,drum).
| Model | Description |
|---|---|
gsep_dme_dtrack_v1 |
Dialogue extraction |
gsep_dme_d2track_v1 |
Dialogue + vocals |
gsep_dme_metrack_v1 |
Music + effects |
gsep_dme_me2track_v1 |
Music + effects v1 |
gsep_dme_me2track_v2 |
Music + effects v2 (high quality) |
gsep_dme_mtrack_v1 |
Music only |
gsep_dme_etrack_v1 |
Effects only |
Max: 10GB / 200 min per file.
| Model | Description | Languages |
|---|---|---|
gts_lyrics_line_v1 |
Lyrics line sync | en, ko, ja, zh-cn |
Max: 1GB / 10 min. Text: .txt (UTF-8), min 2 lines, max 60 chars/line.
Output: CSV (timestamp, lyric_text, confidence_score) + JSON report.
| Type | Formats |
|---|---|
| Audio | WAV, FLAC, MP3, M4A |
| Video | MOV, MP4 (audio auto-extracted) |
| Text | TXT (UTF-8) |
Output: MP3 (48kHz/320kbps) + WAV (same as input). Download URLs valid for 48 hours.
Upload file → Create job → Poll status → Get download URLs
The high-level tools (gaudio_separate_audio, gaudio_sync_lyrics) handle this entire flow automatically. Upload IDs are valid for 72 hours and can be reused across multiple jobs.
MIT
Run in your terminal:
claude mcp add gaudiolab-jp-gaudio-developers-mcp -- npx Yes, gaudiolab-jp/gaudio-developers-mcp MCP is free — one-click install via Unyly at no cost.
No, gaudiolab-jp/gaudio-developers-mcp runs without API keys or environment variables.
Self-hosted: the server runs locally on your machine via the install command above.
Open gaudiolab-jp/gaudio-developers-mcp on unyly.org, pick your client tab (Claude Desktop, Claude Code, Cursor) and press Install — the config is generated automatically, no JSON editing.
Transcripts, channel stats, search
by YouTubeAI image generation using various models.
by modelcontextprotocolUnified GPU inference API with 30 AI services (LLM, image gen, video, TTS, whisper, embeddings, reranking, OCR) as MCP tools. Pay-per-use via x402 USDC or API k
by gpu-bridgeA powerful image generation tool using Google's Imagen 3.0 API through MCP. Generate high-quality images from text prompts with advanced photography, artistic,
by hamflxNot sure what to pick?
Find your stack in 60 seconds
Author?
Embed badge for your README
Browse similar
All media MCPs