gaudiolab-jp/gaudio-developers-mcp

FreeNot checked

Audio AI API for stem separation (vocal, drum, bass, guitar, piano), DME separation (dialogue, music, effects), and AI lyrics sync. 7 tools, 11 models, supports

by gaudiolab-jp

GitHub Embed

About

Audio AI API for stem separation (vocal, drum, bass, guitar, piano), DME separation (dialogue, music, effects), and AI lyrics sync. 7 tools, 11 models, supports WAV/FLAC/MP3/M4A/MOV/MP4.

README

npm version Glama score License: MIT

MCP server for Gaudio Lab Audio AI API. Separate vocals, instruments, dialogue, music, effects from any audio/video — or sync lyrics to timestamps — all through natural language in your AI tools.

Works with Claude, ChatGPT, Cursor, VS Code, GitHub Copilot, and any MCP-compatible client.

Get Your API Key

Sign up at Gaudio Developers
Create a project and get your API key from the dashboard

Quick Start

Add to your MCP client config:

{
  "mcpServers": {
    "gaudio": {
      "command": "npx",
      "args": ["-y", "@gaudiolab/mcp-developers"],
      "env": {
        "GAUDIO_API_KEY": "your-api-key-here"
      }
    }
  }
}

Then just ask in natural language:

"Separate the vocals from this file"
"Extract the dialogue from this video"
"Sync these lyrics to this song"
"What models are available?"
"How many credits do I have left?"

Tools

Tool	Description
`gaudio_get_key_info`	Get API key info: credits, project, permitted models
`gaudio_list_models`	List available AI models by category
`gaudio_upload_file`	Upload audio/video/text file (multipart, auto-chunked)
`gaudio_create_job`	Create a processing job
`gaudio_get_job`	Check job status and get download URLs
`gaudio_separate_audio`	All-in-one: upload → process → download URLs
`gaudio_sync_lyrics`	All-in-one lyrics sync with timestamps

Models

Stem Separation

Model	Description	Type Options
`gsep_music_hq_v1`	Multi-instrument separation	vocal, drum, bass, electric_guitar, acoustic_piano
`gsep_music_shq_v1`	Super HQ vocal + accompaniment	vocal
`gsep_speech_hq_v1`	Speech / noise removal	speech

Max: 1GB / 20 min per file. Types can be combined (e.g. vocal,drum).

DME Separation (Dialogue, Music, Effects)

Model	Description
`gsep_dme_dtrack_v1`	Dialogue extraction
`gsep_dme_d2track_v1`	Dialogue + vocals
`gsep_dme_metrack_v1`	Music + effects
`gsep_dme_me2track_v1`	Music + effects v1
`gsep_dme_me2track_v2`	Music + effects v2 (high quality)
`gsep_dme_mtrack_v1`	Music only
`gsep_dme_etrack_v1`	Effects only

Max: 10GB / 200 min per file.

AI Text Sync

Model	Description	Languages
`gts_lyrics_line_v1`	Lyrics line sync	en, ko, ja, zh-cn

Max: 1GB / 10 min. Text: .txt (UTF-8), min 2 lines, max 60 chars/line.

Output: CSV (timestamp, lyric_text, confidence_score) + JSON report.

Supported Formats

Type	Formats
Audio	WAV, FLAC, MP3, M4A
Video	MOV, MP4 (audio auto-extracted)
Text	TXT (UTF-8)

Output: MP3 (48kHz/320kbps) + WAV (same as input). Download URLs valid for 48 hours.

How It Works

Upload file → Create job → Poll status → Get download URLs

The high-level tools (gaudio_separate_audio, gaudio_sync_lyrics) handle this entire flow automatically. Upload IDs are valid for 72 hours and can be reused across multiple jobs.

License

MIT

from github.com/gaudiolab-jp/gaudio-developers-mcp

Install gaudiolab-jp/gaudio-developers-mcp in Claude Desktop, Claude Code & Cursor

Run in your terminal:

claude mcp add gaudiolab-jp-gaudio-developers-mcp -- npx

FAQ

Is gaudiolab-jp/gaudio-developers-mcp MCP free?

Yes, gaudiolab-jp/gaudio-developers-mcp MCP is free — one-click install via Unyly at no cost.

Does gaudiolab-jp/gaudio-developers-mcp need an API key?

No, gaudiolab-jp/gaudio-developers-mcp runs without API keys or environment variables.

Is gaudiolab-jp/gaudio-developers-mcp hosted or self-hosted?

Self-hosted: the server runs locally on your machine via the install command above.

How do I install gaudiolab-jp/gaudio-developers-mcp in Claude Desktop, Claude Code or Cursor?

Open gaudiolab-jp/gaudio-developers-mcp on unyly.org, pick your client tab (Claude Desktop, Claude Code, Cursor) and press Install — the config is generated automatically, no JSON editing.

Related MCPs

YouTube

Transcripts, channel stats, search

by YouTube

4.33.4K

EverArt

AI image generation using various models.

by modelcontextprotocol

gpu-bridge/mcp-server

Unified GPU inference API with 30 AI services (LLM, image gen, video, TTS, whisper, embeddings, reranking, OCR) as MCP tools. Pay-per-use via x402 USDC or API k

by gpu-bridge

hamflx/imagen3-mcp

A powerful image generation tool using Google's Imagen 3.0 API through MCP. Generate high-quality images from text prompts with advanced photography, artistic,

by hamflx

Compare gaudiolab-jp/gaudio-developers-mcp with

gaudiolab-jp/gaudio-developers-mcpvsYouTube gaudiolab-jp/gaudio-developers-mcpvsEverArt gaudiolab-jp/gaudio-developers-mcpvsgpu-bridge/mcp-server gaudiolab-jp/gaudio-developers-mcpvshamflx/imagen3-mcp

Not sure what to pick?

Find your stack in 60 seconds

Author?

Embed badge for your README

Browse similar

All media MCPs

loading…

Browse all

gaudiolab-jp/gaudio-developers-mcp

FreeNot checked

Audio AI API for stem separation (vocal, drum, bass, guitar, piano), DME separation (dialogue, music, effects), and AI lyrics sync. 7 tools, 11 models, supports

by gaudiolab-jp

GitHub Embed

About

Audio AI API for stem separation (vocal, drum, bass, guitar, piano), DME separation (dialogue, music, effects), and AI lyrics sync. 7 tools, 11 models, supports WAV/FLAC/MP3/M4A/MOV/MP4.

README

npm version Glama score License: MIT

MCP server for Gaudio Lab Audio AI API. Separate vocals, instruments, dialogue, music, effects from any audio/video — or sync lyrics to timestamps — all through natural language in your AI tools.

Works with Claude, ChatGPT, Cursor, VS Code, GitHub Copilot, and any MCP-compatible client.

Get Your API Key

Sign up at Gaudio Developers
Create a project and get your API key from the dashboard

Quick Start

Add to your MCP client config:

{
  "mcpServers": {
    "gaudio": {
      "command": "npx",
      "args": ["-y", "@gaudiolab/mcp-developers"],
      "env": {
        "GAUDIO_API_KEY": "your-api-key-here"
      }
    }
  }
}

Then just ask in natural language:

"Separate the vocals from this file"
"Extract the dialogue from this video"
"Sync these lyrics to this song"
"What models are available?"
"How many credits do I have left?"

Tools

Tool	Description
`gaudio_get_key_info`	Get API key info: credits, project, permitted models
`gaudio_list_models`	List available AI models by category
`gaudio_upload_file`	Upload audio/video/text file (multipart, auto-chunked)
`gaudio_create_job`	Create a processing job
`gaudio_get_job`	Check job status and get download URLs
`gaudio_separate_audio`	All-in-one: upload → process → download URLs
`gaudio_sync_lyrics`	All-in-one lyrics sync with timestamps

Models

Stem Separation

Model	Description	Type Options
`gsep_music_hq_v1`	Multi-instrument separation	vocal, drum, bass, electric_guitar, acoustic_piano
`gsep_music_shq_v1`	Super HQ vocal + accompaniment	vocal
`gsep_speech_hq_v1`	Speech / noise removal	speech

Max: 1GB / 20 min per file. Types can be combined (e.g. vocal,drum).

DME Separation (Dialogue, Music, Effects)

Model	Description
`gsep_dme_dtrack_v1`	Dialogue extraction
`gsep_dme_d2track_v1`	Dialogue + vocals
`gsep_dme_metrack_v1`	Music + effects
`gsep_dme_me2track_v1`	Music + effects v1
`gsep_dme_me2track_v2`	Music + effects v2 (high quality)
`gsep_dme_mtrack_v1`	Music only
`gsep_dme_etrack_v1`	Effects only

Max: 10GB / 200 min per file.

AI Text Sync

Model	Description	Languages
`gts_lyrics_line_v1`	Lyrics line sync	en, ko, ja, zh-cn

Max: 1GB / 10 min. Text: .txt (UTF-8), min 2 lines, max 60 chars/line.

Output: CSV (timestamp, lyric_text, confidence_score) + JSON report.

Supported Formats

Type	Formats
Audio	WAV, FLAC, MP3, M4A
Video	MOV, MP4 (audio auto-extracted)
Text	TXT (UTF-8)

Output: MP3 (48kHz/320kbps) + WAV (same as input). Download URLs valid for 48 hours.

How It Works

Upload file → Create job → Poll status → Get download URLs

The high-level tools (gaudio_separate_audio, gaudio_sync_lyrics) handle this entire flow automatically. Upload IDs are valid for 72 hours and can be reused across multiple jobs.

License

MIT

from github.com/gaudiolab-jp/gaudio-developers-mcp

Install gaudiolab-jp/gaudio-developers-mcp in Claude Desktop, Claude Code & Cursor

Run in your terminal:

claude mcp add gaudiolab-jp-gaudio-developers-mcp -- npx

FAQ

Is gaudiolab-jp/gaudio-developers-mcp MCP free?

Yes, gaudiolab-jp/gaudio-developers-mcp MCP is free — one-click install via Unyly at no cost.

Does gaudiolab-jp/gaudio-developers-mcp need an API key?

No, gaudiolab-jp/gaudio-developers-mcp runs without API keys or environment variables.

Is gaudiolab-jp/gaudio-developers-mcp hosted or self-hosted?

Self-hosted: the server runs locally on your machine via the install command above.

How do I install gaudiolab-jp/gaudio-developers-mcp in Claude Desktop, Claude Code or Cursor?

Open gaudiolab-jp/gaudio-developers-mcp on unyly.org, pick your client tab (Claude Desktop, Claude Code, Cursor) and press Install — the config is generated automatically, no JSON editing.

Related MCPs

YouTube

Transcripts, channel stats, search

by YouTube

4.33.4K

EverArt

AI image generation using various models.

by modelcontextprotocol

gpu-bridge/mcp-server

Unified GPU inference API with 30 AI services (LLM, image gen, video, TTS, whisper, embeddings, reranking, OCR) as MCP tools. Pay-per-use via x402 USDC or API k

by gpu-bridge

hamflx/imagen3-mcp

A powerful image generation tool using Google's Imagen 3.0 API through MCP. Generate high-quality images from text prompts with advanced photography, artistic,

by hamflx

Compare gaudiolab-jp/gaudio-developers-mcp with

Not sure what to pick?

Find your stack in 60 seconds

Author?

Embed badge for your README

Browse similar

All media MCPs

Command Palette

gaudiolab-jp/gaudio-developers-mcp

About

README

Get Your API Key

Quick Start

Tools

Models

Stem Separation

DME Separation (Dialogue, Music, Effects)

AI Text Sync

Supported Formats

How It Works

Links

License

Install gaudiolab-jp/gaudio-developers-mcp in Claude Desktop, Claude Code & Cursor

FAQ

Is gaudiolab-jp/gaudio-developers-mcp MCP free?

Does gaudiolab-jp/gaudio-developers-mcp need an API key?

Is gaudiolab-jp/gaudio-developers-mcp hosted or self-hosted?

How do I install gaudiolab-jp/gaudio-developers-mcp in Claude Desktop, Claude Code or Cursor?

Related MCPs

YouTube

EverArt

gpu-bridge/mcp-server

hamflx/imagen3-mcp

Compare gaudiolab-jp/gaudio-developers-mcp with

gaudiolab-jp/gaudio-developers-mcp

About

README

Get Your API Key

Quick Start

Tools

Models

Stem Separation

DME Separation (Dialogue, Music, Effects)

AI Text Sync

Supported Formats

How It Works

Links

License

Install gaudiolab-jp/gaudio-developers-mcp in Claude Desktop, Claude Code & Cursor

FAQ

Is gaudiolab-jp/gaudio-developers-mcp MCP free?

Does gaudiolab-jp/gaudio-developers-mcp need an API key?

Is gaudiolab-jp/gaudio-developers-mcp hosted or self-hosted?

How do I install gaudiolab-jp/gaudio-developers-mcp in Claude Desktop, Claude Code or Cursor?

Related MCPs

YouTube

EverArt

gpu-bridge/mcp-server

hamflx/imagen3-mcp

Compare gaudiolab-jp/gaudio-developers-mcp with