{"type":"mcp_client","name":"multimodal-mcp-client","description":"This repository offers a voice-controlled AI interface using Google Gemini and Anthropic MCP, enabling natural speech and multimodal inputs for interacting with AI systems. It supports both custom and Systemprompt MCP servers.","category":"AI","language":"TypeScript","stars":211,"forks":21,"owner":"Ejb503","github_url":"https://github.com/Ejb503/multimodal-mcp-client","homepage":"https://systemprompt.io","setup":"## Setup\n\n1. **Clone the repository**\n\n   ```bash\n   git clone https://github.com/Ejb503/multimodal-mcp-client.git\n   cd multimodal-mcp-client\n   ```\n\n2. **Install dependencies**\n\n   ```bash\n   npm install\n   cd proxy\n   npm install\n   ```\n\n3. **Configure the application**\n\n   ```bash\n   # Navigate to config directory\n   cd config\n\n   # Create local configuration files\n   cp mcp.config.example.json mcp.config.custom.json\n   ```\n\n   Required API Keys:\n\n   - [Google AI Studio](https://ai.google.dev/gemini-api/docs) - Gemini API key\n   - [systemprompt.io/console](https://systemprompt.io/console) - Systemprompt API key\n\n   Add keys to `.env` (see `.env.example` for reference). Note that the `VITE_` prefix is required to share the keys with the MCP server and client.\n\n4. **Start development server**\n   ```bash\n   npm run dev\n   ```\n   Access the development server at `http://localhost:5173`","tools":"## Available Tools\n\n1.  **Multimodal AI** (Understand and process text, voice, and visual inputs naturally)\n2.  **MCP (Model Context Protocol)** (Execute complex AI workflows with a robust tooling system)\n3.  **Voice-First Design** (Control everything through natural speech, making AI interaction more intuitive)\n4.  **Natural Voice Control** (Speak naturally to control AI workflows and execute commands)\n5.  **Multimodal Understanding** (Process text, voice, and visual inputs simultaneously)\n6.  
**Real-time Voice Synthesis** (Get instant audio responses from your AI interactions)\n7.  **Extensible Tool System** (Add custom tools and workflows through MCP)\n8.  **Workflow Automation** (Chain multiple AI operations with voice commands)\n9.  **State Management** (Robust handling of complex, multi-step AI interactions)\n10. **Modern Tech Stack** (Built with Vite, React, TypeScript, and NextUI)\n11. **Type Safety** (Full TypeScript support with comprehensive type definitions)\n12. **Hot Module Replacement** (Fast development with instant feedback)\n13. **Comprehensive Testing** (Built-in testing infrastructure with high coverage)","faq":null,"created_at":"2025-01-08T07:50:46+00:00","updated_at":"2025-03-27T16:47:22+00:00","source_url":"https://model-context-protocol.com/clients/voice-controlled-ai-agentic-workflows-client","related_articles":[]}