## Performance Optimizations (3-10x faster responses)
- STT beam_size reduced to 1 (3-5x faster transcription, minimal quality loss)
- Smart query routing: Haiku (simple) → Sonnet (medium) → Opus (complex)
- TTS cache for common phrases (27 pre-generated responses)
- Sentence-level streaming TTS (start playing while generating)
- Sample-based VAD timing (30x improvement in silence detection)

## TTS Engine Upgrade
- Migrated from Chatterbox to Chatterbox-Turbo
- Zero-shot voice cloning (no fine-tuning required)
- Native paralinguistic tag support ([laugh], [sigh], [chuckle], etc.)
- Emotion presets with temperature control
- Improved marker conversion (*action*, (action), ~action~)

## Discord Bot Enhancements
- Multi-agent support (Jarvis, Sage)
- Improved voice receiving with discord-ext-voice-recv
- Enhanced /join, /leave, /status commands
- Per-agent personality configuration
- Better audio sink/receiver implementation

## OpenClaw Integration
- WebSocket support for Gateway communication
- Query complexity routing (auto-select model)
- Improved error handling and retries
- Session management per Discord guild
- Better latency tracking

## Pipeline Improvements
- Sentence splitter for streaming optimization
- Query router for intelligent model selection
- Enhanced VAD receiver with sample-based timing
- Improved audio buffering and format conversion
- Better transcript management

## Documentation
- Added QUICK_START.md (5-minute test guide)
- Added OPTIMIZATION_SUMMARY.md (performance analysis)
- Added DISCORD_OPTIMIZATION_TEST.md (testing guide)
- Added USAGE_GUIDE.md (comprehensive usage)
- Updated README.md with optimization details

## Utilities & Scripts
- Added get_invite_link.py (Discord bot invite)
- Added sync_commands.py, sync_to_guild.py (command sync)
- Added test_gateway.py, test_stt.py (testing utilities)
- Added openclaw_wrapper.py (wrapper script)
- Removed create_mock_turn_model.py (no longer needed)

## Configuration Updates
- STT model: medium → small (faster, acceptable quality)
- TTS engine: chatterbox → coqui (Turbo integration)
- Beam size: 5 → 1 (latency optimization)
- Added emotion_exaggeration per agent
- Updated .gitignore for project files

Total: ~2105 insertions, ~462 deletions across 35 files
Performance: ~5.5s total latency (down from 22-35s)
Target: ~3.5s (achieved in simple queries with cache)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

# GitHub Repository Setup

## Quick Setup

1. **Create GitHub Repository**
   - Go to https://github.com/new
   - Repository name: `jarvis-voice-bot`
   - Description: `AI-powered voice assistant for Discord with natural conversation`
   - Visibility: **Public**
   - **DO NOT** initialize with README, .gitignore, or license (we already have these)
   - Click "Create repository"

2. **Push Code to GitHub**

   ```bash
   # Change to your local project directory (example path; adjust for your machine)
   cd "C:\Users\kruz7\OneDrive\Documents\Code Repos\MCKRUZ\openclaw-voice"

   # Add GitHub remote (replace YOUR_USERNAME with your GitHub username)
   git remote add origin https://github.com/YOUR_USERNAME/jarvis-voice-bot.git

   # Rename the current branch to main, then push and set upstream tracking
   git branch -M main
   git push -u origin main
   ```

3. **Verify**
   - Refresh your GitHub repository page
   - You should see all 54 files
   - README.md should display automatically

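The verification step can also be approximated from the command line before refreshing the page. The following is a minimal sketch, not part of the original setup: the helper names `count_tracked_files` and `show_origin` are hypothetical, and it assumes every file has been committed, so that the tracked-file count matches what GitHub displays.

```bash
# Count the files tracked by Git in a directory (defaults to the current one).
# Assumption: all files are committed, so this matches the count shown on GitHub.
count_tracked_files() {
  git -C "${1:-.}" ls-files 2>/dev/null | wc -l | tr -d ' '
}

# Print the URL that `origin` points to; prints nothing if no such remote exists.
show_origin() {
  git -C "${1:-.}" remote get-url origin 2>/dev/null
}

echo "Tracked files: $(count_tracked_files)"
echo "Origin: $(show_origin)"
```

If the reported count differs from the number on the repository page, uncommitted or ignored files are a likely cause; `git status` shows which files are affected.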
## Repository Configuration

After pushing, configure:

**Topics/Tags** (for discoverability):
- `discord-bot`
- `voice-assistant`
- `ai`
- `speech-recognition`
- `text-to-speech`
- `python`
- `discord-py`

**About Section:**
```
AI-powered voice assistant for Discord with natural conversation, Smart Turn detection,
and OpenAI-compatible API. Features GPU-accelerated STT/TTS, intelligent relevance
filtering, and OpenClaw integration.
```

**Website:** (optional)
- Your documentation or demo site

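The topics listed in this section can also be applied from a terminal instead of the web UI. The following is a minimal sketch, assuming the GitHub CLI (`gh`) is installed and authenticated. The helper names and the `APPLY` switch are hypothetical, and by default the script only prints the command it would run.

```bash
REPO="YOUR_USERNAME/jarvis-voice-bot"   # placeholder used throughout this guide
TOPICS=(discord-bot voice-assistant ai speech-recognition text-to-speech python discord-py)

# Join the arguments into the comma-separated form passed to --add-topic.
join_topics() {
  local IFS=,
  echo "$*"
}

# Print the command by default; run it only when APPLY=1 is set (hypothetical switch).
set_repo_metadata() {
  local cmd=(gh repo edit "$REPO" --add-topic "$(join_topics "${TOPICS[@]}")")
  if [ "${APPLY:-0}" = "1" ]; then
    "${cmd[@]}"
  else
    echo "${cmd[*]}"
  fi
}

set_repo_metadata
```

`gh repo edit` also accepts `--description` and `--homepage`, which correspond to the About text and the optional website.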
## Done!

Your repository is now public at:
`https://github.com/YOUR_USERNAME/jarvis-voice-bot`

Clone command for others:
```bash
git clone https://github.com/YOUR_USERNAME/jarvis-voice-bot.git
```