docs: Add PyTorch RTX 5090 monitoring guide
- Weekly check instructions for sm_120 support - Automated monitoring options (GitHub Watch, RSS, calendar) - Step-by-step GPU setup when support lands - Historical timeline estimates (2-3 months typical) - Optimistic timeline: March 2025 - While-waiting optimization suggestions Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
This commit is contained in:
parent
2f17d4847d
commit
74167edc0d
1 changed files with 220 additions and 0 deletions
220
PYTORCH_MONITORING.md
Normal file
220
PYTORCH_MONITORING.md
Normal file
|
|
@ -0,0 +1,220 @@
|
|||
# PyTorch RTX 5090 Support Monitoring Guide
|
||||
|
||||
**Goal:** Get notified when PyTorch adds Blackwell (sm_120) support
|
||||
|
||||
---
|
||||
|
||||
## Quick Check (Weekly)
|
||||
|
||||
### Test PyTorch Nightly
|
||||
|
||||
```bash
|
||||
# From project root
|
||||
cd "C:\Users\kruz7\OneDrive\Documents\Code Repos\MCKRUZ\openclaw-voice"
|
||||
source venv/Scripts/activate
|
||||
|
||||
# Test nightly build
|
||||
pip install --upgrade --pre torch --index-url https://download.pytorch.org/whl/nightly/cu124
|
||||
|
||||
# Quick test
|
||||
python -c "import torch; x=torch.rand(10,10,device='cuda'); print('✓ RTX 5090 WORKS!' if x.device.type=='cuda' else '✗ Not yet')"
|
||||
```
|
||||
|
||||
If you see **"✓ RTX 5090 WORKS!"** → GPU support is here! Run `fix_pytorch_cuda.bat`
|
||||
|
||||
---
|
||||
|
||||
## Automated Monitoring
|
||||
|
||||
### Option 1: GitHub Watch (Recommended)
|
||||
|
||||
1. Go to: https://github.com/pytorch/pytorch
|
||||
2. Click **"Watch"** → **"Custom"**
|
||||
3. Check **"Releases"** only
|
||||
4. Get email when new PyTorch releases
|
||||
|
||||
### Option 2: RSS Feed
|
||||
|
||||
Subscribe to PyTorch releases:
|
||||
```
|
||||
https://github.com/pytorch/pytorch/releases.atom
|
||||
```
|
||||
|
||||
Use RSS reader (Feedly, Inoreader) or browser extension
|
||||
|
||||
### Option 3: Weekly Calendar Reminder
|
||||
|
||||
Set recurring calendar event:
|
||||
- **When:** Every Monday 9am
|
||||
- **What:** Check PyTorch RTX 5090 support
|
||||
- **How:** Run quick test above
|
||||
|
||||
---
|
||||
|
||||
## What to Look For
|
||||
|
||||
### In Release Notes
|
||||
|
||||
Keywords indicating sm_120 support:
|
||||
- ✅ "Blackwell" or "sm_120"
|
||||
- ✅ "RTX 5090" or "50-series"
|
||||
- ✅ "CUDA capability 12.0"
|
||||
- ✅ "Hopper+Blackwell" or "H100+B100"
|
||||
|
||||
### Example Release Note:
|
||||
```
|
||||
PyTorch 2.X.0 Release Notes
|
||||
- Added support for NVIDIA Blackwell architecture (sm_120)
|
||||
- RTX 50-series GPUs now fully supported
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## When Support Lands
|
||||
|
||||
### 1. Update PyTorch
|
||||
|
||||
```bash
|
||||
cd "C:\Users\kruz7\OneDrive\Documents\Code Repos\MCKRUZ\openclaw-voice"
|
||||
|
||||
# Run the fix script
|
||||
fix_pytorch_cuda.bat
|
||||
|
||||
# Or manually:
|
||||
source venv/Scripts/activate
|
||||
pip uninstall torch torchaudio torchvision -y
|
||||
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124
|
||||
```
|
||||
|
||||
### 2. Verify GPU Works
|
||||
|
||||
```bash
|
||||
python -c "import torch; print(torch.cuda.get_device_name(0)); x=torch.rand(100,100,device='cuda'); print('GPU OK!')"
|
||||
```
|
||||
|
||||
Expected output:
|
||||
```
|
||||
NVIDIA GeForce RTX 5090
|
||||
GPU OK!
|
||||
```
|
||||
|
||||
### 3. Update Config
|
||||
|
||||
Edit `config.yaml`:
|
||||
```yaml
|
||||
pipeline:
|
||||
stt:
|
||||
device: "cuda" # Was: cpu
|
||||
model_size: "medium" # Can increase from small
|
||||
beam_size: 5 # Can increase from 1
|
||||
|
||||
tts:
|
||||
device: "cuda" # Was: cpu
|
||||
```
|
||||
|
||||
### 4. Test Performance
|
||||
|
||||
```bash
|
||||
# Start the bot
|
||||
python run.py
|
||||
|
||||
# In Discord
|
||||
/join
|
||||
# Say: "Hey Jarvis, test response time"
|
||||
/status # Check latency stats
|
||||
|
||||
# Expected improvement:
|
||||
# - STT: ~2s → ~0.35s (6x faster)
|
||||
# - TTS: ~4s → ~0.9s (4x faster)
|
||||
# - Total: ~10s → ~4s (near 3.5s target!)
|
||||
```
|
||||
|
||||
### 5. Re-test Kani-TTS-2
|
||||
|
||||
```bash
|
||||
python test_kani_tts.py
|
||||
|
||||
# If successful:
|
||||
# - Compare quality with current Coqui XTTS v2
|
||||
# - Check if RTF ~0.2 achieved
|
||||
# - Decide if worth integrating
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
## Estimated Timeline
|
||||
|
||||
Based on historical GPU support addition:
|
||||
|
||||
| GPU Architecture | Release Date | PyTorch Support Added | Time Gap |
|
||||
|------------------|--------------|----------------------|----------|
|
||||
| Ampere (RTX 30) | Sep 2020 | Nov 2020 | 2 months |
|
||||
| Ada Lovelace (RTX 40) | Oct 2022 | Dec 2022 | 2 months |
|
||||
| Hopper (H100) | Mar 2023 | May 2023 | 2 months |
|
||||
| **Blackwell (RTX 50)** | **Jan 2025** | **Est: Mar-Apr 2025** | **2-3 months** |
|
||||
|
||||
**Conservative estimate:** March 2025 (1 month from now)
|
||||
**Optimistic estimate:** Late February 2025 (2 weeks)
|
||||
**Pessimistic estimate:** May 2025 (3 months)
|
||||
|
||||
---
|
||||
|
||||
## While You Wait
|
||||
|
||||
### Optimize Non-GPU Components
|
||||
|
||||
Focus on improvements that work on CPU:
|
||||
|
||||
1. **Query Routing** (already implemented)
|
||||
- Haiku for simple queries
|
||||
- Sonnet for medium
|
||||
- Opus for complex
|
||||
|
||||
2. **TTS Caching** (already implemented)
|
||||
- Pre-generate common phrases
|
||||
- Cache by hash
|
||||
|
||||
3. **Response Filtering**
|
||||
- Improve relevance detection
|
||||
- Reduce unnecessary responses
|
||||
|
||||
4. **Streaming Optimization**
|
||||
- Sentence-level playback
|
||||
- Parallel processing where possible
|
||||
|
||||
### Test Bot Logic
|
||||
|
||||
Even with slow performance, you can:
|
||||
- Test conversation flow
|
||||
- Debug agent personalities
|
||||
- Refine prompt engineering
|
||||
- Test Discord commands
|
||||
- Verify OpenClaw integration
|
||||
|
||||
### Prepare for GPU
|
||||
|
||||
- Read KANI_TTS_EVALUATION.md
|
||||
- Plan integration strategy
|
||||
- Review current TTS implementation
|
||||
- Identify optimization opportunities
|
||||
|
||||
---
|
||||
|
||||
## Contact/Support
|
||||
|
||||
**PyTorch Issues:** https://github.com/pytorch/pytorch/issues
|
||||
**PyTorch Forums:** https://discuss.pytorch.org/
|
||||
**NVIDIA Developer:** https://forums.developer.nvidia.com/
|
||||
|
||||
**Search for:** "RTX 5090 support" or "sm_120" or "Blackwell"
|
||||
|
||||
---
|
||||
|
||||
## Summary
|
||||
|
||||
✅ **Weekly:** Run quick test or check GitHub releases
|
||||
✅ **When ready:** Run `fix_pytorch_cuda.bat`
|
||||
✅ **Then:** Update config, test performance, evaluate Kani-TTS-2
|
||||
✅ **Expected:** March 2025 (1-2 months)
|
||||
|
||||
**Bookmark this file** and check weekly until GPU support lands!
|
||||
Loading…
Add table
Add a link
Reference in a new issue