Voice bot can now see the users camera or screen share when asked. Captures a single frame, encodes as JPEG, sends to Sonnet vision with full context (transcript + document). Triggered by phrases like schau mal, siehst du das, can you see this. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
14 lines
313 B
Plaintext
14 lines
313 B
Plaintext
livekit-agents>=1.4,<2.0
|
|
livekit-plugins-openai>=1.4,<2.0
|
|
livekit-plugins-elevenlabs>=1.4,<2.0
|
|
livekit-plugins-silero>=1.4,<2.0
|
|
livekit>=1.0,<2.0
|
|
livekit-api>=1.0,<2.0
|
|
matrix-nio[e2e]>=0.25,<1.0
|
|
canonicaljson>=2.0,<3.0
|
|
httpx>=0.27,<1.0
|
|
openai>=2.0,<3.0
|
|
pymupdf>=1.24,<2.0
|
|
python-docx>=1.0,<2.0
|
|
Pillow>=10.0,<12.0
|