HeyRon@partner $ ~tool-setup whisper

Whisper Setup_

Voice-to-text transcription. Send voice memos and let your agent turn them into notes, summaries, or actionable items. Perfect for hands-free input.

Skill Level Beginner

Time 5–10 minutes

Cost Free (with rate limits)

Docs openai.com/research/whisper

What It Does

Whisper enables:

Visit platform.openai.com, sign up, and generate an API key in your account settings.

Store your key in the workspace .env:

OPENAI_API_KEY=sk-your_key_here

Your agent should already have Whisper support. Test it by sending a voice message (if your platform supports audio upload).

Send a short voice memo. Your agent should respond with the transcribed text:

Voice memo: "Remember to review the project report"

Agent responds: "Transcribed: 'Remember to review the project report'"

Successful setup means:

API key invalid: Verify your OpenAI key is active and has sufficient credits.
Audio not uploading: Check your messaging app supports audio and file sizes are under limits (typically 25MB).
Poor transcription quality: Speak more clearly or use shorter phrases.

💡 Tip: Use Whisper for quick capture—ideas while driving, reminders while cooking, notes during meetings. Your agent transcribes and archives them.