02 What Your Agent Can Do_
An agent comes with built-in capabilities. Some work immediately. Others require setup. Here's what you have and what you need.
Built-In Skills (No Setup Required)
These work right now, in your agent's foundational layer:
| Capability | What It Does |
|---|---|
| File Operations | Read, write, edit, and organize files in your workspace |
| Shell Commands | Run commands in your terminal (in your container within HeyRon, not on your physical machine) |
| Web Search | Search the web using Brave API — instant lookups |
| Web Fetch | Pull readable content from URLs — paste a link, ask about it |
| Image Analysis | Analyze images you upload — describe, extract, answer questions |
| PDF Analysis | Read and extract information from PDF files |
Tools You Add (Optional But Powerful)
Beyond the built-ins, you can connect optional tools. These expand what your agent can do:
Communication
- AgentMail — Send and receive emails
- Discord/Slack — Chat in your communities
- Telegram — Mobile messaging integration
Storage & Version Control
- GitHub — Push, pull, and manage repositories
- here.now — Instant web publishing
Media & Transcription
- Whisper — Voice-to-text transcription
- Text-to-Speech — Audio output for replies
Advanced
- Cloudflare Workers — Serverless backend for custom workflows
Don't feel pressured to add everything. Start with what you need. You can add more later.
What Your Agent Actually Does
The real skill isn't the list of tools. It's the *approach*:
- Reads your files first — Understands your philosophy and priorities
- Reasons through structure — Navigates your workspace with intention
- Respects boundaries — Takes initiative on internal work, asks before external actions
- Maintains context — Remembers decisions and patterns across sessions
- Uses tools appropriately — Knows when to search, when to fetch, when to execute
- Partners, not serves — Treats collaboration as genuine teamwork
Your agent can't see your screen (unless you send it a screenshot), doesn't know what time it is (you have to tell it otherwise it is in UTC Zulu Time), and won't remember between sessions unless you build that memory (aka memoralize it in a file).
Tools ≠ Skills
A quick distinction that matters:
- Tools are permissions — what your agent *can* do (enabled in config)
- Skills are knowledge — what your agent *knows how* to do (built-in workflows)
Knowing how to use GitHub doesn't mean your agent can — you still need to connect your account with a token. The skill is the knowledge; the tool is the permission.
What's Next?
Now that you know what's possible, it's time to introduce yourself to your agent properly. That's the first conversation — the onboarding that teaches your agent who you are and how you work.