100x more powerful than OpenHands, Claude Code, and Hermes
- Overview
- Features
- Quick Start
- Configuration
- Dashboard
- Architecture
- Stats
- Comparison
- Contributing
- Links
- License
Z.AGENT is an autonomous desktop agent that controls your computer while you're away. Send tasks in natural language via Telegram (text or voice); the agent plans, executes, and notifies you, using any of 10 LLM providers with automatic fallback.
Use cases: file organization, email management, meeting prep, browser automation, system maintenance, scheduled tasks, document research, voice control — all from your phone.
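To make the "automatic fallback" idea concrete, here is a minimal sketch of a provider chain that tries each provider in order and returns the first successful reply. It is not Z.AGENT's actual implementation: `complete_with_fallback`, `AllProvidersFailed`, and the two stand-in provider functions are hypothetical names chosen for illustration.

```python
from typing import Callable, Sequence


class AllProvidersFailed(RuntimeError):
    """Raised when every provider in the chain has failed."""


def complete_with_fallback(
    prompt: str,
    providers: Sequence[tuple[str, Callable[[str], str]]],
) -> tuple[str, str]:
    """Try each (name, call) pair in order and return (provider_name, reply)."""
    errors: list[str] = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # a real client would catch narrower errors
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))


# Stand-in providers (assumptions); real ones would call an LLM API.
def flaky_primary(prompt: str) -> str:
    raise TimeoutError("request timed out")


def working_fallback(prompt: str) -> str:
    return f"echo: {prompt}"


if __name__ == "__main__":
    chain = [("zai", flaky_primary), ("openai", working_fallback)]
    print(complete_with_fallback("organize my downloads", chain))
```

Collecting the per-provider errors before raising keeps the failure message useful when the whole chain is down.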
- ReAct Loop — Reason → Act → Observe → Critique. Adapts to failures, replans mid-task
- Multi-Agent Orchestrator — Spawns specialized sub-agents (researcher, coder, file_organizer) running in parallel
- Skill Library — Agent learns reusable skills from successful tasks
- Auto Skill Creator — Automatically detects recurring patterns and creates skills
- Native Tool Calling — Multi-round function calling (GLM, OpenAI, Claude, Mistral)
- Long-term Conversation Context — Persistent multi-task memory with summary compaction
- Vector Memory — Semantic long-term memory with embeddings, cosine similarity
- Conversation Context — Per-session context that compacts old turns
- Skill Library — Saved action sequences reusable across tasks
- Knowledge Base (RAG) — Embed your documents (PDF, DOCX, TXT) for semantic search
- 10 LLM Providers — z.ai, OpenAI, Anthropic, Mistral, NVIDIA, Groq, DeepSeek, Ollama, Together, Fireworks
- Telegram Bot — Text + voice messages, proactive push notifications
- Webhooks — HTTP endpoints for GitHub, Stripe, Slack, IoT
- File Watcher — Trigger tasks on file system events
- MCP — Model Context Protocol client
- Plugin Marketplace — Install third-party plugins
- Cost Tracker — Track every API call's token usage and cost (USD)
- Audit Log — Append-only security trail of every action
- Activity Heatmap — GitHub-style 90-day activity grid
- Scheduled Tasks — Cron/interval/one-time recurring tasks
- Smart Suggestions — Predicts your next action
- Prompt Templates — 8 built-in + custom templates
- Backup & Restore — Full backup of all agent data
- Full autonomy mode (configurable) — or confirmation/whitelist/sandbox modes
- Protected paths — ~/.ssh, ~/.aws, system files never touched
- Safe delete — Trash by default
- Blocked actions — format disk, rm -rf /, etc. always refused
- Audit trail — Every action logged with redacted sensitive params
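The protected-path and blocked-action rules above could be enforced with a small pre-execution check. The sketch below is an assumption about how such a guard might look, not the project's code: `PROTECTED_DIRS`, `BLOCKED_COMMANDS`, `is_protected`, and `is_blocked` are hypothetical names, and the command matching is deliberately naive.

```python
from pathlib import Path

# Hypothetical policy values modeled on the bullets above.
PROTECTED_DIRS = [Path.home() / ".ssh", Path.home() / ".aws"]
BLOCKED_COMMANDS = ("rm -rf /", "mkfs", "format c:")


def is_protected(path: str) -> bool:
    """Return True if path is, or lies inside, a protected directory."""
    resolved = Path(path).expanduser().resolve()
    for root in PROTECTED_DIRS:
        root = root.resolve()
        if resolved == root or root in resolved.parents:
            return True
    return False


def is_blocked(command: str) -> bool:
    """Return True if the command equals a blocked pattern or starts with it as a whole word."""
    normalized = " ".join(command.lower().split())
    return any(
        normalized == pattern or normalized.startswith(pattern + " ")
        for pattern in BLOCKED_COMMANDS
    )


if __name__ == "__main__":
    print(is_protected("~/.ssh/id_rsa"))   # inside a protected directory
    print(is_blocked("rm   -rf  /"))       # whitespace is normalized first
    print(is_blocked("rm -rf /tmp/cache")) # a different target, not blocked
```

Resolving paths before comparing them means `..` segments and symlinks cannot be used to slip past the check. A production guard would need a far more thorough command parser than this prefix match.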
git clone https://github.com/AFKmoney/z-agent-desktop.git
cd z-agent-desktop
# Install
python -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activate
pip install -r requirements.txt
playwright install chromium
# Run (configure API keys from the dashboard — no .env editing needed!)
python main.py # Start (Telegram + Web API + Notifier)

Then open the dashboard at http://localhost:3000, click ⚙️ Settings, and configure your API keys directly in the app. You can also test each key before saving.

💡 No more .env editing: all 15 environment variables (10 LLM providers, Telegram, Email, Slack, SDK) can be configured from the dashboard Settings panel. Sensitive values are masked, and each LLM key can be tested with one click.
python main.py --cli # Interactive CLI mode
python main.py --task "..." # Run a single task
python main.py --check # Configuration check

The easiest way to configure Z.AGENT requires no file editing:
- Start the agent: python main.py
- Open the dashboard: http://localhost:3000
- Click ⚙️ Settings in the header
- Fill in your API keys (sensitive values are hidden by default)
- Click Test to verify each key works
- Click Save; the .env file is written automatically
- Restart the agent for changes to take effect
All 15 environment variables are configurable from the Settings panel, organized into 5 categories: LLM Providers, Telegram, Email, Agent Settings, Integrations.
You can also create a .env file manually:
# Required
ZAI_API_KEY=your-z.ai-key # https://z.ai/
# Telegram (recommended)
TELEGRAM_BOT_TOKEN=your-bot-token # @BotFather
# Email (optional)
EMAIL_USER=you@gmail.com
EMAIL_APP_PASSWORD=your-app-password # https://myaccount.google.com/apppasswords
# Multi-LLM providers (optional — add any you have)
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
MISTRAL_API_KEY=...
NVIDIA_API_KEY=nvapi-...
GROQ_API_KEY=gsk_...
DEEPSEEK_API_KEY=sk-...
TOGETHER_API_KEY=...
FIREWORKS_API_KEY=...
# Optional: use z.ai coding plan SDK
ZDA_USE_SDK=true

agent:
  use_react_loop: true # ReAct loop (recommended) or single-shot planner
  language: auto # auto | fr | en
zai:
  models:
    planner: glm-4.6 # Switch to glm-5.1 when ready
    vision: glm-4v
    executor: glm-4.5
llm_provider:
  primary: zai # Primary LLM provider
  fallbacks: [openai, anthropic, mistral] # Fallback chain

Cinematic command-center UI built with Next.js 16, TypeScript, Tailwind CSS 4, shadcn/ui, and Framer Motion.
cd dashboard
bun install
bun run dev

| Feature | Description |
|---|---|
| State Orb | Breathing/pulsing centerpiece that changes color with agent state |
| Thinking Stream | Live ReAct trace with timeline and typewriter cursor |
| Module Grid | 14 module tiles with per-module colors and hover glow |
| Command Palette | Cmd+K to submit tasks |
| Activity Heatmap | 90-day GitHub-style grid with streak counter |
| Cost Tracker | Total cost, API calls, per-model breakdown |
| Audit Log | Live security trail with blocked-only filter |
| Scheduled Tasks | CRUD for recurring tasks |
| Knowledge Base | Semantic search with score badges |
| LLM Provider Switcher | Switch primary, test connections |
| Prompt Templates | Browse and use templates |
| Backup Panel | Create and restore backups |
| Smart Suggestions | Predicted next actions |
| ⚙️ Settings Panel | Configure all API keys directly — test keys, no .env editing |
| Bilingual EN/FR | Toggle in header |
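The Knowledge Base row mentions semantic search with scores, and the feature list names cosine similarity for vector memory. As a rough illustration of how such a score can be computed and used for ranking, here is a dependency-free sketch. The function names and the tiny hand-written vectors are assumptions; real embeddings would come from an embedding model and have many more dimensions.

```python
import math
from typing import Mapping, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(
    query_vec: Sequence[float],
    index: Mapping[str, Sequence[float]],
    top_k: int = 3,
) -> list[tuple[str, float]]:
    """Rank stored vectors by similarity to the query, best first."""
    scored = [(cosine_similarity(query_vec, vec), doc_id) for doc_id, vec in index.items()]
    scored.sort(reverse=True)
    return [(doc_id, round(score, 3)) for score, doc_id in scored[:top_k]]


if __name__ == "__main__":
    # Toy 2-dimensional "embeddings" keyed by hypothetical document names.
    index = {"invoice.pdf": [1.0, 0.0], "notes.txt": [0.0, 1.0], "report.docx": [1.0, 1.0]}
    print(search([1.0, 0.0], index, top_k=2))
```

Cosine similarity ignores vector length and compares direction only, which is why it is a common choice for comparing embeddings of texts with different lengths.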
┌─────────────────────────────────────────────────────────────┐
│ Telegram / Dashboard / CLI / Webhooks │
└────────────────────────┬────────────────────────────────────┘
│ tasks (natural language)
▼
┌─────────────────────────────────────────────────────────────┐
│ AGENT CORE │
│ ┌───────────┐ ┌──────────┐ ┌────────────────────────┐ │
│ │ ReAct │→ │ Executor │→ │ Conversation Context │ │
│ │ Loop │ │ │ │ (long-term memory) │ │
│ └─────┬─────┘ └────┬─────┘ └────────────────────────┘ │
│ │ │ │
│ ┌─────┴─────┐ ┌─────┴──────────────────────────────┐ │
│ │ Multi-LLM │ │ 16 MODULES (88 actions) │ │
│ │ Provider │ │ screen files email calendar │ │
│ │ (10 prov) │ │ browser system windows │ │
│ └───────────┘ │ code web voice vision │ │
│ ┌───────────┐ │ plugin mcp slack kb │ │
│ │ Vector │ └────────────────────────────────────┘ │
│ │ Memory │ │
│ └───────────┘ │
│ ┌───────────┐ ┌───────────┐ ┌───────────┐ │
│ │ Auto-Skill│ │ File │ │ Webhooks │ │
│ │ Creator │ │ Watcher │ │ │ │
│ └───────────┘ └───────────┘ └───────────┘ │
│ ┌───────────┐ ┌───────────┐ ┌───────────┐ │
│ │ Cost │ │ Audit │ │ Activity │ │
│ │ Tracker │ │ Log │ │ Tracker │ │
│ └───────────┘ └───────────┘ └───────────┘ │
└─────────────────────────────────────────────────────────────┘
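The ReAct loop at the top of the agent core (Reason → Act → Observe → Critique) can be sketched as a small driver that keeps a trace of steps and lets the reasoning function replan after a failed step. This is an illustrative skeleton under stated assumptions, not the project's code: `ReActLoop`, `Step`, and the toy callbacks are hypothetical, and a real `reason` function would call an LLM.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Step:
    thought: str
    action: str
    observation: str
    ok: bool


@dataclass
class ReActLoop:
    reason: Callable[[str, list[Step]], tuple[str, str]]  # (task, trace) -> (thought, action)
    act: Callable[[str], str]                             # action -> observation
    critique: Callable[[str], bool]                       # observation -> did it succeed?
    max_steps: int = 5
    trace: list[Step] = field(default_factory=list)

    def run(self, task: str) -> bool:
        """Loop until the reasoner says 'done' or the step budget runs out."""
        for _ in range(self.max_steps):
            thought, action = self.reason(task, self.trace)
            if action == "done":
                return True
            observation = self.act(action)
            ok = self.critique(observation)
            self.trace.append(Step(thought, action, observation, ok))
        return False


# Toy callbacks standing in for the LLM and the executor (assumptions).
def toy_reason(task: str, trace: list[Step]) -> tuple[str, str]:
    if not trace:
        return "try the first path", "list /missing"
    if not trace[-1].ok:
        return "that failed, replan", "list /home"
    return "goal reached", "done"


def toy_act(action: str) -> str:
    return "error: no such directory" if "missing" in action else "ok: 3 files"


def toy_critique(observation: str) -> bool:
    return observation.startswith("ok")


if __name__ == "__main__":
    loop = ReActLoop(toy_reason, toy_act, toy_critique)
    print(loop.run("list my files"), [s.action for s in loop.trace])
```

Passing the full trace back into `reason` is what allows mid-task replanning: the reasoner sees the failed observation and picks a different action on the next turn.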
| Metric | Value |
|---|---|
| Total actions | 88 |
| Modules | 16 |
| Core components | 26 |
| LLM providers | 10 |
| Dashboard panels | 22 |
| Languages | EN + FR |
| API endpoints | 50+ |
| Feature | Claude Code | OpenHands | Cursor | Hermes | Z.AGENT |
|---|---|---|---|---|---|
| Multi-LLM (10 providers) | — | — | — | — | ✅ |
| ReAct loop | ✅ | ✅ | — | — | ✅ |
| Vector memory | — | — | — | — | ✅ |
| Auto skill creator | — | — | — | — | ✅ |
| Code interpreter | ✅ | ✅ | — | — | ✅ |
| Web search | — | — | — | — | ✅ |
| Multi-agent | — | ✅ | — | — | ✅ |
| Voice control | — | — | — | — | ✅ |
| Webhooks | — | — | — | — | ✅ |
| File watcher | — | — | — | — | ✅ |
| MCP | — | — | — | — | ✅ |
| Plugin marketplace | — | — | — | — | ✅ |
| Cost tracker | — | — | — | — | ✅ |
| Audit log | — | — | — | — | ✅ |
| Activity heatmap | — | — | — | — | ✅ |
| Scheduled tasks | — | — | — | — | ✅ |
| RAG knowledge base | — | — | ✅ | — | ✅ |
| 100% Windows control | — | — | — | — | ✅ |
| Telegram remote | — | — | — | — | ✅ |
| Bilingual EN/FR | — | — | — | — | ✅ |
| Cinematic UI | Terminal | Basic | IDE | Basic | ✅ |
Contributions are welcome! Here's how to get started:
- Fork the repo
- Clone your fork: git clone https://github.com/your-username/z-agent-desktop.git
- Create a branch: git checkout -b feature/amazing-feature
- Commit: git commit -m 'Add amazing feature'
- Push: git push origin feature/amazing-feature
- Open a Pull Request
- 🌍 Translations — Add more languages (ES, DE, PT, IT, ZH, JA)
- 🔌 Plugins — Create new plugins (Spotify, Notion, Docker, etc.)
- 🌐 MCP servers — Add curated MCP server configurations
- 🧪 Tests — Add integration tests with mock LLM providers
- 📚 Docs — Improve documentation and examples
- 🐛 Bugs — Fix issues from the issue tracker
See open issues for ideas — look for the good first issue label if you're new.
| Resource | URL |
|---|---|
| Repository | https://github.com/AFKmoney/z-agent-desktop |
| Documentation | https://afkmoney.github.io/z-agent-desktop/ |
| PDF Manual | https://afkmoney.github.io/z-agent-desktop/Z-AGENT-Documentation.pdf |
| Discussions | https://github.com/AFKmoney/z-agent-desktop/discussions |
| Issues | https://github.com/AFKmoney/z-agent-desktop/issues |
| Actions CI | https://github.com/AFKmoney/z-agent-desktop/actions |
MIT — see LICENSE
Z.AGENT v4.0 — Powered by 10 LLM providers · 88 actions · 16 modules · 26 core components
Made with 🤖 by AFKmoney