mirror of
https://github.com/NousResearch/hermes-agent.git
synced 2026-05-21 03:39:54 +00:00
252d68fd45
174 lines
5.2 KiB
Markdown
---
title: "Inference Sh Cli — Run 150+ AI apps via inference"
sidebar_label: "Inference Sh Cli"
description: "Run 150+ AI apps via inference"
---

{/* This page is auto-generated from the skill's SKILL.md by website/scripts/generate-skill-docs.py. Edit the source SKILL.md, not this page. */}

# Inference Sh Cli

Run 150+ AI apps via the inference.sh CLI (`infsh`): image generation, video creation, LLMs, search, 3D, and social automation. Uses the terminal tool. Triggers: inference.sh, infsh, ai apps, flux, veo, image generation, video generation, seedream, seedance, tavily.

## Skill metadata

| | |
|---|---|
| Source | Optional; install with `hermes skills install official/devops/cli` |
| Path | `optional-skills/devops/cli` |
| Version | `1.0.0` |
| Author | okaris |
| License | MIT |
| Platforms | linux, macos, windows |
| Tags | `AI`, `image-generation`, `video`, `LLM`, `search`, `inference`, `FLUX`, `Veo`, `Claude` |

## Reference: full SKILL.md

:::info
The following is the complete skill definition that Hermes loads when this skill is triggered. This is what the agent sees as instructions when the skill is active.
:::

# inference.sh CLI

Run 150+ AI apps in the cloud with a simple CLI. No GPU required.

All commands use the **terminal tool** to run `infsh` commands.

## When to Use

- User asks to generate images (FLUX, Reve, Seedream, Grok, Gemini image)
- User asks to generate video (Veo, Wan, Seedance, OmniHuman)
- User asks about inference.sh or infsh
- User wants to run AI apps without managing individual provider APIs
- User asks for AI-powered search (Tavily, Exa)
- User needs avatar/lipsync generation

## Prerequisites

The `infsh` CLI must be installed and authenticated. Check with:

```bash
infsh me
```

If it is not installed, install it and log in:

```bash
curl -fsSL https://cli.inference.sh | sh
infsh login
```

See `references/authentication.md` for full setup details.

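The installation check above can be scripted before any `infsh` call. The following is a minimal sketch, assuming only that the binary is named `infsh` and is found through `PATH`; `check_cli` is a hypothetical helper name, not part of the CLI.

```bash
# Hypothetical helper (not part of infsh): report whether a CLI is on PATH.
# Prints "installed" or "missing"; returns non-zero when the CLI is missing.
check_cli() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "installed"
  else
    echo "missing"
    return 1
  fi
}
```

A typical use would be `check_cli infsh && infsh me`, which only runs the authentication check when the binary is present.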
## Workflow

### 1. Always Search First

Never guess app names; always search to find the correct app ID:

```bash
infsh app list --search flux
infsh app list --search video
infsh app list --search image
```

### 2. Run an App

Use the exact app ID from the search results. Always use `--json` for machine-readable output:

```bash
infsh app run <app-id> --input '{"prompt": "your prompt here"}' --json
```

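When the prompt comes from user text, quotes inside it can break the `--input` JSON. The sketch below shows one way to build the payload safely. `json_prompt` is a hypothetical helper, it only escapes backslashes and double quotes, and it assumes the app takes a single `prompt` field as in the example above. Prompts containing newlines or control characters would need a real JSON encoder.

```bash
# Hypothetical helper: wrap a prompt in a {"prompt": "..."} JSON object.
# Escapes backslashes first, then double quotes, so the result stays valid JSON.
# Assumption: the prompt has no newlines or other control characters.
json_prompt() {
  escaped=$(printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g')
  printf '{"prompt": "%s"}\n' "$escaped"
}
```

It could then be used as `infsh app run <app-id> --input "$(json_prompt 'a sign that says "open"')" --json`.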
### 3. Parse the Output

The JSON output contains URLs to generated media. Present these to the user with `MEDIA:<url>` for inline display.

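The exact field names in the JSON output are not specified here, so the following sketch does not assume any. It scans the output text for anything that looks like an HTTP(S) URL and prefixes each match with `MEDIA:`. `media_lines` is a hypothetical helper; it will also pick up non-media URLs if the output contains them, so treat it as a starting point only.

```bash
# Hypothetical helper: read JSON text on stdin and print one MEDIA:<url>
# line per URL found. Assumption: URLs appear as quoted JSON string values.
media_lines() {
  grep -Eo 'https?://[^"[:space:]]+' | sed 's/^/MEDIA:/'
}
```

For example, piping the output of an `infsh app run ... --json` command into `media_lines` would list every URL it contains in the `MEDIA:<url>` form.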
## Common Commands

### Image Generation

```bash
# Search for image apps
infsh app list --search image

# FLUX Dev with LoRA
infsh app run falai/flux-dev-lora --input '{"prompt": "sunset over mountains", "num_images": 1}' --json

# Gemini image generation
infsh app run google/gemini-2-5-flash-image --input '{"prompt": "futuristic city", "num_images": 1}' --json

# Seedream (ByteDance)
infsh app run bytedance/seedream-5-lite --input '{"prompt": "nature scene"}' --json

# Grok Imagine (xAI)
infsh app run xai/grok-imagine-image --input '{"prompt": "abstract art"}' --json
```

### Video Generation

```bash
# Search for video apps
infsh app list --search video

# Veo 3.1 (Google)
infsh app run google/veo-3-1-fast --input '{"prompt": "drone shot of coastline"}' --json

# Seedance (ByteDance)
infsh app run bytedance/seedance-1-5-pro --input '{"prompt": "dancing figure", "resolution": "1080p"}' --json

# Wan 2.5
infsh app run falai/wan-2-5 --input '{"prompt": "person walking through city"}' --json
```

### Local File Uploads

The CLI automatically uploads local files when you provide a path:

```bash
# Upscale a local image
infsh app run falai/topaz-image-upscaler --input '{"image": "/path/to/photo.jpg", "upscale_factor": 2}' --json

# Image-to-video from local file
infsh app run falai/wan-2-5-i2v --input '{"image": "/path/to/image.png", "prompt": "make it move"}' --json

# Avatar with audio
infsh app run bytedance/omnihuman-1-5 --input '{"audio": "/path/to/audio.mp3", "image": "/path/to/face.jpg"}' --json
```

### Search & Research

```bash
infsh app list --search search
infsh app run tavily/tavily-search --input '{"query": "latest AI news"}' --json
infsh app run exa/exa-search --input '{"query": "machine learning papers"}' --json
```

### Other Categories

```bash
# 3D generation
infsh app list --search 3d

# Audio / TTS
infsh app list --search tts

# Twitter/X automation
infsh app list --search twitter
```

## Pitfalls

1. **Never guess app IDs**: always run `infsh app list --search <term>` first. App IDs change and new apps are added frequently.
2. **Always use `--json`**: raw output is hard to parse. The `--json` flag gives structured output with URLs.
3. **Check authentication**: if commands fail with auth errors, run `infsh login` or verify that `INFSH_API_KEY` is set.
4. **Long-running apps**: video generation can take 30 to 120 seconds. The terminal tool timeout should be sufficient, but warn the user it may take a moment.
5. **Input format**: the `--input` flag takes a JSON string. Wrap it in single quotes and escape any double quotes inside values.

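For the authentication pitfall, a quick pre-flight check can tell whether `INFSH_API_KEY` is present without printing its value. This is a minimal sketch; `api_key_status` is a hypothetical helper, and an unset key does not necessarily mean the CLI is unauthenticated, since `infsh login` may have been used instead.

```bash
# Hypothetical helper: print "set" or "unset" for INFSH_API_KEY
# without ever echoing the key itself.
api_key_status() {
  if [ -n "${INFSH_API_KEY:-}" ]; then
    echo "set"
  else
    echo "unset"
  fi
}
```

If it prints `unset` and commands still fail with auth errors, running `infsh login` is the next step.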
## Reference Docs

- `references/authentication.md`: Setup, login, API keys
- `references/app-discovery.md`: Searching and browsing the app catalog
- `references/running-apps.md`: Running apps, input formats, output handling
- `references/cli-reference.md`: Complete CLI command reference