Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions runpodctl/reference/runpodctl-hub.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -38,7 +38,7 @@ runpodctl hub list --owner runpod
#### List flags

<ResponseField name="--type" type="string">
Filter by deployment type (`POD` or `SERVERLESS`).
Filter by deployment type (`POD` or `SERVERLESS`). This filter is applied client-side, so `--limit` may return fewer results than specified.
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added client-side filtering note for --type flag based on PR #262 in runpod/runpodctl. The code in cmd/hub/list.go:198 and cmd/hub/search.go:274 shows the filter is applied client-side after API results are fetched.

Source: runpod/runpodctl#262

</ResponseField>

<ResponseField name="--category" type="string">
Expand Down Expand Up @@ -83,7 +83,7 @@ runpodctl hub search stable-diffusion --limit 5
#### Search flags

<ResponseField name="--type" type="string">
Filter by deployment type (`POD` or `SERVERLESS`).
Filter by deployment type (`POD` or `SERVERLESS`). This filter is applied client-side, so `--limit` may return fewer results than specified.
</ResponseField>

<ResponseField name="--category" type="string">
Expand Down
11 changes: 10 additions & 1 deletion runpodctl/reference/runpodctl-serverless.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -68,9 +68,14 @@ runpodctl serverless create --name "my-endpoint" --template-id "tpl_abc123"
# Create from a Hub repo
runpodctl hub search vllm # Find the hub ID
runpodctl serverless create --hub-id cm8h09d9n000008jvh2rqdsmb --name "my-vllm"

# Create from a Hub repo with custom environment variables
runpodctl serverless create --hub-id cm8h09d9n000008jvh2rqdsmb --name "my-vllm" \
--env MODEL_NAME=meta-llama/Llama-3.1-8B-Instruct \
--env MAX_TOKENS=4096
```

When using `--hub-id`, GPU IDs and container disk size are automatically pulled from the Hub release config. You can override the GPU type with `--gpu-id`.
When using `--hub-id`, GPU IDs and container disk size are automatically pulled from the Hub release config. You can override the GPU type with `--gpu-id`. Environment variables from the Hub release are included automatically, and you can override or add to them with `--env`.

<Note>

Expand Down Expand Up @@ -150,6 +155,10 @@ Enable or disable flash boot for faster worker startup. When enabled, workers st
Execution timeout in seconds. Jobs that exceed this duration are terminated. The CLI accepts seconds but converts to milliseconds internally.
</ResponseField>

<ResponseField name="--env" type="string">
Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added --env flag documentation based on PR #262 in runpod/runpodctl. The flag uses KEY=VALUE format, supports multiple values, and overrides Hub release defaults when using --hub-id.

Source: runpod/runpodctl#262

Environment variable in `KEY=VALUE` format. Use multiple `--env` flags to set multiple variables. When deploying from `--hub-id`, these values override the Hub release defaults.
</ResponseField>

### Update an endpoint

Update endpoint configuration:
Expand Down
Loading