Inference
Send chat completion requests to managed inference endpoints. Supports text prompts and optional video URL inputs.
Command
cosmicac inference <subcommand> [options]Subcommands
| Subcommand | Description |
|---|---|
| chat | Chat with AI models |
inference chat
Send chat completion requests to a managed inference endpoint. Supports text prompts and optional video URL inputs. Run without --message to start an interactive chat session.
Usage
cosmicac inference chat [options]Options
| Option | Description |
|---|---|
--api-key | API key for authentication (prompted if not provided) |
--endpoint-id | Inference endpoint ID |
--model | AI model to use for completion (e.g., qwen3-vl-thinking-fp8-prod) |
--message | Message to send to the model (omit for interactive mode) |
--video-url | URL of video to analyze |
--stream | Enable streaming response for real-time output |