Inference

Send chat completion requests to managed inference endpoints. Supports text prompts and optional video URL inputs.

Command

cosmicac inference <subcommand> [options]

Subcommands

Subcommand	Description
chat	Chat with AI models

`inference chat`

Send chat completion requests to a managed inference endpoint. Supports text prompts and optional video URL inputs. Run without --message to start an interactive chat session.

Usage

cosmicac inference chat [options]

Options

Option	Description
`--api-key`	API key for authentication (prompted if not provided)
`--endpoint-id`	Inference endpoint ID
`--model`	AI model to use for completion (e.g., qwen3-vl-thinking-fp8-prod)
`--message`	Message to send to the model (omit for interactive mode)
`--video-url`	URL of video to analyze
`--stream`	Enable streaming response for real-time output

Command

Subcommands

inference chat

Usage

Options

On this page

`inference chat`