Optional
backgroundOptional
includeDirect instruction of the UI action to perform (e.g., 'click the login button', 'input username in the email field', 'scroll down', 'swipe left')
Optional
optionsAction common options
Optional
outputOptional
presigned⚠️ DEPRECATED: Use options.screenshot.presignedExpiresIn
instead.
Presigned url expires in. Only takes effect when outputFormat is storageKey.
This field will be ignored when options.screenshot
is provided.
Supported time units: ms (milliseconds), s (seconds), m (minutes), h (hours) Example formats: "500ms", "30s", "5m", "1h" Default: 30m
Optional
screenshot⚠️ DEPRECATED: Use options.screenshot.delay
instead. This field
will be ignored when options.screenshot
is provided.
Delay after performing the action, before taking the final screenshot.
Execution flow:
Example: '500ms' means wait 500ms after the action before capturing the final screenshot.
Supported time units: ms (milliseconds), s (seconds), m (minutes), h (hours) Example formats: "500ms", "30s", "5m", "1h" Default: 500ms Maximum allowed: 30s
Optional
settingsAI action settings
Optional
streamWhether to stream progress events using Server-Sent Events (SSE). When true, the API returns an event stream. When false or omitted, the API returns a normal JSON response.
The background of the UI action to perform. The purpose of background is to let the action executor to understand the context of why the instruction is given including important previous actions and observations