Text2Video

The Text2Video service helps generate a video based on the text introduced as input by the user. Considering the time and cost to generate videos, only one video will be generated per request (unlike the image generation services). Note, that considering the big number of different models that are supported with this service, some limitations may apply. For example, some models (e.g. Grok Imagine Video / OVI) don't support the option to disable sound. With these models the result video will always be generated regardless to the provided input params. Similarly, note that different models do support different resolutions and output video quality parameters. In this case as well, the output video's resolutions and quality are not guaranteed to be exactly the same as provided input params. The service will provide what's "closer" to the original request. To make sure there are minimal deviations, we recommend doing test requests and checking the output video as well as doing research on original model's official documentation.

Recent Requests
Log in to see full request history
TimeStatusUser Agent
Retrieving recent requests…
LoadingLoading…
Body Params
string
required

The prompt.

number
64 to 1024
Defaults to 1024

Desired width used to determine the aspect ratio and resolution tier. The actual output width is determined by the AI provider based on the closest matching supported aspect ratio. Supported ratios vary by model (common: 1:1, 4:3, 3:4, 16:9, 9:16, 21:9).

number
64 to 1024
Defaults to 1024

Desired height used to determine the aspect ratio and resolution tier. The actual output height is determined by the AI provider based on the closest matching supported aspect ratio. Supported ratios vary by model (common: 1:1, 4:3, 3:4, 16:9, 9:16, 21:9).

string
enum
Defaults to 480p

Video output quality/resolution.

Allowed:
boolean
Defaults to false

Generate the video with audio or not. Some models (e.g. Grok Imagine Video / OVI) don't support the option to disable sound. With these models the result video will always be generated regardless to the provided input params.

number
1 to 20
Defaults to 3

Desired video length in seconds. Mapped to the closest supported duration for the selected model.

string | null
enum
Defaults to urn:air:wan:model:wan:wan-2.7-text-to-video@1

Optionally choose a specific AI model to use for this video generation. If not specified, a default model will be applied. Please note that the default model may change over time as Picsart continues to improve performance and accuracy. Any change to the default will be made only after thorough testing and validation to ensure it delivers better results. If you require consistent behavior or wish to evaluate different models on your own, we recommend explicitly setting this parameter.

Responses

Language
Credentials
Header
LoadingLoading…
Response
Click Try It! to start a request and see the response here! Or choose an example:
application/json