Text2Video

POST https://genai-api.picsart.io/v1/text2video

The Text2Video service generates a video from a text prompt supplied by the user. Because video generation is expensive in both time and cost, each request produces exactly one video (unlike the image generation services).
Because this service supports a large number of different models, some limitations apply. For example, some models (e.g. Grok Imagine Video / OVI) do not support disabling sound. With these models the resulting video is always generated with sound, regardless of the provided input parameters.
Similarly, different models support different resolutions and output quality settings, so the resolution and quality of the output video are not guaranteed to match the input parameters exactly. The service returns the closest supported match to the original request. To minimize deviations, we recommend sending test requests, checking the output video, and consulting the official documentation of the underlying model.

Body Params

prompt

string

required

The text prompt describing the video to generate.

width

number

64 to 1024

Defaults to 1024

Desired width used to determine the aspect ratio and resolution tier. The actual output width is determined by the AI provider based on the closest matching supported aspect ratio.
Supported ratios vary by model (common: 1:1, 4:3, 3:4, 16:9, 9:16, 21:9).

height

number

64 to 1024

Defaults to 1024

Desired height used to determine the aspect ratio and resolution tier. The actual output height is determined by the AI provider based on the closest matching supported aspect ratio.
Supported ratios vary by model (common: 1:1, 4:3, 3:4, 16:9, 9:16, 21:9).
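To make the "closest matching supported aspect ratio" idea concrete, here is a minimal sketch of how a requested width and height could be mapped to one of the common ratios listed above. The function name `closest_ratio` and the log-scale distance are illustrative assumptions; the matching rule actually used by each AI provider is not documented here and may differ.

```python
import math

# Common ratios listed above; the set actually supported depends on the model.
COMMON_RATIOS = {
    "1:1": 1 / 1,
    "4:3": 4 / 3,
    "3:4": 3 / 4,
    "16:9": 16 / 9,
    "9:16": 9 / 16,
    "21:9": 21 / 9,
}


def closest_ratio(width, height, ratios=COMMON_RATIOS):
    """Return the label of the ratio nearest to width/height.

    Distance is measured on a log scale so that landscape and portrait
    ratios are treated symmetrically. This metric is an illustrative
    choice, not the provider's documented behavior.
    """
    target = math.log(width / height)
    return min(ratios, key=lambda label: abs(math.log(ratios[label]) - target))
```

For example, `closest_ratio(1024, 576)` selects `"16:9"`, while a request such as 1024 by 800 falls nearest to `"4:3"` under this metric, which is one reason the output dimensions can differ from the requested ones.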

quality

string

enum

Defaults to 480p

Video output quality/resolution.

Allowed:

audio

boolean

Defaults to false

Whether to generate the video with audio.
Some models (e.g. Grok Imagine Video / OVI) do not support disabling sound. With these models the resulting video is always generated with sound, regardless of this parameter.

length

number

1 to 20

Defaults to 3

Desired video length in seconds. Mapped to the closest supported duration for the selected model.
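The mapping to the closest supported duration can be pictured with the short sketch below. The helper `closest_duration` and the example list of supported durations are hypothetical; the durations each model actually supports, and how ties are resolved, are not stated in this reference.

```python
def closest_duration(requested, supported):
    """Return the supported duration (in seconds) nearest to the requested one.

    Ties resolve to the shorter duration; that tie-break is an assumption.
    """
    # Sorting first makes min() pick the shorter duration when two are equally close.
    return min(sorted(supported), key=lambda seconds: abs(seconds - requested))
```

Under these assumptions, a model that only supported 5 and 10 second clips would turn the default `length` of 3 into a 5 second video.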

model

string | null

enum

Defaults to urn:air:wan:model:wan:wan-2.7-text-to-video@1

Optionally choose a specific AI model to use for this video generation.
If not specified, a default model will be applied. Please note that the default model may change over time as Picsart continues to improve performance and accuracy. Any change to the default will be made only after thorough testing and validation to ensure it delivers better results.
If you require consistent behavior or wish to evaluate different models on your own, we recommend explicitly setting this parameter.
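As a rough illustration of how the body parameters above fit together, the sketch below validates the documented ranges and builds a JSON POST request using only the Python standard library. The helper names `build_payload` and `build_request` are hypothetical, and the `X-Picsart-API-Key` header is an assumption, since authentication is not described in this section. The request is only constructed here, not sent, and `quality` is left out because its allowed values are not listed above.

```python
import json
import urllib.request

ENDPOINT = "https://genai-api.picsart.io/v1/text2video"


def build_payload(prompt, width=1024, height=1024, audio=False, length=3, model=None):
    """Validate the documented ranges and return the JSON body as a dict."""
    if not prompt:
        raise ValueError("prompt is required")
    for name, value in (("width", width), ("height", height)):
        if not 64 <= value <= 1024:
            raise ValueError(f"{name} must be between 64 and 1024")
    if not 1 <= length <= 20:
        raise ValueError("length must be between 1 and 20 seconds")
    payload = {
        "prompt": prompt,
        "width": width,
        "height": height,
        "audio": audio,
        "length": length,
    }
    # Omit `model` to let the service apply its current default model.
    if model is not None:
        payload["model"] = model
    return payload


def build_request(payload, api_key):
    """Build (but do not send) the POST request.

    The header name X-Picsart-API-Key is an assumption, not taken from this page.
    """
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "X-Picsart-API-Key": api_key,  # assumed header name
        },
        method="POST",
    )
```

Passing `model` explicitly, for instance the default identifier shown above, pins the behavior even if the service default changes later. The resulting object could then be handed to `urllib.request.urlopen` once a valid key is available.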

Responses

202 Accepted

400 Bad Request

401 Unauthorized

402 Payment Required

403 Forbidden

404 Not Found

405 Method Not Allowed

413 Request Entity Too Large

415 Unsupported Media Type

429 Too Many Requests

431 Request Header Fields Too Large

500 Internal Server Error

503 Service Unavailable
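A client usually wants to turn these status codes into a small number of actions. The sketch below is one possible grouping, not a policy defined by the service: the function name `classify_status`, the action labels, and the choice of which codes are worth retrying are all assumptions. Note that 202 means the request was accepted for processing rather than completed. How the finished video is retrieved is not covered in this section.

```python
RETRYABLE = {429, 500, 503}  # assumed to be worth retrying with backoff


def classify_status(status):
    """Map an HTTP status code from the list above to a coarse client action."""
    if status == 202:
        return "accepted"
    if status in RETRYABLE:
        return "retry"
    if status in (401, 402, 403):
        return "check_credentials_or_billing"
    if 400 <= status < 500:
        # Remaining 4xx codes (400, 404, 405, 413, 415, 431) point at the request itself.
        return "fix_request"
    return "unexpected"
```

Keeping the retryable set in one place makes it easy to adjust once the service's actual rate-limit and availability behavior has been observed.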
