Z Image Turbo API
Integrate high-performance AI image generation capabilities. This documentation provides complete API reference, integration guides, and code examples to help developers quickly build AI image applications.
Quick Start
Complete your first API call in just two steps: submit a generation task, then poll for status.
Submit Task
Send a POST request to the generate endpoint to get a Task ID.
Query Status
Use the Task ID to check progress until you get the image URL.
Authentication
Authorization: Bearer {YOUR_API_KEY}Note: Please keep your API Key secure and do not expose it in client-side code.
https://zimageturbo.ai/api/generate
Create a new image generation task. This endpoint is asynchronous and returns a task_id plus initial status; call /api/status to retrieve the final images.
Body Parameters
| Parameter Name | Type | Required | Description |
|---|---|---|---|
| prompt | string | Yes | Text prompt for generation (max 1000 characters). |
| aspect_ratio | string | Yes | Aspect ratio for the generated image. Allowed values: 1:1, 4:3, 3:4, 16:9, 9:16. |
{
"code": 200,
"message": "success",
"data": {
"task_id": "task_1234567890",
"status": "IN_PROGRESS"
}
}https://zimageturbo.ai/api/status
Query the execution status and results of a task.
Query Parameters
| Parameter Name | Required | Description |
|---|---|---|
| task_id | Yes | The ID returned when submitting the task. |
{
"code": 200,
"message": "success",
"data": {
"status": "SUCCESS",
"task_id": "xxxxxxxx",
"request": {
"prompt": "xxx",
"size": "1024*1024",
"seed": "-1"
},
"response": [
"https://xxx.com/sample.jpeg"
],
"consumed_credits": 15,
"created_at": "2025-12-05 13:05:09",
"error_message": null
}
}
Billing
Cost per generation
Failed tasks are not charged.
Error Handling
- 401 UnauthorizedInvalid API Key
- 402 Payment RequiredInsufficient balance
- 429 Too Many RequestsRate limit triggered
About Z Image Turbo
Z Image Turbo is a generation engine optimized based on the latest diffusion models. It focuses on increasing inference speed to 300% of traditional models while maintaining extremely high fidelity. Suitable for real-time interaction, game asset generation, and e-commerce design scenarios.
Z-Image-Turbo — 6B-parameter, ultra-fast text-to-image
Z-Image-Turbo is a 6B-parameter text-to-image model from Tongyi-MAI, engineered for production workloads where latency and throughput really matter. It uses only 8 sampling steps to render a full image, achieving sub-second latency on data-center GPUs and running comfortably on many 16 GB VRAM consumer cards.
Ultra-fast generation with production-ready quality
Where many diffusion models need dozens of steps, Z-Image-Turbo is aggressively optimised around an 8-step sampler. That keeps inference extremely fast while still delivering photorealistic images and reliable on-image text, making it a strong fit for interactive products, dashboards, and large-scale backends—not just offline batch jobs.
Why it looks so good?
- • Photorealistic output at speed — Generates high-fidelity, realistic images that work for product photos, hero banners, and UI visuals without multi-second waits.
- • Bilingual prompts and text — Understands prompts in English and Chinese, and can render multilingual text directly in the image—helpful for cross-market campaigns, posters, and screenshots.
- • Low-latency, low-step design — Only 8 function evaluations per image deliver extremely low latency, ideal for chatbots, configuration tools, design assistants, and any “click → image” experience.
- • Friendly VRAM footprint — Runs well in 16 GB VRAM environments, reducing hardware costs and making local or edge deployments more realistic.
- • Scales for bulk generation — Its efficiency makes large jobs—catalogues, continuous feed images, or auto-generated thumbnails—practical without blowing up compute budgets.
- • Reproducible generations — A controllable seed parameter lets you recreate a previous image or generate small, controlled variations for brand safety and experimentation.
How to use
- prompt – natural-language description of the scene, style, and any on-image text (English or Chinese).
- size (width / height) – choose the output resolution; supports square and rectangular images up to high resolutions (for example, 1536 × 1536).
- seed – set to -1 for random results, or use a fixed integer to make outputs reproducible.