Chat API
The chat API (/api/chat) handles design generation requests with support for reference images, garment mockups, and multiple AI models.
Endpoint
Request Body
Response
Example Request
Example Response
Two-Step Generation
The chat API uses a two-step process:1
Design Generation
AI generates clean design graphic from prompt
2
Mockup Compositing
Design placed on garment blanks (if provided)
Model Selection
| Model | Tokens | Quality | Speed |
|---|---|---|---|
nano-banana | 12 | Standard | Fast (~5s) |
nano-banana-pro | 45 | High | Medium (~15s) |
nano-banana-pro-4k | 90 | Print-ready | Slower (~25s) |
nano-banana. Pro model recommended for final designs.
Reference Images
Garment Blanks
Provide garment images for mockup generation:Design References
Provide existing designs for style reference:Combined
Token Validation
Before generation, the API checks token balance:Database Records
Each generation creates:- Thread (if new conversation)
- User message with prompt
- Assistant message with mockups
- Render job with generation details
- Render assets for each output image
- Usage ledger entry for tokens
Error Responses
| Status | Error | Description |
|---|---|---|
| 400 | ”Prompt is required” | Missing or too short prompt |
| 401 | ”Unauthorized” | No valid session |
| 402 | ”Token balance too low” | Insufficient tokens |
| 500 | ”Generation failed” | AI provider error |
Provider Fallback
If Replicate fails, the API falls back to fal.ai:Rate Limiting
- 10 requests per minute per user
- 100 requests per hour per project
- Enforced via middleware