gpt-image-2
GitHub通过AceDataCloud调用gpt-image-2生成及编辑图像。支持文本转图,以及将Logo、二维码等真实图片精准合成或重绘至新场景,保持角色一致性与文字清晰,适用于视频素材制作。
Trigger Scenarios
Install
npx skills add NeverSight/learn-skills.dev --skill gpt-image-2 -g -y
SKILL.md
Frontmatter
{
"name": "gpt-image-2",
"license": "Apache-2.0",
"metadata": {
"author": "acedatacloud",
"version": "1.0"
},
"description": "Generate and EDIT images with OpenAI gpt-image-2 via AceDataCloud API. Use when you need high-fidelity images from a prompt, or to edit\/composite existing images (e.g. fuse a real logo\/QR\/screenshot into a scene, keep characters consistent, restyle). Strong at legible text and faithful editing.",
"compatibility": "Requires ACEDATACLOUD_API_TOKEN in .env file (see _shared\/authentication.md)."
}
gpt-image-2 — Image Generation & Editing
OpenAI gpt-image-2 through AceDataCloud. Two endpoints, both synchronous (return image url(s) directly). Its standout is editing: feed real images (logos, QR codes, product shots, screenshots) and it composites/restyles them faithfully — great for on-brand video assets and character consistency.
Setup: See authentication for token setup.
1. Generate (text → image)
curl -X POST https://api.acedata.cloud/openai/images/generations \
-H "Authorization: Bearer $ACEDATACLOUD_API_TOKEN" \
-H "Content-Type: application/json" \
-d '{"model":"gpt-image-2","prompt":"a clean dark tech hero background with a glowing API hub, lots of negative space","size":"1792x1024","n":1}'
2. Edit / composite (images + prompt → image) ← the powerful one
Multipart. Pass one or more source images via repeated image[] (local files with
@, or URLs). Use it to fuse a real logo/QR into a generated scene, keep a subject
consistent across scenes, or restyle a screenshot.
curl -X POST https://api.acedata.cloud/openai/images/edits \
-H "Authorization: Bearer $ACEDATACLOUD_API_TOKEN" \
-F "model=gpt-image-2" \
-F "prompt=Place this logo crisply in the top-left on the tech background; keep the logo's exact colors and shape." \
-F "image[]=@background.png" \
-F "image[]=@logo.png" \
-F "size=1792x1024" \
-F "n=1"
Response (both endpoints): {"data":[{"url":"https://...png"}]} → download data[0].url.
Sizes
size is WxH (a preset) or "auto". Common presets:
| Aspect | Sizes |
|---|---|
| 16:9 | 1792x1024 (HD), 2048x1152, 3840x2160 (4K) |
| 9:16 | 1024x1792, 1152x2048, 2160x3840 |
| 1:1 | 1024x1024, 2048x2048, 4096x4096 |
(Omit size or use "auto" to let the model pick. Invalid sizes 400.)
Tips
- Editing keeps things faithful — to place a logo/QR exactly, pass it as one of the
image[]and say "keep its exact colors/shape, do not redraw it". - For character/scene consistency across video beats, generate one hero image, then
editsit per beat instead of regenerating from scratch. - Text in images renders legibly — good for titles/labels you don't want to overlay in HTML.
- Both endpoints are synchronous; no
/taskspolling.
Version History
- e0220ca Current 2026-07-05 22:54


