P
Prompt Master

Prompt 大师

掌握和 AI 对话的艺术

Image Generation

Image generation prompts (overview)

This section collects prompts for exploring image generation / multimodal capabilities. The focus is on iterative prompt refinement to gradually steer the output toward what you want. The core of image tasks: clear visual goal + structured description + repeatable iteration strategy.

Common Scenarios

  • Product visual direction exploration (style, mood, color palette)
  • Quick visual asset drafts (posters, covers, illustrations)
  • Brand identity generation (characters, mascots, IP)
  • Scenario concept validation (interior/street/futuristic)

Key Elements (spell these out every time)

  1. Subject: Person/object/scene
  2. Style: Realistic/illustration/pixel/3D/paper craft
  3. Composition: Angle, depth of field, subject position
  4. Lighting & color tone: Warm/cool, soft/hard, time of day
  5. Detail constraints: Materials, textures, key elements

Prompt Template (General)

Subject: {{SUBJECT}}
Style: {{STYLE}}
Composition: {{COMPOSITION}}
Lighting: {{LIGHTING}}
Color tone: {{COLOR_TONE}}
Details: {{DETAILS}}

Example 1: Brand Visual Direction

Subject: Hero poster for an AI learning brand
Style: Modern flat illustration, slight paper texture
Composition: Center-symmetric, main title centered, whitespace below
Lighting: Soft light, subtle gradient
Color tone: Warm palette, orange + off-white + dark blue accents
Details: Abstract circuit lines, learning elements (books, notes)

Example 2: Product Scene

Subject: Smart learning device on a desk
Style: Realistic photography
Composition: 45-degree overhead angle, shallow depth of field
Lighting: Afternoon natural light, entering from the left
Color tone: Neutral, slightly warm
Details: Wooden desk, coffee cup, sticky notes

Example 3: Character / IP

Subject: A cute AI assistant cat
Style: 3D cartoon
Composition: Front-facing half body, smiling
Lighting: Soft studio lighting
Color tone: Blue and white color scheme
Details: Wearing small headphones, simple logo on chest

Iteration Strategy

  • Rough first, details later: Lock down subject and style first, then add details
  • Positive first, then constraints: Describe "what you want" first, then add "what you don't want"
  • Change one variable at a time: Only tweak one element per iteration to isolate the effect
  • Log versions: Save each prompt for easy rollback and comparison

Common Problems & Fixes

  • Blurry subject: Add subject position and action descriptions
  • Style drift: Pin down style keywords + reference objects
  • Messy composition: Specify camera angle, depth of field, subject placement
  • Missing details: Add material and key element qualifiers

Index

📚 相关资源

❓ 常见问题

关于本章主题最常被搜索的问题,点击展开答案

图像生成 Prompt 应该写哪几个要素?

本章列了 5 个:主体(人物/物体/场景)、风格(写实/插画/像素/3D)、构图(视角/景深/主体位置)、光线(暖冷/柔硬/时间)、色调(主色 + 点缀色)、再加细节(材质、关键元素)。任何一个不写,模型就自由发挥;写齐了出图风格才稳定可控。

图像 Prompt 出图跑偏,最高效的修法是什么?

本章给了三条迭代铁律:先粗后细(先定主体+风格再补细节)、先正向再约束(先说要什么再说不要什么)、每次只改一个变量。常见错误是一次性改 5 个词,最后不知道是哪个起作用。建议保留每版 prompt 当 changelog,方便回滚。

Midjourney / Stable Diffusion / DALL-E 用同一套 Prompt 模板可以吗?

结构能复用,关键词不能。本章的「主体 / 风格 / 构图 / 光线 / 色调 / 细节」骨架对三家都成立,但权重语法(Midjourney 的 `::`、SD 的 `(word:1.3)`)和负面提示(SD 才支持 negative prompt 字段)不通用。换平台时复用骨架,细节再按各家文档调。

出图主体老是模糊或位置乱,Prompt 要怎么改?

三步:1) 增加主体的位置和动作描述(「中心对称,主标题居中,下方留白」);2) 指定镜头与景深(「45 度侧俯视,浅景深」);3) 用风格关键词锁定参考对象(「现代扁平插画,轻微纹理纸感」)。本章「品牌视觉海报」示例就是这种全要素结构。

做品牌视觉时,怎么用 Prompt 保证多张图风格一致?

把风格、色调、构图作为「锁定段」固定写在每条 Prompt 顶部,只换主体和细节。再加一个 style anchor 词组(例如「modern flat illustration, slight paper texture, warm palette: orange + cream + deep blue accent」),所有图都带这一段。能 reference image 的工具直接传同一张参考图最稳。