The quality difference between a vague prompt and a specific one isn’t subtle. It’s the difference between something generic and something you’d actually use. Most people who are disappointed with AI image generation are working with prompts that don’t give the model enough to work with.
This cheat sheet covers the prompt components that consistently produce better results across most tools, organized so you can mix and match them for whatever you’re trying to create.
The AI Image Generator and similar tools respond well to all of these. Save this and reference it when results aren’t landing the way you expected.
The core structure
Every effective prompt has the same basic shape, even if the specific components vary:
Subject + Setting + Style + Lighting + Composition + Mood + Quality modifiers
You don’t need all of these every time. But the more dimensions you specify, the more control you have over the output.
Describing subjects
How you describe the main subject shapes everything downstream. The more specific, the better.
For people:
Instead of “a woman” try “a woman in her early 40s, relaxed expression, natural makeup, wearing a linen shirt, looking slightly off-camera.”
Instead of “a professional” try “a man in his 50s, confident expression, in a well-lit office, slight smile, not looking at the camera.”
Useful things to specify: approximate age, expression, posture, clothing style, whether they’re making eye contact, their mood.
For products:
Instead of “a perfume bottle” try “a minimal cylindrical glass perfume bottle, frosted cap, centered on a marble surface, soft light from the left.”
Instead of “food” try “a rustic ceramic bowl of açaí topped with granola, banana, and coconut, overhead flat lay, marble background.”
For environments:
Instead of “a city street” try “a narrow cobblestone street in an old European city at dusk, ambient yellow streetlight, few pedestrians.”
Instead of “nature” try “ancient redwood forest floor, morning mist, shafts of light filtering through the canopy, no people visible.”
Lighting descriptors
Lighting is the single variable with the biggest impact on whether an image looks professional or amateur. These terms all translate well across most tools.
| Lighting type | What it creates | Use for |
|---|---|---|
| Golden hour | Warm, directional, soft | Lifestyle, outdoor, food |
| Overcast natural light | Even, diffused, no harsh shadows | Portraits, products, editorial |
| Studio lighting | Clean, controlled | Products, headshots, commercial |
| Rim lighting | Subject outlined by light against dark background | Dramatic portraits, hero products |
| Soft box lighting | Evenly lit, flattering | Beauty, skincare, food |
| Dappled light | Spotted light through leaves | Nature, café settings, lifestyle |
| Neon and ambient urban | Moody, colorful | Editorial, nightlife, fashion |
| Low key | Dark overall, deep shadows | Dramatic, mysterious, cinematic |
| High key | Bright overall, minimal shadow | Optimistic, clean, commercial |
| Candle or firelight | Warm, intimate, flickering quality | Lifestyle, cozy, interior |
Style references
Telling the AI what style to target is one of the fastest ways to get coherent, distinctive results.
Photography styles:
Documentary, candid, editorial, commercial product, fine art, street, architectural, fashion editorial, portraiture, photojournalism.
Art and illustration styles:
Watercolor, flat design, vector illustration, oil painting, charcoal sketch, digital illustration, pixel art, ink drawing, linocut print, gouache.
Aesthetic references:
Art Deco, Bauhaus, Brutalist, Scandinavian minimalist, wabi-sabi, 1970s vintage, 1990s retro, cottagecore, dark academia, cyberpunk.
Camera and lens references that work well:
Shot on 35mm film. Leica aesthetic. Medium format. Anamorphic lens (produces cinematic widescreen distortion). Macro photography. Tilt-shift.
Composition and framing
These guide how the AI positions and frames the scene.
Close-up / extreme close-up: fills the frame with subject detail.
Wide shot: shows the subject in full environmental context.
Bird’s eye view: looking straight down.
Worm’s eye view: looking up at the subject, dramatic perspective.
Rule of thirds: subject offset to one side, compositionally balanced.
Centered / symmetrical: formal, architectural.
Over-the-shoulder: implies the perspective of another person in the scene.
Negative space: significant empty area around the subject, good for text overlay.
Leading lines: architectural or natural elements that draw the eye toward the subject.
Mood and atmosphere
These single words and short phrases shape the emotional quality of the output.
Warm and inviting: cozy, nostalgic, intimate, golden, rustic, homely.
Clean and professional: crisp, polished, minimal, precise, clinical.
Dramatic: brooding, cinematic, atmospheric, moody, intense.
Energetic: bold, vibrant, dynamic, electric, saturated.
Calm: peaceful, tranquil, serene, still, soft.
Luxurious: refined, opulent, premium, sophisticated.
Quality modifiers
These terms consistently push output quality higher across most models. Add two or three to the end of any prompt.
For resolution and detail: highly detailed, sharp focus, ultra-realistic, photorealistic, 8K resolution.
For professional finish: professional lighting, studio quality, magazine quality.
For composition: perfect composition, well-composed.
Negative prompts
Negative prompts tell the AI what to exclude. Use them to filter out common problems.
For general quality issues: blurry, out of focus, low resolution, pixelated, grainy, artifacts, watermark, text overlay.
For anatomy issues: extra fingers, missing fingers, deformed hands, distorted face, asymmetrical eyes, extra limbs, floating limbs.
For style contamination: add “cartoon, anime” as negatives when you want photorealism. Add “photorealistic, photograph” as negatives when you want illustration.
For composition issues: cropped heads, cut off, cluttered background.
Five complete prompts you can use or adapt
Professional portrait:
“Headshot of a woman in her late 30s, warm confident smile, natural makeup, wearing a dark blazer, soft neutral background, soft box studio lighting, shallow depth of field, Canon 85mm lens aesthetic, sharp focus, high key, professional quality”
Negative: “blurry, extra fingers, distorted face, watermark”
Product photography:
“Premium hand cream in a white ceramic jar, placed on a marble slab with dried eucalyptus and a linen cloth, overhead flat lay, soft diffused natural light, clean white background, minimalist aesthetic, sharp focus, commercial product photography”
Negative: “cluttered, shadows too harsh, plastic looking, low quality”
Lifestyle brand:
“Young couple laughing while cooking in a modern kitchen, warm evening light through large windows, candid photography style, depth of field, Kodak Portra film aesthetic, cozy and intimate”
Negative: “posed, artificial, stock photo feeling, overexposed”
Social media background:
“Abstract geometric shapes overlapping, soft circles and lines in dusty rose and cream, flat design, minimalist, high key, clean, suitable for text overlay, vertical format”
Negative: “busy, cluttered, dark, text, watermark”
Editorial concept:
“Conceptual image of creative thinking, scattered papers and open notebooks on a wooden desk, morning light from a large window, warm and focused atmosphere, overhead flat lay, documentary style”
Negative: “stock photo, staged, artificial lighting, plastic”
The quick reference
When something isn’t working, run through this list:
Is the subject described specifically enough?
Have I included a lighting reference?
Have I specified a style (photography or art direction)?
Have I described the composition or framing?
Have I added quality modifiers?
Have I used negative prompts to filter common problems?
One or two adjustments across these dimensions usually fixes the issue faster than a complete prompt rewrite.
