Nano Banana Pro Prompt Engineering: The Complete Guide (2026)
The difference between a mediocre AI image and a stunning one is almost never the model. It is the prompt.
When Sophie, a content manager at a travel startup, first tried Nano Banana Pro, she typed "beautiful beach sunset" and got a generic, forgettable image. Two weeks later, after learning prompt structure, she typed: "Aerial drone shot of a secluded turquoise cove in the Maldives at golden hour. Crystal clear water reveals coral beneath the surface. A single overwater villa with thatched roof sits at the edge of frame. Cinematic color grading with warm amber highlights and cool teal shadows. Shot on a medium format camera, 16:9 aspect ratio."
The result looked like a National Geographic cover.
Same model. Same platform. Completely different output. The only variable was prompt quality.
This guide teaches you the prompt engineering techniques that produce professional output consistently. By the end, you will write prompts that generate usable images on the first or second attempt, not the tenth.
Practice as you read: Open the AI Generator with 12 free credits.
The Prompt Structure Formula
Every effective Nano Banana prompt follows this structure:
[Subject] + [Action/Pose] + [Setting] + [Lighting] + [Camera/Lens] + [Style/Mood]Each element adds specificity. The more specific you are, the closer the output matches your vision.
Element Breakdown
| Element | What to specify | Example |
|---|---|---|
| Subject | Who or what, with detailed appearance | "A 30-year-old Korean woman with short black hair wearing a cream linen blazer" |
| Action/Pose | What they are doing | "Leaning against a cafe counter, holding an espresso cup" |
| Setting | Where, with environmental details | "In a minimalist Scandinavian coffee shop with light wood and white walls" |
| Lighting | Type, direction, quality | "Soft diffused window light from the left, warm morning tones" |
| Camera/Lens | Focal length, angle, depth | "Shot on an 85mm lens at f/1.8, eye level, shallow depth of field" |
| Style/Mood | Aesthetic and feeling | "Editorial fashion photography, muted earth tones, calm and confident" |
Bad Prompt vs Good Prompt
Bad: "A woman in a coffee shop"
This is too vague. The model fills in every unspecified detail randomly, producing unpredictable output.
Good: "A confident 30-year-old Korean woman with short black hair wearing a cream linen blazer, leaning against a cafe counter holding an espresso cup. Minimalist Scandinavian coffee shop with light wood and white walls. Soft diffused window light from the left, warm morning tones. Shot on an 85mm lens at f/1.8, eye level, shallow depth of field. Editorial fashion photography, muted earth tones."
Every detail you specify is one less random choice the model makes.
5 Essential Prompting Techniques
1. Positive Framing (Say What You Want, Not What You Do Not)
The model responds better to additive descriptions than negations.
Bad: "A portrait with no background clutter and no harsh shadows" Good: "A portrait against a clean, soft gradient background with gentle, diffused lighting"
2. Camera and Lens Specifications
Camera terminology gives you precise control over composition and feel:
| Specification | Effect |
|---|---|
| "35mm lens" | Wide angle, environmental context |
| "50mm lens" | Natural perspective, versatile |
| "85mm lens" | Portrait compression, soft background |
| "200mm telephoto" | Compressed background, isolated subject |
| "f/1.4 or f/1.8" | Extreme background blur (bokeh) |
| "f/8 or f/11" | Everything in sharp focus |
| "Low angle" | Subject appears powerful, imposing |
| "High angle" | Subject appears vulnerable, environment emphasis |
| "Eye level" | Neutral, relatable perspective |
| "Bird's eye / aerial" | Looking straight down |
3. Lighting Vocabulary
Lighting is the single most impactful prompt element after the subject:
| Lighting Type | Look |
|---|---|
| "Golden hour" | Warm, romantic, soft |
| "Blue hour" | Cool, cinematic, moody |
| "Rembrandt lighting" | Dramatic, one side lit |
| "Flat lighting" | Even, commercial, product photography |
| "Backlighting" | Rim light, silhouettes, glow |
| "Neon lighting" | Cyberpunk, urban, colorful |
| "Overcast diffused" | Soft, even, no harsh shadows |
| "High-key" | Bright, airy, minimal shadows |
| "Low-key" | Dark, dramatic, high contrast |
4. Text Rendering Rules
When you need text in your image:
Include the exact text in quotation marks: "SUMMER SALE"
Specify typography: "bold sans-serif white text"
Define position: "centered at the top"
Keep it short: 3-5 words maximum
Specify background contrast: "on a dark overlay"Example prompt with text: "A sleek product advertisement banner. Dark gradient background transitioning from navy to black. A pair of white wireless earbuds floating in center frame with soft studio lighting. Bold white sans-serif text reading 'SALE 50% OFF' at the top center. Minimal, premium advertising aesthetic. 16:9 aspect ratio."
5. Reference Image Mastery
Nano Banana supports up to 14 reference images per generation. Use them for:
- Style matching: Upload an image whose style you want to replicate
- Product consistency: Upload your actual product photo as reference
- Character consistency: Keep the same person or mascot across multiple images
- Scene composition: Show the layout you want the AI to follow
Pro tip: Combine 2-3 reference images with a detailed text prompt for maximum control. Use one reference for style, one for composition, and let the text prompt handle specific details.
10 Prompt Examples: Before and After
1. Product Photography
- Before: "A watch on a table"
- After: "A luxury men's chronograph watch with a black dial and rose gold case, resting on a dark slate surface. Three-quarter angle. Single directional warm spotlight from upper right creating dramatic shadows. The watch face reflects soft ambient light. Dark moody product photography with shallow depth of field. 4K resolution."
2. Food Photography
- Before: "A bowl of ramen"
- After: "Steaming bowl of tonkotsu ramen in a ceramic bowl. Rich creamy pork broth with chashu pork, soft-boiled egg cut in half, nori, bamboo shoots, and sliced scallions. Shot from 45 degrees at a wooden counter in a dimly lit ramen shop. Warm overhead lighting with visible steam. Editorial food photography, moody and appetizing."
3. Portrait
- Before: "A business portrait"
- After: "Professional headshot of a confident middle-aged Black man with a short beard, wearing a charcoal suit and open-collar white shirt. Soft three-quarter lighting from the left against a neutral gray gradient background. Shot on an 85mm lens at f/2.0. Corporate editorial style, approachable but authoritative."
4. Architecture
- Before: "A modern house"
- After: "A contemporary minimalist residence with floor-to-ceiling glass walls and a cantilevered concrete roofline, nestled into a hillside overlooking the Pacific Ocean. Shot at blue hour with interior warm lights glowing. Infinity pool reflecting the twilight sky. Architectural photography, 24mm wide angle, symmetrical composition."
5. Social Media Ad
- Before: "An ad for skincare"
- After: "Clean minimalist product advertisement featuring a frosted glass serum bottle against a soft pink gradient background. Two drops of golden serum in mid-air beside the bottle. Soft studio lighting with no shadows. Bold white text reading 'GLOW SERUM' at the top. Premium beauty brand aesthetic. Square 1:1 format."
6-10. More Examples
Browse the full Prompt Gallery for 500+ production-ready examples across categories: product ads, portraits, 3D renders, anime, architecture, food, fashion, and more.
Advanced Techniques
Multi-Reference Composition
Upload multiple reference images for complex scenes:
- Reference 1: Your product (maintains appearance)
- Reference 2: Background scene you want
- Reference 3: Lighting style reference
Text prompt ties them together: "Place [product from ref 1] in [scene from ref 2] with [lighting from ref 3]."
Iterative Refinement Workflow
- Start at 0.5K with Nano Banana 2 for rapid concept testing (2 seconds per image)
- Generate 10-15 variations to explore different directions
- Pick the best 2-3 and regenerate at 2K for review
- Finalize at 4K with Nano Banana Pro for maximum quality
This workflow costs less and produces better results than trying to get a perfect 4K image on the first attempt.
Hiroshi, a game concept artist in Osaka, developed a variant of this workflow. He generates 20+ rough concepts at 0.5K in under a minute, picks three, refines the prompts based on what worked, then generates finals at 4K with Pro. "My concept exploration phase went from 2 hours of sketching to 5 minutes of prompting," he told us. "The AI handles exploration. I handle curation and refinement."
FAQ
Do I need to learn programming for prompt engineering?
No. Prompt engineering for image generation is entirely text-based. If you can describe what you want to see in detailed English, you can engineer effective prompts.
Can I save and reuse prompts?
You can copy prompts from the Prompt Gallery and modify them for your needs. Save your best prompts in a document for reuse.
Do prompts work the same on Nano Banana 2 and Nano Banana Pro?
Yes. Prompts are compatible across both models. You may notice slight differences in interpretation, but the same prompt structure works for both.
What's Next
- Practice now: AI Image Generator with 12 free credits
- Copy tested prompts: 500+ Prompt Templates
- Read the docs: Nano Banana Documentation
- Compare models: Nano Banana 2 vs Pro
Prompt engineering is not talent. It is a learnable skill. Start with the formula, use the examples as templates, and iterate. Within a dozen prompts, you will be generating images that look like they came from a professional studio.
Prompt examples are guidelines. Results may vary based on model selection and generation randomness.

