Ivan NychyporukAI Media / Content Creator / VFX Digital Compositor / AI Integration Consultant

Project

AI Advertising Film — Character & Cinematography Prompts

2026video / advertising / workflow

A complete pipeline for AI-generated advertising — from character system design and era-specific scene prompts to sound design, voice-over, and final editing in a 30-second café commercial.

Context

This project was developed as part of the KIMUVA program. The goal: build a repeatable, cinematic AI video workflow that takes a creative concept through to a finished 30-second advertisement — visual footage, voice-over, and sound effects included.

Tools used: Google Veo 3.1 Fast · Adobe Firefly · Gemini Nano Banana · Suno · ElevenLabs · Adobe Premiere Pro

Success metrics:

  • Match rate vs. reference image per frame
  • Artefact count per shot
  • VO words-per-second vs. perceived calm

Character System — Waitress

A consistent character ("Mila") anchors the entire spot. Continuity is achieved through locked identity markers, not by re-using the same source image.

  • Identity markers: hair colour/length, skin tone, age group, posture, micro-expressions
  • Wardrobe base: café apron + neutral blouse + subtle accessory (repeatable across scenes)
  • Continuity markers: name tag, cup style, characteristic gesture — half-smile, eyes left toward camera
  • Variability is key: lighting and colour palette may shift; anatomy, silhouette and props stay locked
  • Prompt hygiene: precise descriptions; no contradictions; max 1–2 central props

"Mila, late-20s café server; fair skin with light freckles across nose; hazel eyes; chestnut-brown hair in a low ponytail with a few baby hairs; subtle natural makeup; oval face; small silver stud earrings and thin chain necklace; brown leather watch on right wrist; crisp white oxford shirt, sleeves rolled; navy cotton apron tied in a center knot; calm, kind, attentive; signature details: tiny beauty mark under left cheekbone; slightly frayed apron strap; gentle, reassuring smile."

Character reference – Mila, portrait 1Character reference – Mila, portrait 2

Cinematography Prompt — Opening Shot

The opening shot is generated from a single fully-specified prompt. Two generated outputs from the same prompt:

Café door – generated output ACafé door – generated output B

Prompt (excerpt): Wide Shot through closed café double-door glass from the street. Straight-on, eye level. 50 mm spherical, f/2, 1/100, ISO 200, Pro-Mist 1/8, CPL. Interior 2700K warm amber vs exterior 5600K cool blue/gray. Negative: teal door, bright daytime sun, oversized sign blocking face, motion blur, hands on counter outside.

Prompt Anatomy

Every shot follows the same six-part template:

ElementContent
FrameShot type, camera angle, composition rule
CameraFocal length (50–65 mm), shallow DOF, max 1 movement
Action1 clear subject action (set cup down, gentle glance)
LightingKey / fill / rim + practicals; softness, direction, contrast ratio
Palette & Texture3–5 colour accents; wet asphalt, porcelain, matte cotton
ConstraintsNo logos/text; negative prompts for unwanted geometry

Era Characters — 80s to Today

The commercial spans three time periods, each requiring its own character set with locked continuity parameters.

80s street scene

80s characters: roller skater (quad rollerskates, high-waist shorts, pastel windbreaker) · boombox carrier (colourful track jacket, cassette recorder) · skater (bleached print tee, flannel tied at waist) · office couple (shoulder-pad blazer, trenchcoat)

90s to present street scene

90s–Today characters: couple under umbrella (oversized denim jacket, plaid shirt) · runner with AirPods (turquoise functional shirt, fitness band) · e-scooter rider (quilted vest, helmet) · pedestrian with smartphone (olive parka, screen glow in rain)

Each era character has locked continuity: clothing palette, props, movement rhythm and hairstyle stay consistent across all shots they appear in.

Color & Lighting

Rainy café street – color and lighting reference

The commercial shifts from warm café interior to rainy street exterior, requiring a deliberate colour temperature transition:

  • Interior: 3000K warm amber — pendant bokeh, glazed tile reflections, steam haze
  • Exterior: ≈5600K cool blue/grey — wet pavement reflections, diffuse sky light, muted saturation
  • Correction notes: protect skin tones; soft halation; lift blacks gently; narrow lantern cone outdoors

Artifact Minimization

Coffee grinder – animation reference
  • Overloaded prompts → split into "1 subject + 1 action"
  • Ambiguous prop lists → detailed foreground / midground / background inventory
  • Phantom limbs / metal rods → add to negative prompt
  • Unstable composition → lock with reference image; run edge scans and tangent checks

For the coffee grinder rotation, keyframe checkpoints were specified at t=15%, 35%, 55%, 75%, 92% with a final elastic settle <5° — no description of the full arc, only the reference angles.

Time Transition Troubleshooting

The camera transitions from inside the café through the glass to the street in a single continuous shot. The fix: specifying dolly movement — not zoom — with explicit parallax instructions.

Time transition – inside caféTime transition – mid-movementTime transition – street revealed

Camera prompt (required): Slow forward dolly-in along the optical axis — not a zoom, no pan/tilt/roll. From t=0→90%, move forward ~40 cm; horizon steady. At t≈70–80% the window frame is completely out of view so the street is fully revealed. Show parallax: chalkboard/bike/cars scale up ~15–20%.

Packshot Design

The product (AURUM coffee tin) was designed and placed in-scene using Gemini Nano Banana, then animated with Google Veo 3.1 Fast.

Packshot – AURUM coffee tin

Nano Banana: package design generation, colour and font customisation, environment compositing Veo 3.1 Fast: motion graphics and product reveal animation

Sound Design

  • Diegetic sounds: quiet room ambience · soft street noise · cup set down · coffee grinder hum · steam hiss
  • Music: Sweet Silence (0.75×) — slow, dreamy, piano-dominated ballad; generated in Suno v4.5-all (6:08)
  • Dynamics: low base level, soft transient peaks only — no constant background noise

Voice-Over System

ElevenLabs voice settings

Cadence card:

TimecodeContent
:00–:08Scene introduction
:10–:18Product value
:20–:26Sensory line
:27–:30Brand name / CTA (very quiet)

Process: draft → trim to duration → cut ~40% for headroom → map to cadence card

TTS settings (ElevenLabs): speed slightly below normal · softer voice onset · soft sibilants — Voice: Jessica, Eleven Multilingual v2

Editing — Shot List

Shot 01 – Café openingShot 02 – 80s street energyShot 03 – First dropShot 04 – Time sweepShot 04a – Grinding beansShot 05 – 90s rain streetShot 06 – SteamShot 07 – Today streetShot 08 – Latte art finishShot 09 – Packshot
#ShotTimecode
1Café Opening0.00–2.50
280s Street Energy2.50–6.00
3First Drop6.00–8.00
4Quick Camera Sweep Through Time8.00–11.00
4aGrinding the Beans11.00–13.00
5Street in the 90s — Rain13.00–16.00
6Steam Cleaning16.00–19.00
7The Street Today19.00–22.00
8Latte Art Finish22.00–25.50
9Packshot25.50–30.00
Adobe Premiere Pro editing timeline

Outcome

A repeatable end-to-end workflow: Concept → Character system → Prompt engineering → Generation → Sound & VO → Edit.

Results: higher image accuracy; consistent colour palette with rainy-mood transitions; reduced artefact count; VO speaking rate matched to the calm editing pace.

What to repeat:

  • Multi-take sequences (2 × 4-second takes per shot)
  • Alternative lens focal lengths per scene
  • Locked colour palette as a generation anchor

Final checklist: Composition ✓ | Colour palette ✓ | Artefacts ✓ | SFX background ✓ | VO ✓