Drop in two reference images — your face and your favorite player's face — and end up with a chaotic, viral, half-blurred smartphone selfie taken seconds before security drags you off the pitch. Post-match floodlights, a packed stadium, photographers running, security closing in. Looks like a real once-in-a-lifetime moment, not a posed photo.
A two-reference image-to-image build that takes your face and your favorite player's face and rebuilds them into a chaotic, half-blurred smartphone selfie taken seconds after a pitch invasion: security closing in, photographers running, post-match floodlights, a packed stadium in the background. The whole shot reads as a genuine accidental viral moment, not a posed photo. The hero example applies the workflow to the aftermath of a World Cup 2026 final; swap in any player, tournament, or uniform you want.
The Workflow
THREE STEPS
01
Setup · Open The Model
GO TO NANO BANANA 2
Open your preferred platform with access to the Nano Banana 2 image model. This prompt is more demanding than the single-reference guides — it needs the model to handle two separate facial identity references simultaneously (you + the player) while rebuilding an entirely new chaotic scene around them. Nano Banana 2 handles this well. Pick the platform you already use (Higgsfield, fal.ai, or any AI studio that hosts Nano Banana 2) and start a fresh image edit.
Open a platform with Nano Banana 2
Start a new image edit / generation
Confirm multi-image input is supported
02
Input · Two Reference Images
UPLOAD TWO PORTRAITS
Upload two reference images in this exact order: Reference Image 1 is your face (the fan), Reference Image 2 is the player's face. The prompt is explicit about which reference belongs to which subject, so the order matters. Use clean, well-lit, head-on portraits for both. The prompt treats each photo as an identity reference (face, hairstyle, skin tone, body proportions) rather than a pose to copy, so the sharper the faces in the inputs, the more accurately both likenesses carry through.
Reference 1 = the fan (your face)
Reference 2 = the player's face
Both should be sharp & well-lit
Upload in the correct order
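If you run this from a script rather than a web UI, the upload order can be fixed in code so the two portraits never get swapped. The sketch below is a minimal illustration only: `build_reference_slots` and the `slot` / `role` / `path` field names are hypothetical, not any platform's actual API, and the file names are placeholders. Adapt the output to whatever multi-image input your platform expects.

```python
from pathlib import Path

# Hypothetical helper: the field names are illustrative, not a real platform API.
def build_reference_slots(fan_portrait: str, player_portrait: str) -> list:
    """Return the two reference images in the order the prompt expects."""
    if Path(fan_portrait) == Path(player_portrait):
        raise ValueError("Fan and player references must be different files.")
    return [
        {"slot": 1, "role": "fan", "path": fan_portrait},        # Reference Image 1
        {"slot": 2, "role": "player", "path": player_portrait},  # Reference Image 2
    ]

# Placeholder file names; print the mapping to check it before uploading.
slots = build_reference_slots("me.jpg", "player.jpg")
for ref in slots:
    print(f"Reference Image {ref['slot']} = {ref['role']} ({ref['path']})")
```

Printing the mapping before you upload is a cheap way to confirm that slot 1 really is the fan and slot 2 really is the player.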
03
The Brief · Full Prompt
PASTE THE PROMPT & ENJOY :)
Paste the full prompt below into Nano Banana 2 exactly as written and hit generate. It locks in the entire concept (chaotic pitch invasion, accidental smartphone framing, security closing in, photographers in the background) and explicitly maps each reference image to its subject (Reference 1 = you, Reference 2 = the player). The "NO" lines, both the expression block under SUBJECTS and the exclusion list at the end, are what keep the model from sliding into a posed, smiling, professional-portrait look. Keep the whole thing intact and run a few passes.
Nano Banana 2 Prompt — Full Brief
Reference Image 1 = fan identity reference.
Reference Image 2 = football player identity reference.
Use Reference Image 1 ONLY for the fan's face, hairstyle, skin tone, facial structure, age, clothing and body proportions.
Use Reference Image 2 ONLY for the football player's face, hairstyle, skin tone, facial structure, body proportions and appearance.
The fan must clearly resemble Reference Image 1.
The football player must clearly resemble Reference Image 2.
Create a genuine viral smartphone selfie captured during a chaotic football pitch invasion immediately after a FIFA World Cup 2026 match.
IMPORTANT:
This is NOT a posed selfie.
This is NOT a planned photo.
This is NOT a meet-and-greet.
This is a spontaneous moment captured in the middle of chaos.
The fan has successfully reached the football player on the pitch and quickly raises the phone to capture a selfie before security removes him.
The image should feel accidental, rushed and authentic.
CAMERA:
Front-facing smartphone camera.
Vertical smartphone photo.
Natural wide-angle selfie lens.
Handheld camera shake.
Motion blur from sudden movement.
Slight framing imperfections.
Slight tilt.
Real smartphone image quality.
The photo should look like it was captured within one second during a chaotic moment.
SUBJECTS:
The fan from Reference Image 1 occupies the foreground and is holding the phone.
The fan's face must closely match Reference Image 1.
The fan has a serious, focused, composed expression.
Expression should match the emotional tone of Reference Image 1.
NO smile.
NO grin.
NO teeth visible.
NO surprised expression.
NO exaggerated excitement.
NO laughing.
NO cheering expression.
NO wide-open eyes.
Mouth closed.
Eyes naturally focused on the phone screen.
Natural facial tension caused by the fast-moving situation.
The fan appears concentrated on capturing the selfie before being removed by security.
The expression feels candid, authentic and unposed.
The football player from Reference Image 2 is standing beside the fan.
The football player's face must closely match Reference Image 2.
The football player appears naturally caught in the moment.
Neutral or very subtle natural expression.
No exaggerated smile.
No professional pose.
No staged interaction.
The football player appears relaxed and realistic, as if unexpectedly included in the selfie.
ACTION:
A security guard is grabbing the fan from behind while attempting to remove him from the pitch.
Another security staff member is rushing toward them.
The fan is partially being pulled away while taking the selfie.
The fan's body is slightly off balance.
The football player remains clearly visible in frame.
The scene feels frozen in the middle of action.
The moment should feel genuine, chaotic and completely unplanned.
BACKGROUND:
Inside a packed football stadium.
Match has just ended.
Crowd visible in the stands.
Floodlights illuminating the stadium.
Players walking around the pitch.
Referees.
Photographers.
Broadcast camera operators.
Security personnel running toward the incident.
Championship atmosphere.
Post-match energy.
The background should feel busy and alive.
IMAGE CHARACTERISTICS:
Strong motion energy.
Natural motion blur.
Slight subject blur.
Slight camera shake.
Authentic smartphone dynamic range.
Natural stadium lighting.
Sports journalism aesthetic.
Viral social media moment.
Looks like an image that would immediately spread across Instagram, X, Reddit and sports news pages.
EMOTION:
Adrenaline.
Urgency.
Focus.
Chaos.
Determination.
Post-match championship atmosphere.
Rare once-in-a-lifetime fan moment.
The fan's emotion comes from intense concentration and urgency rather than excitement or surprise.
The final image should look like a real viral football selfie captured seconds before security removes the fan from the field.
Ultra photorealistic.
Authentic smartphone photography.
No AI-art look.
No CGI.
No face swap effect.
No pasted faces.
No professional photoshoot appearance.
No cinematic movie lighting.
No staged posing.
No watermark.
No logo.
No text.
Extremely realistic.
Indistinguishable from a genuine viral sports photograph.
Paste the full prompt unchanged
Confirm reference order in your platform
Generate & run a few passes
Export the final viral still
Pro Tips
PUSH IT FURTHER
🔢
Reference Order
DON'T SWAP THE INPUTS
The prompt is explicit: Reference 1 = fan, Reference 2 = player. If you upload them in the wrong order, the model maps your face onto the player and vice versa, which is a fast way to get an unintentionally weird result. Many platforms label the slots (image_1, image_2 or similar). Double-check the slot order before generating, especially if your platform's UI hides the labels behind hover states.
😐
Expression Lock
KEEP THE NO-SMILE LIST
That long block of "NO smile, NO grin, NO teeth visible, NO surprised expression" is the main reason this looks like a real viral moment instead of a posed photo. Image models tend to default to smiling subjects unless told not to, so don't trim those lines. If a render comes back with a grin or an excited expression, regenerate and re-emphasize the no-smile clause; it's doing more work than it looks like.
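If you keep the prompt in a file and tweak it between runs, a quick check can catch an accidentally trimmed expression line before you generate. This is a minimal sketch: the `EXPRESSION_LOCK` list is copied from the fan's SUBJECTS block in the prompt, and `missing_expression_lines` is a hypothetical helper name, not part of any tool.

```python
# Lines copied from the fan's SUBJECTS block of the prompt.
EXPRESSION_LOCK = [
    "NO smile.",
    "NO grin.",
    "NO teeth visible.",
    "NO surprised expression.",
    "NO exaggerated excitement.",
    "NO laughing.",
    "NO cheering expression.",
    "NO wide-open eyes.",
]

def missing_expression_lines(prompt: str) -> list:
    """Return the expression-lock lines that are absent from the prompt."""
    present = {line.strip() for line in prompt.splitlines()}
    return [line for line in EXPRESSION_LOCK if line not in present]

# Example: a prompt that was trimmed down to two of the eight lines.
trimmed_prompt = "NO smile.\nNO grin.\nMouth closed."
for line in missing_expression_lines(trimmed_prompt):
    print("Missing:", line)
```

The match is exact and case-sensitive on purpose, so the player's softer "No exaggerated smile." line does not count as the fan's "NO smile." line.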
📱
Smartphone Look
EMBRACE THE BLUR
The slight motion blur, camera shake, and framing imperfections are intentional — they sell the "caught in chaos" feel. If a render comes back too sharp, too clean, too "photoshoot", re-emphasize the smartphone camera, handheld shake, and motion blur lines. A perfectly composed crisp result actually hurts the authenticity. Real viral selfies are imperfect.
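One way to re-emphasize those lines is to repeat them in a short reminder block at the end of the prompt. That approach is an assumption, not something the prompt or the model prescribes, and it may or may not change the result on a given run. The sketch below uses a hypothetical `reemphasize` helper with lines copied from the CAMERA block; the `base_prompt` string is a stand-in for the full prompt.

```python
# Lines copied from the CAMERA block of the prompt.
SMARTPHONE_LOOK = [
    "Front-facing smartphone camera.",
    "Handheld camera shake.",
    "Motion blur from sudden movement.",
]

def reemphasize(prompt: str, lines: list) -> str:
    """Append a short reminder block that repeats the given lines."""
    reminder = "IMPORTANT (repeat):\n" + "\n".join(lines)
    return prompt.rstrip() + "\n" + reminder

# Stand-in for the full prompt text.
base_prompt = "Vertical smartphone photo.\nNo watermark."
print(reemphasize(base_prompt, SMARTPHONE_LOOK))
```

The original prompt text is left untouched and the reminder is only appended, so the unmodified prompt stays available if the repeated lines make the render too blurry.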
🏟️
Swap The Player
PICK YOUR LEGEND
The prompt is generic on purpose — "the football player from Reference Image 2." Drop in whichever player you'd want to bump into on the pitch, swap the World Cup 2026 line to a different tournament if you want, and the rest of the scene (chaos, security, photographers, floodlights) re-stages around them. Same template, infinite once-in-a-lifetime moments.
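Swapping the tournament comes down to a single string replacement on the scene-setting line. A minimal sketch, assuming you keep the prompt as a Python string: `swap_tournament` is a hypothetical helper, and the replacement event name is just an example.

```python
# The scene-setting line, copied from the prompt.
SCENE_LINE = (
    "Create a genuine viral smartphone selfie captured during a chaotic "
    "football pitch invasion immediately after a FIFA World Cup 2026 match."
)
DEFAULT_EVENT = "FIFA World Cup 2026"

def swap_tournament(prompt: str, new_event: str) -> str:
    """Replace the default tournament name, failing loudly if it is absent."""
    if DEFAULT_EVENT not in prompt:
        raise ValueError(f"'{DEFAULT_EVENT}' not found; the prompt may already be edited.")
    return prompt.replace(DEFAULT_EVENT, new_event)

# Example event name only; use whichever tournament you want.
print(swap_tournament(SCENE_LINE, "UEFA Champions League"))
```

Raising an error when the default name is missing avoids silently sending an unchanged prompt after an earlier edit.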