Midjourney Prompt Structure: From Basic to Advanced in 5 Minutes

1987 Ferrari F40
Written by
Johnny Cache
Published on
August 9, 2023

How Midjourney Interprets A Prompt

When you type your prompt into Midjourney it breaks down the prompt into tokens or a series of ideas. For example, the prompt "a photorealistic portrait of a cat" would be interpreted as "photorealistic", "portrait", and "cat."

Then it gets transformed into a mathematical representation and fed into a machine learning model that creates the image.

Midjourney tends to place more emphasis on the tokens at the beginning of a prompt. The longer the prompt, the less emphasis will be placed on each token. Starting with a simple 3-5 word prompt (or less) is a great way to build a foundation for your final prompt.

Additive Prompt Structure

Nick St. Pierre has created a technique to structure your prompt that he calls "Additive Prompting". The idea is that you start with your basic idea "photorealistic portrait of a cat", then slowly add more tokens or details once Midjourney has given you an output resembling what you expect to see. Below is the order of the separate parts or types of tokens that can be used to transform your prompt from simple to advanced:

  1. General Style (street style, editorial style, food photography, 1990s Punk style)
  2. Composition (Off-center closeup, medium-full side angle, full body shot, two-shot)
  3. Medium (photo, film still, illustration, sketch, sculpture)
  4. Film Type (Kodak Gold, Agfa Vista, Kodachrome)
  5. Subject Description (a woman walking, a man talking, a dog running)
  6. Subject Styling (fashion descriptions, subject details, etc)
  7. Environment (New Hork, Ancient Egypt, Underwater)
  8. Lighting (natural lighting, studio lighting, off-camera flash, overcast)
  9. Atmosphere (misty, smokey, steamy, foggy)
  10. Mood (dreamy, elegant, spooky, romantic, luxurious)

Before we hop into the tutorial below, it's important to note that this technique is not the only way to structure your prompt. Sometimes the order doesn't matter and you get the result you want, but this structure serves as a foundation and you can rest assured that using this technique will get you very close to the outcome you're looking for.

The Joker Tutorial

We're big fans of The Joker and all things Batman, so we decided to start with a basic prompt of Joaquin Phoenix and transform him into The Joker without using "The Joker" in the prompt. Using a specific artist or aesthetic gets you close to the endpoint faster, but Additive Prompting gives you much more control over each element in the piece. Let's take a look at how this works.

Step 1: General Style

street style photo of joaquin phoenix

Our starting point is a basic prompt that gives us a street style photo of Joaquin Phoenix. We wanted to start as simple as possible, almost as if we are taking him from the street to the movie set where we then start applying all the details that make him The Joker.

Step 2: Composition

street style photo of a joaquin phoenix, medium shot

Next, we apply "medium shot" to the prompt because we want to see him from the waist up. Here are a few other shots we could have used.

Step 3: Medium

street style photo of a joaquin phoenix, medium shot, film still

Then we apply "film still" to the prompt so that the image has a more cinematic feel to it.

Step 4: Film Type

street style photo of a joaquin phoenix, medium shot, film still, Kodak Gold

Next, we add "Kodak Gold" to the prompt to give it a more vintage look. Kodak Gold is great for portraits because it adds warm color and has medium contrast properties.

Step 5: Subject Description

street style photo of a joaquin phoenix, medium shot, film still, Kodak Gold, walking

Then, we add "walking" to the prompt so that there is a sense of motion. You could use "dancing" to get an output that resembles the movie trailer scenes. Scroll all the way to the last step to see our final version where we used "dancing" and added "--ar 16:9" to give the image a more cinematic composition.

Step 6: Subject Styling

street style photo of a joaquin phoenix, medium shot, film still, Kodak Gold, walking, red wool suit

Now comes the fun part. Step 6 is two-part step because we applied the styling slowly. For this first part we simply added "red wool suit".

Step 6: Subject Styling (Part Two)

street style photo of joaquin phoenix, medium shot, film still, Kodak Gold, walking, red wool suit, dark green slicked-back hair, clown makeup

Then, we added "dark green slicked-back hair, clown makeup" to the prompt. We could have applied those separately but we got excited.

Step 7: Environment

street style photo of joaquin phoenix, medium shot, film still, Kodak Gold, walking, red wool suit, dark green slicked-back hair, clown makeup, 1970s new york city

In this step we added "1970s new york city". The environment doesn't change much, but his suit definitely does! The step above looks much more modern.

Step 8: Lighting

street style photo of joaquin phoenix, medium shot, film still, Kodak Gold, walking, red wool suit, dark green slicked-back hair, clown makeup, 1970s new york city, overcast

This is a subtle step where we added "overcast" and the image became a bit more gray. The suit is much less vibrant.

Step 9: Atmosphere

street style photo of joaquin phoenix, medium shot, film still, Kodak Gold, walking, red wool suit, dark green slicked-back hair, clown makeup, 1970s new york city, overcast, foggy

Next, we add "foggy" to the prompt. This creates a much more dramatic scene and the makeup is looking better somehow.

Step 10: Mood

street style photo of joaquin phoenix, medium shot, film still, Kodak Gold, dancing, red wool suit, dark green slicked-back hair, clown makeup, 1970s new york city, overcast, foggy, depressing --ar 16:9

Finally, we add "depressing", replaced "walking" with "dancing" and added "--ar 16:9" so that we could try and replicate a scene from the movie trailer. We're really happy with how it turned out.

What do you think?

Weekly newsletter (soon)
No spam. Just the latest releases and tips, interesting articles, and exclusive interviews in your inbox every week.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Subscribe

Related Prompts

street style photo of joaquin phoenix, medium shot, film still, Kodak Gold, dancing, red wool suit, dark green slicked-back hair, clown makeup, 1970s new york city, overcast, foggy, depressing --ar 16:9
cinematic shot of jesse james customs lowrider, matte black paint job, highly detailed, hyperrealism, low angle shot, hd, 8k --ar 7:4
cinematic shot of baywatch, pamela anderson running towards the camera, hyper realism, highly detailed, hd, 8k --ar 3:2
cinematic shot of tim kennedy mma fighter, octagon, action shot, fighting opponent, hyperrealism, hd, 8k --ar 3:2
cinematic shot from a balcony in paris overlooking the city, film noir, highly detailed, hd, 8k --ar 7:4
the mandalorian blasting droids in a gunfight, action shot, cinematic, highly detailed, hd, 8k --ar 7:4
cinematic shot of a fireman fighting a house fire, firetruck with bright lights, smoke filling the air, water soaked street, hd, 8k --ar 7:4
cinematic shot of a night out in las vegas, bright lights, attractive women, tourists, alcohol, excitement, cinematic lighting, hd, highly detailed, hyperrealism, 8k --ar 3:2
sunrise on tatooine, morning light, cinematic, hd, 8k --ar 7:4
the joker, dark knight, sitting on a train gazing out the window, new york city subway, grunge, cinematic, cinematic lighting, hd, 8k --ar 3:2
mighty mouse cartoon, cinematic shot, animation, clean lines, hd, 8k --ar 7:4
ford mustang shelby cobra 1955
call of duty, first person shooter, in-game graphics, unreal engine
cinematic landscape of tatooine, star wars landscape, photorealistic, hyperrealistic, cinematic quality, 8k
cinematic shot, casa blanca, black and white, highly detailed, 8k
futuristic architecture, highly detailed, cinematic, frank lloyd wright
the sound of silence, abstract, atmospheric