Midjourney V6. Part 1 | In-depth Guide | Andrei Kovalev's Midlibrary

/midlibrary

All styles

Masters Of Midjourney

my Library

exit

my Library

Do you know this feeling when you look at a picture and immediately know—it's Midjourney? V6 is yet another step towards the time when this feeling will become rare.

Copied!

Tim Walker's haute-couture frontal portrait of clear white ethereal android with translucent skin drowning in sea of wires. Red and cyan hues, glowing highlights, dark shadows --v 6.0

Copied!

Tim Walker's haute-couture frontal portrait of clear white ethereal android with translucent skin drowning in sea of wires. Red and cyan hues, glowing highlights, dark shadows --v 5.2

V5.2

In Part 1 of this study, we look in-depth at Midjourney’s newly released model, its strengths, weaknesses, and key changes from V5.2.

Copied!

MRI scan of robot samurai --v 6.0

Copied!

MRI scan of robot samurai --v 5.2

V5.2

Copied!

Japanese fox god of winter death and rebirth --v 6.0

Copied!

Japanese fox god of winter death and rebirth --v 5.2

V5.2

Copied!

intricate raven god --v 6.0

Copied!

intricate raven god --v 5.2

V5.2

Quick Facts About V6

The current V6 is an Alpha test; thus, things may change.

V6 is more accurate in following a prompt (and is better with longer prompts).

The new model is more coherent.

It has improved image-prompting capabilities.

There are two new upscalers, with Subtle and Creative modes (both increase resolution by 2×).

There is an “unopinionated” --style raw mode (similar to V5.2).

You can add text to your images now!

Updated: the Pan, Zoom, Vary (Region) work now (and the in-depth guide on those is coming soon!), while /tune, and the new V6 /describe aren't available. But they are announced to arrive in the nearest weeks.

But apart from these lines—what is V6?

Hyperrealism

Every new version of Midjourney comes with a boost in photorealism, and V6 is no exception. In fact, the keyword that defines the new model best is Hyperrealism.

Copied!

Rinko Kawauchi's photographic portrait of girl space pilot --v 6.0 --style raw

Copied!

Rinko Kawauchi's photographic portrait of girl space pilot --v 5.2 --style raw

V5.2

To summarize briefly, the level of photorealism in V6, especially in photographic styles, is mind-bending.

Copied!

steampunk traveler monk in desert by Lynsey Addario --v 6.0

Copied!

steampunk traveler monk in desert by Lynsey Addario --v 5.2

V5.2

Copied!

Brett Walker's photographic portrait of Daniel Defoe --v 6.0 --style raw

Copied!

Brett Walker's photographic portrait of Daniel Defoe --v 5.2 --style raw

V5.2

Copied!

Kourtney Roy's photographic portrait of old seafarer --v 6.0

Copied!

Kourtney Roy's photographic portrait of old seafarer --v 5.2

V5.2

What makes photorealistic images in Midjourney V6 look so amazing are the imperfections: lens aberrations, intentionally over-highlighted areas, accidental out-of-focus elements, and various film effects (which we will dive into in the 'Details' chapter).

Copied!

Serge Lutens's photographic portrait of young pirate-queen --v 6.0 --style raw

Copied!

Serge Lutens's photographic portrait of young pirate-queen --v 5.2 --style raw

V5.2

Copied!

Mitsuo Katsui's photograph of Hatsune Miku --stylize 175 --v 6.0

Copied!

Mitsuo Katsui's photograph of Hatsune Miku --stylize 175 --v 5.2

V5.2

Copied!

last samurai. Cinematic portrait by Wong Kar-Wai --v 6.0 --style raw

Copied!

last samurai. Cinematic portrait by Wong Kar-Wai --v 5.2 --style raw

V5.2

And, of course, it's not just about portraits...

Copied!

gigantic floating fortress in Northern Sea port. Photograph by Allan Sekula --v 6.0

Copied!

gigantic floating fortress in Northern Sea port. Photograph by Allan Sekula --v 5.2

V5.2

Copied!

time-lapse photography over Tbilisi --stylize 175 --v 6.0

Copied!

time-lapse photography over Tbilisi --stylize 175 --v 5.2

V5.2

Copied!

top view haute-couture advertising food photograph for Japanese restaurant with molecular cuisine. Minimal composition, cyan and red hues --v 6.0

Copied!

top view haute-couture advertising food photograph for Japanese restaurant with molecular cuisine. Minimal composition, cyan and red hues --v 5.2

V5.2

In some cases, however, where a more subtle look-and-feel would be preferable, V6's hyperrealism—with a tendency to oversharp things—may be considered an overkill.

Copied!

16th century knight. Official portrait by Richard Mosse --v 6.0

Copied!

16th century knight. Official portrait by Richard Mosse --v 5.2

V5.2

Copied!

Martin Schoeller's close-up portrait of frontman of death-metal band --v 6.0

Copied!

Martin Schoeller's close-up portrait of frontman of death-metal band --v 5.2

V5.2

Copied!

Ara Guler's portrait of 1970s Istanbul casino gambler --v 6.0

V5.2

Copied!

Ara Guler's portrait of 1970s Istanbul casino gambler --v 5.2

V6 indeed marks a significant milestone in the evolution of ultra-realistic AI art. The Midjourney team has—once again!—surpassed expectations, blurring the line between real photographs and AI-generated imagery like never before. Which is both thrilling and a bit unsettling. (-’๏_๏’-)

Detail Insanity

Every time before the release of Midjourney’s next model, I hold my breath for what they will do with details.

Copied!

Rinko Kawauchi's photographic portrait of girl space pilot --v 6.0 --style raw

Copied!

Rinko Kawauchi's photographic portrait of girl space pilot --v 5.2 --style raw

V5.2

At the risk of repeating myself: the level of details in V6 is, for the lack of a better word, insane. And the new upscalers take it even further (more on them in Part 4 of this deep dive).

Copied!

Carlo Crivelli's painting depicting red knight --v 6.0 --style raw

Copied!

Carlo Crivelli's painting depicting red knight --v 5.2 --style raw

V5.2

Copied!

Leon Bakst's illustration for Darth Vader's ballet sceni ostume --v 6.0

Copied!

Leon Bakst's illustration for Darth Vader's ballet scenic costume --v 5.2

V5.2

Copied!

Ivan Bilibin's painting depicting lord of winter riding white wolf --stylize 275 --v 6.0

V5.2

Copied!

Ivan Bilibin's painting depicting lord of winter riding white wolf --stylize 275 --v 5.2

But while the intricacy of the images went up steadily with each new model, some things were lost along the way, namely, the textures. The refinement of V4 dialed up tenfold in V5+ made it almost impossible to achieve effects like film grain or true brushstrokes, for instance.

Copied!

street photograph by Miroslav Tichy --v 6.0 --style raw

Copied!

street photograph by Miroslav Tichy --v 5.2 --style raw

V5.2

Copied!

broad brushstrokes painted portrait of Troll Hunter in Dutch Golden Age style --v 6.0

Copied!

broad brushstrokes painted portrait of Troll Hunter in Dutch Golden Age style --v 5.2

V5.2

Copied!

pixelated screenshot of 1980s PC game. Underworld level --v 6.0

V5.2

Copied!

pixelated screenshot of 1980s PC game. Underworld level --v 5.2

And while true grain still seems out of reach, V6 is a definite step towards that lost rawness.The textures are amazing, and the new model shines against V5, where “non-refined” visual styles are required.

Copied!

cross processing print of flowers in wind --v 6.0

V5.2

Copied!

cross processing print of flowers in wind --v 5.2

Copied!

dreamy double-exposure portrait of Amelia Erhart. Planes overlay --stylize 55 --v 6.0

V5.2

Copied!

dreamy double-exposure portrait of Amelia Erhart. Planes overlay --stylize 55 --v 5.2

Copied!

dreamy girl by Marianna Rothen --v 6.0 --style raw

V5.2

Copied!

dreamy girl by Marianna Rothen --v 5.2 --style raw

Shifted Composition

Another concept that describes V6 well is unconventional composition, a significant shift from the more structured and balanced approach of V5.2.

Copied!

interdimensional arcane beast by Tyrus Wong --stylize 175 --v 6.0 --style raw

Copied!

interdimensional arcane beast by Tyrus Wong --stylize 175 --v 5.2 --style raw

V5.2

Whereas previous models seek geometrical perfection, golden ratio, and central subjects, V6 strives for asymmetry and often moves its main subjects away from the middle of the frame.

Copied!

Runner. Motion blur movement --v 6.0 --style raw

V5.2

Copied!

Runner. Motion blur movement --v 5.2 --style raw

Copied!

Black Panther by Jamel Shabazz --v 6.0

V5.2

Copied!

Black Panther by Jamel Shabazz --v 5.2

Copied!

lonely post-apocalyptic ranger by Corey Arnold --v 6.0

V5.2

Copied!

lonely post-apocalyptic ranger by Corey Arnold --v 5.2

This may offer more dynamic and engaging visuals, but it also requires adapting prompts accordingly. Even a small adjustment can bring back both a central composition and symmetry.

Copied!

Margaret Bourke-White's photograph of enormous Doomsday device in clouds. Colossal scale --v 6.0

V5.2

Copied!

Margaret Bourke-White's photograph of enormous Doomsday device in clouds. Colossal scale --v 5.2

Copied!

Margaret Bourke-White's photograph of enormous Doomsday megastructure floating in clouds. Colossal scale. Central composition --v 6.0

V5.2

Copied!

Margaret Bourke-White's photograph of enormous Doomsday megastructure floating in clouds. Colossal scale. Central composition --v 5.2

Copied!

Margaret Bourke-White's symmetrical photograph of enormous Doomsday megastructure floating in clouds. Colossal scale. Central composition --v 6.0

V5.2

Copied!

Margaret Bourke-White's symmetrical photograph of enormous Doomsday megastructure floating in clouds. Colossal scale. Central composition --v 5.2

Although, at times, V6's shifts in composition might seem unrequested and unjustified, it is encouraging to see that Midjourney experiments with unconventional compositions, and expanded the overall variability of the output. Speaking of which…

Variability

With V6, one of the first things you'll notice is the increased variability in outcomes from the same prompt.

Copied!

Silver Warlock by Jessie Willcox Smith --v 6.0

Copied!

Silver Warlock by Jessie Willcox Smith --v 5.2

V5.2

It's like the Midjourney developers cranked up the default --chaos dial. As a result, four variations from a single prompt often show more distinct differences than in earlier models.

Copied!

retrofuturistic home appliance --v 6.0

V5.2

Copied!

retrofuturistic home appliance --v 5.2

Copied!

Op-art depicting Chimera --v 6.0

V5.2

Copied!

Op-art depicting Chimera --v 5.2

Copied!

black flowers by Maria Sibylla Merian --v 6.0

V5.2

Copied!

black flowers by Maria Sibylla Merian --v 5.2

This increased diversity is especially noticeable with abstract concepts or prompts that leave space for interpretation, and is also apparent in basic prompts: by [artist name], designed to show how an artistic style works in Midjourney on its own.

Copied!

by Dieter Rams --v 6.0

V5.2

Copied!

by Dieter Rams --v 5.2

Copied!

by Annie Soudain --v 6.0

V5.2

Copied!

by Annie Soudain --v 5.2

Copied!

by Kazumasa Nagai --v 6.0

V5.2

Copied!

by Kazumasa Nagai --v 5.2

This, in many cases, leads to an artistic style representation that is more creative, more interesting, and, sometimes, more faithful towards the diversity of the real-life prototype.
‍
And on the topic of artistic styles…

Artistic Styles In Midjourney V6

Style modifiers, or simply styles, are the names or titles you can reference in your prompts to summon a specific visual flair, technique, genre, subject, or context for your image.

Copied!

Pixel art --v 6.0

Copied!

Pixel art --v 5.2

V5.2

It’s a tradition by now that every new model becomes better at knowing the source material and re-creating it in its output. However, Midjourney never ceases to surprise by how dramatic the change is.

Copied!

by Jody Bergsma --v 6.0

V5.2

Copied!

by Jody Bergsma --v 5.2

Copied!

Japanese vintage poster --v 6.0

V5.2

Copied!

Japanese vintage poster --v 5.2

Copied!

by Leonetto Cappiello --v 6.0

V5.2

Copied!

by Leonetto Cappiello --v 5.2

Let’s compare how style modifiers work in V6 vs. V5.2 using styles from our catalog and a variety of prompts.

Copied!

cutout animation scene from Peppa Pig --v 6.0

V5.2

Copied!

cutout animation scene from Peppa Pig --v 5.2

Copied!

dark fantasy photoshoot by Annie Leibovitz --v 6.0 --style raw

V5.2

Copied!

dark fantasy photoshoot by Annie Leibovitz --v 5.2 --style raw

Copied!

Moebius' illustration depicting robot portrait --v 6.0

V5.2

Copied!

Moebius' illustration depicting robot portrait --v 5.2

In certain instances, the most significant shift isn't in quality, but in the understanding of the source material. V6 appears to have a slightly different familiarity with some artists' work when compared to the same artists’ interpretations by V5+ models.

Copied!

by Erwin Olaf --v 6.0

V5.2

Copied!

by Erwin Olaf --v 5.2

Copied!

Comic-strip style --v 6.0

V5.2

Copied!

Comic-strip style --v 5.2

Copied!

by Luigi Ghirri --v 6.0

V5.2

Copied!

by Luigi Ghirri --v 5.2 --style raw

But however great the styles might be by themselves, what truly turns them from a mere interpretation of the original work to something unique is the prompt that you add to them—e.g., your creative vision, converted into text.

Text!

Remember how we fought Midjourney to make it NOT add text to our pictures? Well, now you can ADD text to your pictures deliberately. Well… to a certain extent. (-‿◦)

Copied!

hands holding newspaper with heading "Text in Midjourney- real or hoax?" text --stylize 55 --v 6.0 --style raw

Copied!

hands holding newspaper with heading "Text in Midjourney- real or hoax?" text --stylize 55 --v 5.2 --style raw

V5.2

The key word here is perseverance. You can get the (almost-)perfect result, but it will be paid in countless re-rolls and updates to your prompt.

Copied!

text "Midlibrary" written in floral-motives-font --v 6.0

V5.2

Copied!

text "Midlibrary" written in floral-motives-font --v 5.2

Copied!

text "Bond, James Bond". Elegant letters on a minimalist bold graphical 1950s James Bond movie poster --stylize 85 --v 6.0

V5.2

Copied!

text "Bond, James Bond". Elegant letters on a minimalist bold graphical 1950s James Bond movie poster --stylize 85 --v 5.2

Copied!

letters "Fhtagn!" old CGA-screen pixelized text with little pixelated Cthulhu ASCII-pictogram --stylize 75 --v 6.0

V5.2

Copied!

letters "Fhtagn!" old CGA-screen pixelized text with little pixelated Cthulhu ASCII-pictogram --stylize 75 --v 5.2

Here are a few important things to remember when working with text in Midjourney V6:

Put your text in “quotations.”

Leveraging --stylize values (lower might be better), and switching to --style raw can improve the result noticeably.

Expectedly, shorter words work better. However, with enough persistence you can make Midjourney produce longer words and even whole phrases.

Adding words like text, letters, etc. to your prompt may improve the outcome.

Sometimes, you might want to double the presence of your target text in a prompt by repeating it twice in different parts of the prompt.

To demonstrate how these reccomendations apply—a few more live examples:

Copied!

text "Fly, you fools!". Large letters "Fly, you fools!" in trash poster duotone-print collage style --stylize 75 --v 6.0

V5.2

Copied!

text "Fly, you fools!". Large letters "Fly, you fools!" in trash poster duotone-print collage style --stylize 75 --v 5.2

Copied!

text "Follow the white rabbit" graffiti in blacklight paint --stylize 55 --v 6.0

V5.2

Copied!

text "Follow the white rabbit" graffiti in blacklight paint --stylize 55 --v 5.2

Copied!

text "2049". Cyberpunk nixie-tubes font number glowing in the dark --stylize 50 --v 6.0 --style raw

V5.2

Copied!

text "2049". Cyberpunk nixie-tubes font number glowing in the dark --stylize 50 --v 5.2 --style raw

Another key element influencing the outcome in V6 is the Aspect Ratio, or --ar, of your image.

Here is how Midjourney tried to squeeze letters into a vertical format in my numerous attempts.

Copied!

noodle-letters. Text "Ramen" made of noodles --v 6.0

V5.2

Copied!

noodle-letters. Text "Ramen" made of noodles --v 5.2

Copied!

noodle-letters. Text "Ramen" made of noodles --stylize 75 --v 6.0

V5.2

Copied!

noodle-letters. Text "Ramen" made of noodles --stylize 75 --v 5.2

Copied!

noodle-letters. Text "Ramen" made of noodles --stylize 35 --v 6.0

V5.2

Copied!

noodle-letters. Text "Ramen" made of noodles --stylize 35 --v 5.2

Yet, the moment I allowed for more horizontal space, the model promptly delivered a more accurate result:

Copied!

noodle-letters. Text "Ramen" made of noodles --stylize 35 --v 6.0 --style raw

Copied!

noodle-letters. Text "Ramen" made of noodles --stylize 35 --v 5.2 --style raw

V5.2

V6 Raw Mode

Raw is an alternative—“unopinionated”—model of Midjourney, existing for both, V6 and V5.2. It activates if you add --style raw to your prompt, and creates an image with ”less automatic beautification applied, which can result in a more accurate match when prompting for specific styles.“

Copied!

anaglyph --v 6

Default

Copied!

anaglyph --v 6.0 --style raw

Raw

The Raw model is more literal, and thus less “creative.” It also tends to lean more towards existing titles when the prompt is ambiguous, i.e., offers multiple interpretations. Like in cases with destiny (also a game), perfect circle (also a music band), and David (also a statue (-‿◦)).

Copied!

perfect circle --v 6.0

Copied!

perfect circle --v 6.0 --style raw

Copied!

destiny --v 6.0

Copied!

destiny --v 6.0 --style raw

Copied!

David --v 6.0

Copied!

David --v 6.0 --style raw

Whereas the default outcome is varied and presents different options, the raw results are consistently focused on reproducing the titles behind the prompts (the game, the music band's frontman, and the statue).

One other difference between the two versions is that the default V6 offers more variability in its results for each prompt, whereas the Raw model’s options are much closer to each other.

Copied!

imagination --v 6.0

Copied!

imagination --v 6.0 --style raw

Copied!

film --v 6.0

Copied!

film --v 6.0 --style raw

Copied!

cube --v 6.0

Copied!

cube --v 6.0 --style raw

And what about more-than-one-word prompts? How does switching from the default to the raw model affect the outcome then?

Copied!

colored bas-relief depicting alien flower growing though concrete brutalist buildings --v 6.0

Copied!

colored bas-relief depicting alien flower growing though concrete brutalist buildings --v 6.0 --style raw

Copied!

elaborate electromagnetic quantum collider mechanism. Inside details, technical labels and markings, complex electronics, mechanics, engineering schematics overlay --v 6.0

Copied!

elaborate electromagnetic quantum collider mechanism. Inside details, technical labels, markings, complex electronics, mechanics, engineering schematics overlay --v 6.0 --style raw

Copied!

weather map --v 6.0

Copied!

weather map --v 6.0 --style raw

Finally, let's experiment with how styles/style modifiers work in both modes.

Copied!

ethereal star beast. Mike Mignola's illustration --v 6.0

Copied!

ethereal star beast. Mike Mignola's illustration --v 6.0 --style raw

Copied!

David Uhl's painting depicting Amelia Earhart --v 6.0

Copied!

David Uhl's painting depicting Amelia Earhart --v 6.0 --style raw

Copied!

Eugenio Recuenco's photograph of Thom Yorke --v 6.0

Copied!

Eugenio Recuenco's photograph of Thom Yorke --v 6.0 --style raw

In conclusion, the --style raw parameter in Midjourney is an effective tool for diversifying results, strengthening styles that imply photorealism, or pushing the AI towards a more literal interpretation of your prompt.

In Part 2 of this exploration we will learn to talk to the new Midjourney, see what prompting strategies work best, and how complex our prompts can get. We will also test the new Image Prompting capabilities, observe how Image Weight affects the outcome, and how Blending works in V6. Continue reading →

In-depth Guide to Midjourney V6

/discuss

this Guide

If you like our Guides, you can help us maintain and expand Midlibrary and produce more regular educational content of higher quality. And keep it free for all!

Support Midlibrary on Patreon! →

Subscribe to Newsletter Suggest a style

Report a bug

Email us

All samples are produced by Midlibrary team using Midjourney AI (if not stated otherwise). Naturally, they are not representative of real artists' works/real-world prototypes.

Support Midlibrary on

Patreon →

Ver. 2.9.1
♡

All content in the Midlibrary catalog is generated by the Midlibrary team using Midjourney AI. We do not feature real artists' images, artworks, or any copyrighted material in our catalog. The samples provided by Midlibrary are intended for educational and illustrative purposes only and are not representative of real artists' works or real-world prototypes. Midlibrary is a non-profit initiative, not affiliated with real artists or authors, aiming to educate and inspire through the demonstration of the technology's potential in creative explorations.

I understand, don't show this again

Encountered a bug?

We do our best to keep this website running as smoothly as possible. However, stuff happens, and we thank you for letting us know!

Thank you!

Midlibrary Groundskeeper has been notified.

✕ Close

Something went wrong while submitting the form. Please, check if you filled all fields.
We're here to help! If you're unable to resolve the issue, please, contact us.

Subscribe to Midlibrary Newsletter

We regularly publish new Midjourney Guides, compile new Style Tops, update the website, and have fun! Want to be the first to get Midlibrary news? Subscribe to our newsletter and never miss a thing!

Thank you for subscribing!

Please, expect emails from [email protected]. If you're not receiving our newsletter for a long enough time, please, check your Spam folder.

✕ Close

Something went wrong... Please, check if you filled all required fields.
If you're unable to resolve the issue, please, contact us.

Personal Libraries are available to our Patreon Community

Learn more about the benefits of supporting us by becoming Midlibrary Patron—and start your Personal Library ↗︎

You have just become a Patron, and cannot log in?

Please, allow our team some time (usually not more than 24 hours) to set up your Personal Library.

You may be using different emails for your Patreon and Discord accounts. If that is the case, please, send your Discord email to [email protected].

If the issue perists, or you didn't get a response to your email, please, inform us via Bug Report form

✕ Close

We are currently updating the Personal Libraires' infrastructure

In the nearest future, it will allow you to access your Collections much quicker, add covers to them, tag the styles you save to quickly find them, and—most importantly—save your --sref (numerical) styles!

However, at the moment, logging in to your Library is unavailable. We apologize for the inconvenience. If you are a Midlibrary Patron, please, check this Patreon post ↗︎ for Personal Libraries status updates.

To start creating Collections and save favorite styles:

Learn more about Personal Style Libraries, saving favorite styles, and organizing them into Collections.

Learn more about supporting Midlibrary and the benefits of joining our Patreon community →

✕ Close