An investigation into whether Midjourney or Flux is the best AI image generator.
At Indie Hackers, we use Midjourney to generate all of our cover images, and it's been working out pretty well. But Midjourney now has some competition from Flux, the new AI image generator from Black Forest Labs.
Judging by the examples they’ve been putting out, it looks really good.
But, is it better than Midjourney?
That’s what I’ll be finding out in this article by using both models to generate some cover images.
My first muse today is the beautiful Mubashar Iqbal (aka Mubs), who just gave me a great interview giving his advice for new makers. For Mubs, the background image we will be using is this:
Let’s start with Midjourney.
For those who don’t know, Midjourney is a Discord bot. So the way you use it is by joining their Discord server, typing /imagine, and then putting in your prompt.
The prompt we use to generate cover images for interviews is:
Cover image for an article about a startup founder with a background of ___, URL_TO_ORIGINAL_PHOTO, blind contour illustration, background color ___. --ar 16:9
So for Mubs, the prompt is:
Cover image for an article about a startup founder with a background of https://s.mj.run/xlVJtCzMT7A, blind contour illustration, background color maroon. --ar 16:9
That returns these four options:
I really like version 3 (bottom left corner) because it actually looks like him and I’m digging the background, so let’s do some strong variations on that and see what we get:
These aren’t great because Mubs doesn’t wear glasses, so let’s try to tone it down with some subtle variations instead:
These all look pretty similar, but I like #4 (bottom right corner) the best because I think the beard and hair are most like Mubs’s. So, here’s the final output from Midjourney:
Now, let’s try Flux.
To do so, I’m using Flux Pro via Replicate, a front-end that lets me use Flux’s best model for $0.055 an image. However, if you prefer to access it directly via the API, you can do so here.
So, let’s try out the same prompt and see what we get:
That’s not great. Let’s see what happens if we get rid of the glasses, increase the number of diffusion steps to 35, and decrease the interval number (which makes it adhere to the prompt more) to 1.5:
That’s definitely better, but I still can’t get rid of the glasses. Let’s try raising the guidance number to 4, which should make it stick to the prompt even more:
MORE GLASSES! UGH! Let’s change the prompt to include “don’t under any circumstance put glasses on him” and set the guidance number to 5 and the interval number to 1, the maximum amounts allowed:
Ok, I give up. Flux refuses to take off Mubs’s glasses. So, score the interview article point for Midjourney.
Now, let’s try a more abstract cover image.
For example, what if we had an article on AI agents? In this case, our prompt for Midjourney would be cover image of an AI agent. blind contour illustration, background color light blue. --ar 3:2. That gives us:
#1 (top left) is really good, so in just one prompt Midjourney gave us a really good abstract image.
When we run the same prompt in Flux, we get:
Not gonna lie; that thing is pretty cool. If I give it a bit more creative freedom by lowering the guidance number to 2.5 and increasing the interval number to 2.5, we get:
Sheeesh. That thing looks insane, but it’s impossible to say if it’s “better” than Midjourney. Midjourney’s is more abstract, while Flux’s looks like something straight out of Ex Machina. It just depends on which one is more your style.
In any case, it’s amazing that just a few years ago, this was considered state-of-the-art AI image generation:
Now, the same prompt of “nursing school leader” generates this:
And with the amount of brainpower and money now in AI, things will only get better from here.
This comparison between Midjourney and Flux is fascinating! I've been using Midjourney for my own projects, and while it's incredibly powerful for creating abstract and stylized images, I’ve encountered similar challenges when trying to get specific details right, like avoiding certain features.
Flux sounds promising, especially for those who want more creative control, but it’s interesting to see that even with fine-tuning, certain quirks like the persistent glasses remain. It highlights how each tool has its strengths and quirks, and choosing between them might come down to the type of project or visual style you’re aiming for.
Looking forward to seeing how both evolve—AI image generation is advancing so quickly, and it’s exciting to be experimenting with these tools in real time!
100%! No AI image generator is perfect yet, but with all of the good options out there now, you can usually find one to do the job right.
However, from my point of view, there are also the layers that are added to the base model, as Krut AI is currently doing. They are also able to generate more realistic images you can check on our page.
Very cool!
I think Flux is mindblowing, but I cannot help but feel fond of Midjourney because of its abstract style, really useful for creating images that are supposed to look like art projects.
Flux is absolutely a game-changer in the fight for hyper-detailed realism, it's beating MJ.
But in my POV, there are also the layers you add to the base model like Krut AI is doing currently. They are also able to generate more realistic images.
Buy authentic documents online Driver’s License, Passport, ID card and many more. Contact us on WhatsApp +32 466 90 20 51
Flux blows MJ out of the water. Not only because of the quality and the realism, but because it's open-source.
A good point.
Thank you for this great article!
I think one should also mention that the team behind Flux, i.e. the startup’s co-founders ( Robin Rombach, Patrick Esser, and Andreas Blattmann) were formerly researchers who helped create Stability AI’s Stable Diffusion models. So they clearly know what they're doing, but comparing Flux already with Midjourney who went through so many development cycles might be a bit too early.
That's a great point! Still think it's fair to compare Flux to Midjourney though. At the end of the day, both products are on the market.
Thats Cool
Thanks man!