Summary
- 4o Image Generation is simply a important upgrade to ChatGPT.
- The images dilatory look from the apical down, conscionable similar images downloaded implicit dial-up connections.
- Waiting for images is simply a invited alteration from the instant gratification of astir modern tech.
In March this year, OpenAI released a diagnostic called 4o Image Generation. This is an update to ChatGPT's representation procreation capabilities that brings astir a fig of improvements, specified arsenic much close text, amended acquisition adherence, and improved photorealism.
The process isn't instantaneous, however. The mode that you tin ticker the images appearing successful existent clip takes maine backmost to the bully aged days of dial-up.
ChatGPT Images and the Slow Reveal
Many AI images are generated by starting with a random noise, similar the static you spot successful the intro to HBO shows. The AI exemplary past refines that noise based connected the prompt, with each iteration becoming little similar random sound and much similar the intended image. Eventually, aft capable iterations, the representation should lucifer the prompt.
This means that generating an representation takes time. With immoderate AI models, you tin ticker the process happen, seeing the representation spell from fuzzy static to a finished image. Each measurement shows the authorities of the afloat representation earlier the adjacent iteration takes place.

4o Image Generation is simply a small different, however. It volition archetypal amusement a precise blurry depiction of what the last representation would look like, but past the representation gradually clarifies. Rather than this happening to the full representation astatine once, however, it happens from the apical down.
The apical of the representation is finished first, portion the remainder remains a blur. The bound betwixt the completed and fuzzy representation dilatory moves down the representation truthful that you don't spot the completed representation until it reaches the bottom.
A Flashback to the Dial-Up Days
The archetypal clip I saw this happen, I was instantly thrown backmost 30 years to the days of dial-up internet. Back then, the fastest speeds you could get were 56 Kbps, and the world was usually overmuch slower. These speeds were truthful dilatory that downloading a 100 KB representation could easy instrumentality 30 seconds oregon more.
The mode that images downloaded implicit dial-up is precise akin to however ChatGPT's caller images appear. Each enactment of pixels would load from the apical down, meaning you would spot the apical of the representation archetypal and person to hold for the remainder of the representation to load earlier you could spot it.
Why the Slowdown?
It's not wholly wide wherefore ChatGPT's caller image-generation diagnostic uses this caller top-down method. DALL-E, the previous image-generation exemplary from OpenAI, didn't behave successful the aforesaid way.
The images generated utilizing 4o Image Generation are surely acold superior to those generated utilizing DALL-E, and producing amended images is apt to instrumentality much time. According to a tweet from OpenAI's CEO Sam Altman, it seems that a batch of ChatGPT users are utilizing the diagnostic rather heavily, to the constituent wherever the institution is considering limiting its usage temporarily. If OpenAI's GPUs are "melting" past the representation procreation is apt to instrumentality longer than it mightiness otherwise.
This would explicate wherefore the images are loading dilatory but not the mode that images are refined from the apical down. Whether this is simply a effect of the mode the images are generated oregon due to the fact that idiosyncratic astatine OpenAI truly misses the dial-up days is unclear.
There's Something to beryllium Said For Having to Wait
We unrecorded successful a satellite of instant gratification. You person entree to the sum full of each quality cognition successful your backmost pocket, and we mostly instrumentality it for granted. We ne'er truly person to hold for things anymore, but erstwhile companies similar Apple cruelly crockery retired episodes of Severance astatine a complaint of 1 a week.
I hatred the information that if I person to hold 30 seconds for a assistance oregon for the commercials to finish, my manus volition automatically beryllium reaching for my phone, to capable those seconds with immoderate mindless scrolling. I person to spell to utmost lengths to halt myself from doomscrolling astatine each disposable opportunity.

Related
10 Ways to Stop Doomscrolling connected Your iPhone
Get assistance to flight the rhythm truthful you tin spell interaction immoderate grass.
But there's thing to beryllium said for having to hold for thing good. The dilatory loading of images successful the dial-up days was frustrating, particularly if the accusation you needed (or the spot of the representation you astir wanted to see) was astatine the bottommost and was the past happening to load.
There was thing rather magical astir watching the representation look earlier your eyes, however, and I didn't recognize however overmuch I missed that until ChatGPT reminded maine of it.
Slow Generation May Not Be Around For Long
While I'm truly enjoying the acquisition of watching my images dilatory look earlier my eyes, I whitethorn not beryllium capable to bask it for long. The gait of AI developments shows nary motion of slowing down. It wasn't agelong agone that AI images were hilariously casual to observe conscionable by looking astatine the mangled hands, but existent AI-generated images are getting earnestly hard to spot.

As this exertion improves, it's apt that representation procreation volition get adjacent quicker, and the dilatory uncover volition beryllium gone forever. I program to bask it portion I can, due to the fact that you don't cognize what you've got until it's gone.