AI denoising - fact or fake?

You are currently viewing AI denoising - fact or fake?

In my test of the new in DxO Pho­to­lab 8 includ­ed AI-based denois­ing Deep­Prime XDS2, I ques­tioned, based on the incred­i­bly detailed results, how many of the result­ing details are real­ly present in the sub­ject and how many are just faked. I would like to inves­ti­gate this in more detail here.

AI denoising

Arti­fi­cial intel­li­gence (AI) has now also rev­o­lu­tion­ized image pro­cess­ing (inci­den­tal­ly, the lead image in this arti­cle was also cre­at­ed using Chat­G­PT 😉). Par­tic­u­lar­ly impres­sive for us pho­tog­ra­phers are the results of AI-based denois­ing process­es, which make it pos­si­ble to take very usable pic­tures even with pre­vi­ous­ly incred­i­bly high ISO values.

Since the avail­abil­i­ty of AI-based denois­ing meth­ods, I no longer have any prob­lems work­ing with 5-dig­it ISO val­ues. The results that can be achieved in this way still amaze me, how­ev­er. Below I show again a pic­ture of our tom­cat Tom in direct com­par­i­son of the orig­i­nal and the denoised ver­sion. As it was at that time still too dark despite ISO 12,800, I had to push the pic­ture in Light­room by one stop. So it actu­al­ly cor­re­sponds to a pho­to tak­en at ISO 25,600!

On the left you can see the orig­i­nal and on the right the ver­sion processed with DxO’s Deep­PRIME XD2s, in my opin­ion the best AI process cur­rent­ly avail­able ( enlarged by 200%):

That’s real­ly very impres­sive, isn’t it?

Fur­ther com­par­isons - also with oth­er AI denois­ing meth­ods - can be found in my arti­cle Deep­PRIME XD2s in DxO Pho­to­Lab 8. In prin­ci­ple, much of the fol­low­ing also applies to the oth­er cur­rent­ly avail­able AI-based methods.

How does it work?

In con­trast to the pre­vi­ous­ly used algo­rithm-based denois­ing mech­a­nisms in con­ven­tion­al image pro­cess­ing pro­grams, the way AI works is com­plete­ly dif­fer­ent. This involves train­ing a com­plex neur­al net­work with a vast num­ber of images.

In prin­ci­ple, you can shoot the same sub­ject with low and high ISO val­ues and train the AI with the image pairs so that it learns that the denoised high-ISO ver­sion comes clos­est to the ver­sion with the low ISO val­ue. As you can see, this works very well. The more dif­fer­ent images are avail­able for train­ing, the more com­put­ing pow­er is used for train­ing, the bet­ter the process­es become.

Nowa­days, AI denois­ing reveals details that are not vis­i­ble to the naked eye in the orig­i­nal even by the best of inten­tions. The ques­tion there­fore aris­es whether the details vis­i­ble in the processed image are real­ly there, or if the AI is just gen­er­at­ing struc­tures that it believes are appropriate.

In that sense:

This is a cat with lots of hair. I can see some of them. So I replace the noise in between with more par­al­lel hairs. But I’m not allowed to do that in the eye….

How­ev­er, this usu­al­ly seems to work very well. But is the result

Fact or fake?

Cur­rent text-based AI sys­tems are known to hal­lu­ci­nate when faced with uncer­tain­ty. If they don’t know some­thing, they invent a plau­si­ble-appear­ing answer - it’s almost human 😉. Wikipedia has an inter­est­ing and detailed arti­cle on this top­ic.

It can also be shown in image pro­cess­ing that the AI some­times only hal­lu­ci­nates. This is par­tic­u­lar­ly vis­i­ble with small geo­met­ric shapes that we know well: Let­ters. I have already noticed sev­er­al times, that the AI denois­ing mech­a­nisms have par­tic­u­lar dif­fi­cul­ties with small letters.

I cre­at­ed a real “endurance test” for this. I print­ed out a text with dif­fer­ent font sizes and took pic­tures of it with my Canon EOS R5 at ISO val­ues 100 and 51200 from a dis­tance of approx. 3m (by the way, Chat­G­PT cre­at­ed the mean­ing­less text for me 😉).

Here you can see a 400% enlarged sec­tion in the Adobe Light­room com­par­i­son - on the left tak­en with ISO 100, on the right with ISO 51200 - both with­out denoising:

While I can read the text on the left up to the third para­graph with­out any prob­lems, I even have dif­fi­cul­ty read­ing the first para­graph on the right.

But what is excit­ing is what Deep­PRIME XD2s makes of it. Here is anoth­er com­par­i­son of the ISO 100 image (left) with the ISO 51,200 image processed with Deep­PRIME XD2s (right):

Here you can see - in my opin­ion - very well what the AI does. It cre­ates con­tours around the orig­i­nal let­ters that seem plau­si­ble to it, but which have lit­tle to do with real­i­ty. At first glance, the image on the right appears much sharp­er and less noisy than before, but it still con­tains sig­nif­i­cant­ly less real infor­ma­tion than the orig­i­nal on the left.

The loss of infor­ma­tion due to denois­ing can be seen even bet­ter in a direct com­par­i­son of the denoised ver­sion (right) with the non-denoised ver­sion (left) at ISO 51200:

I per­son­al­ly find it almost eas­i­er to deci­pher the text in the unprocessed, very noisy ver­sion on the left. So in this exam­ple, the AI denois­ing only achieves the illu­sion of image enhance­ment - admit­ted­ly, with ISO 51200 this is also a very extreme example.

With the usu­al every­day motifs, how­ev­er, this loss of real detail is nor­mal­ly not notice­able. The arti­fi­cial­ly gen­er­at­ed enhance­ments with AI usu­al­ly blend in very unob­tru­sive­ly. Nev­er­the­less, some of the vis­i­ble image enhance­ments are only fic­ti­tious - in oth­er words, more fake than fact.

But there is one area where DxO Deep­PRIME XD2s actu­al­ly gets more real detail out of my RAW file: at low ISO values

Low ISO values

This can be seen if you also process the ISO 100 image (left) with Deep­PRIME XD2s (right):

In this com­par­i­son, Deep­PRIME XD2s can actu­al­ly extract more use­ful infor­ma­tion from the RAW file than Light­room. In the edit­ed ver­sion on the right, the text looks much more sharply con­toured and I can now still read the text in the fourth para­graph clear­ly. There is also sig­nif­i­cant­ly less col­or fring­ing around the letters.

Appar­ent­ly, DxO can actu­al­ly han­dle the Bay­er Matrix of the sen­sor bet­ter than Adobe in Light­room. DxO’s approach of apply­ing the AI algo­rithm direct­ly to the raw sen­sor data before the de-Bay­er algo­rithm prob­a­bly helps here. Since not all col­or infor­ma­tion is avail­able for each sen­sor pix­el, the col­or val­ues of neigh­bor­ing sen­sor pix­els are aver­aged to deter­mine the col­or of each indi­vid­ual pix­el. The DxO AI actu­al­ly seems to man­age this bet­ter than the algo­rithm inte­grat­ed in Lightroom.

This makes the use of DxO Deep­PRIME a worth­while option, even for RAW files that are not noisy, in order to get the last bit of qual­i­ty out of them.

And now?

With all the AI hype, I some­times ask myself whether this is still pho­tog­ra­phy. The term pho­tog­ra­phy is com­posed of the ancient Greek φῶς phōs, (“light”) and the ancient Greek γράφειν gráphein (“to draw”) and there­fore means “to draw with light”. A photographer’s mantra is also “It’s just the light - the light makes the pic­ture”. How­ev­er, light seems to be becom­ing less and less important.

Of course, cam­eras have always dis­tort­ed real­i­ty. Three-dimen­sion­al objects become two-dimen­sion­al. In dig­i­tal cam­eras, pix­el col­ors are inter­po­lat­ed by the col­or fil­ters of the neigh­bor­ing pix­els in the Bay­er matrix. Many algo­rithms auto­mat­i­cal­ly process the raw data from the sen­sor, and the image pro­cess­ing pro­gram does the rest. But these algo­rithms are com­pre­hen­si­bly defined and repro­ducible. With AI it is dif­fer­ent, AI is a “black box”.

I’m quite ambiva­lent about this myself. The top­ic extends far beyond denois­ing. In the post-pro­cess­ing of pho­tos, image areas can be replaced with AI based gen­er­at­ed fills, unwant­ed parts of the image can be removed, skies can be replaced and you can even cre­ate pho­to­re­al­is­tic images based sole­ly on text input.

As a self-con­fessed nerd, I have of course already tried all this out by myself and I admit - I am very impressed. But I’ve also lost some of the mag­ic and joy of the “craft of pho­tog­ra­phy”. Mas­ter­ing the tech­nol­o­gy is becom­ing less and less important.

When I think of the excit­ed feel­ing when an image slow­ly mate­ri­al­ized in the red light of the pho­to lab on the white pho­to paper in the devel­op­er bath - but I digress…

What do you think of this devel­op­ment? I am very curi­ous about comments…

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.