An idea that has been brewing for a long time: in image compression, wouldn't a processing pass that subtracts a coarse approximation of the image from the actual image allow for better compression? Is this used in any of the image compression formats in actual use? If not, why not? Why does e.g. JPEG use only tiny local windows/blocks for processing?
Lossy video compression already does this: it creates key frames (I-frames) at regular intervals, and the frames between them (P-frames) are encoded as differences from previously decoded frames. Some codecs also use B-frames, which can depend on multiple reference frames, including future ones.
E.g. a 3-second 320x240 video at 25 fps has 75 frames; with a key frame every second, frames 1, 26, and 51 are stored in full, while the in-between frames only encode the changes and end up much smaller.
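The key-frame-plus-differences scheme can be sketched in a few lines. This is a hypothetical toy (frame contents, sizes, and the single moving patch are made up for illustration), not any real codec:

```python
import numpy as np

# Toy key-frame + delta coding: store frame 0 in full, then only
# per-pixel differences for the frames that follow.
rng = np.random.default_rng(0)
frames = [rng.integers(0, 256, (240, 320)).astype(np.int16)]
for _ in range(24):                    # one second at 25 fps, minus the key frame
    nxt = frames[-1].copy()
    nxt[100:110, 100:110] += 1         # tiny change: most pixels stay identical
    frames.append(nxt)

key = frames[0]
deltas = [b - a for a, b in zip(frames, frames[1:])]

# Decoding replays the deltas on top of the key frame.
decoded = key.copy()
for d in deltas:
    decoded = decoded + d

# Each delta is almost entirely zeros (here only 100 of 76800 pixels
# change per frame), so it entropy-codes to almost nothing.
```

The deltas carry the same information as the full frames but are overwhelmingly zero, which is exactly why the intermediate frames compress so much better than the key frames.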
Now, this technique might work for still images too: have a large image that roughly approximates the original, plus a smaller residual that corrects the errors. Or, even better, multiple smaller images that correct local errors?
I suggest we create a startup immediately. Someone might already have a cool video explaining how their compression algorithm is going to change the web and the world.
Yea! To get the js-css-trickster folks on board, the first version 'obviously' works by splitting an image into its shadow and non-shadow parts offline. The result gets recombined in the browser by overlaying them with some fancy blending modes.
Some micro-benchmarking should be able to show that the two shadows ('obviously' JPEG-compressed) plus the CSS and JS put together are smaller than a plain JPEG of the input image...
Go! Go! Go! Before Hooli steals the idea... ;)
JPEG is quite old. Some of the best still-image compressors today are the I-frame coders inside video codecs. Daala, for example, uses adaptive block sizes and inter-block prediction.
Lots of compression techniques use variations of this idea. For example, all wavelet codecs (e.g. JPEG 2000) effectively do this: they store a coarse approximation of the image plus successively finer detail bands.
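One level of the wavelet decomposition behind such codecs can be shown with the simplest wavelet, Haar. A hand-rolled sketch (JPEG 2000 actually uses other filters, e.g. the 5/3 and 9/7 wavelets, so this only illustrates the principle):

```python
import numpy as np

def haar2d(img):
    """Single-level 2-D Haar split of an even-sized image into a
    half-resolution approximation plus three detail bands."""
    a = img[0::2, 0::2]; b = img[0::2, 1::2]
    c = img[1::2, 0::2]; d = img[1::2, 1::2]
    ll = (a + b + c + d) / 4          # coarse approximation
    lh = (a + b - c - d) / 4          # horizontal detail
    hl = (a - b + c - d) / 4          # vertical detail
    hh = (a - b - c + d) / 4          # diagonal detail
    return ll, lh, hl, hh

def inv_haar2d(ll, lh, hl, hh):
    """Inverse transform: perfect reconstruction of the original."""
    h, w = ll.shape
    out = np.empty((2 * h, 2 * w))
    out[0::2, 0::2] = ll + lh + hl + hh
    out[0::2, 1::2] = ll + lh - hl - hh
    out[1::2, 0::2] = ll - lh + hl - hh
    out[1::2, 1::2] = ll - lh - hl + hh
    return out
```

Recursing on the `ll` band gives the multi-resolution pyramid: exactly the "coarse image plus corrections" structure asked about in the question, with the detail bands playing the role of the residuals.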
As to why it wasn't done for JPEG and its contemporaries? Mostly the cost of hardware implementations, in both compute and memory. Think of all the cheap cameras that used JPEG: 8x8 block coding and an integer approximation of the discrete cosine transform are what made them feasible.
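The reason 8x8 blocks are so cheap is that the whole transform reduces to two fixed 8x8 matrix multiplies per block. A sketch of the 2-D DCT-II used by JPEG (floating point here for clarity; real encoders use scaled-integer versions of the same matrix):

```python
import numpy as np

N = 8
k = np.arange(N)
# Orthonormal DCT-II basis matrix: row k samples a cosine of frequency k.
C = np.sqrt(2.0 / N) * np.cos(np.pi * (2 * k[None, :] + 1) * k[:, None] / (2 * N))
C[0] /= np.sqrt(2)

block = np.arange(64, dtype=float).reshape(8, 8)  # stand-in 8x8 pixel block

coeffs = C @ block @ C.T       # forward 2-D DCT: two small matrix multiplies
restored = C.T @ coeffs @ C    # inverse: C is orthogonal, so C.T undoes it

# For smooth blocks, nearly all the energy lands in the top-left
# (low-frequency) coefficients -- which is what JPEG's quantization exploits.
```

Everything is a fixed-size, constant-memory operation, which is exactly what a late-80s ASIC or a cheap camera SoC could afford; a whole-image transform would have needed buffering the entire frame.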
The problem with this approach is that it does not correspond to the way the brain compresses/processes images. A coarse image gains a lot of extra "features" (e.g. block edges), which the brain then processes as actual features. Going to a coarser representation should instead remove features, not add them.