https://github.com/nagadomi/waifu2x
And other implementations thereof, e.g.:
I am not aware of any single FOSS project that performs comparably to Topaz Labs’ product across all input videos (note to the reader: if I am wrong, please correct me. I only have second-hand anecdotes from friends who do this kind of work professionally, and precious little first-hand experience beyond upscaling old family memories to experiment with the software). As far as I am aware, this is because each upscaling model is trained on, and therefore most effective for, a specific type of content. A model trained on interpolated 480p video with compression artifacts will not produce the same results as one trained on, say, anime videos or manga, e.g. the models used by Waifu2X [1]. Hence, with Topaz Labs’ application, you select the model that was trained on footage that best matches the footage you wish to upscale.
All that being said, I do know that some (if not all) of the upscaling models Topaz uses are FOSS. Much of their application is just syntactic sugar on top of the models it uses, making them easier for non-SWEs to use. I’m not sure whether the models Topaz distributes have had any additional in-house training; logically, I would assume so, since otherwise their product wouldn’t perform as well as it does.
See e.g., https://www.topazlabs.com and https://github.com/nagadomi/waifu2x
Online demo: http://waifu2x.udp.jp/
Repo: https://github.com/nagadomi/waifu2x
Reddit discussion: https://www.reddit.com/r/VRchat/comments/bkw543/tutorial_how...
There are three benefits that one could get from reducing the file size:
1. Reduced storage cost
2. Reduced bandwidth cost
3. Better user experience
In my cost models, storage matters a lot. You can't come out ahead there, however, if you still have to keep JPEG copies of all the images.
Benefits in terms of 2 are real.
Benefits in terms of 3 are hard to realize. Part of the problem is that adding more parts to the system will cause problems for somebody somewhere. For instance, you can decompress a new image format with a JavaScript polyfill, but is download+decompress really going to be faster for all users?
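To make that tradeoff concrete, here is a back-of-envelope sketch in Python. Every number in it is a made-up placeholder, not a measurement; the point is only that total cost is roughly transfer time plus decode time, so a smaller file only wins if the polyfill's decode overhead doesn't eat the bandwidth savings.

    # Rough page-load cost per image: transfer time + decode time.
    # All numbers are illustrative placeholders, not measurements.
    def total_time_ms(size_kb, bandwidth_kbps, decode_ms):
        transfer_ms = size_kb / bandwidth_kbps * 1000
        return transfer_ms + decode_ms

    # Native JPEG decode vs. a newer format decoded by a JS polyfill.
    jpeg   = total_time_ms(size_kb=300, bandwidth_kbps=2000, decode_ms=10)
    newfmt = total_time_ms(size_kb=200, bandwidth_kbps=2000, decode_ms=120)
    print(f"JPEG: {jpeg:.0f} ms, new format via polyfill: {newfmt:.0f} ms")

    # On a fast link the polyfill's decode overhead outweighs the 33% size
    # savings; on a slow link the smaller file comes out ahead.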
Another problem is that much of the source material is already overcompressed JPEG, so simply recompressing it in another format doesn't lead to a meaningfully improved experience. In my own trials, and in other people's trials that I've looked at closely, I don't see a revolutionary improvement.
A scenario I'm interested in right now is making desktop backgrounds from (often overcompressed) photos I find on the web. JPEG artifacts look like hell when those images are blown up, particularly around the sharp-cornered bokeh you get when people shoot with the kit lens. For that use I can accept a slow and expensive process to blow the image up and produce a PNG, something like
https://www.mathworks.com/help/images/jpeg-image-deblocking-...
or
https://github.com/nagadomi/waifu2x
The other approach I can imagine is some kind of maximum-entropy method that minimizes the blocking artifacts.
Edit: looking at the source further, it appears that project also includes models for photos in addition to the anime model.
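For the slow offline workflow above, a crude non-learned baseline is easy to sketch with plain OpenCV (this is not what waifu2x or the MATLAB deblocker does, just an illustration of the deblock-then-upscale shape of the pipeline; file names and parameter values are placeholders):

    # Crude deblock-then-upscale baseline using opencv-python (assumed installed).
    # Not a learned model; paths and parameter values are illustrative.
    import cv2

    img = cv2.imread("wallpaper_source.jpg")  # overcompressed JPEG from the web

    # Non-local-means denoising softens the 8x8 blocking artifacts a bit.
    # Args after the image: dst, h, hColor, templateWindowSize, searchWindowSize.
    deblocked = cv2.fastNlMeansDenoisingColored(img, None, 6, 6, 7, 21)

    # Blow it up with Lanczos resampling and keep the result lossless.
    upscaled = cv2.resize(deblocked, None, fx=2, fy=2,
                          interpolation=cv2.INTER_LANCZOS4)
    cv2.imwrite("wallpaper_2x.png", upscaled)

A trained model like waifu2x's photo model will do noticeably better; this is just the baseline you'd compare it against.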
This tech has been around for several years, and some variation was presented in concert with the Boston Marathon investigation. https://arstechnica.com/information-technology/2013/05/hallu...
(Not clear if this was used as part of the investigation, or if it could be used for future investigations)
But if they wanted to, they could probably train their neural network specifically on pixel art, similar to https://github.com/nagadomi/waifu2x
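For anyone curious what "train it on pixel art specifically" would look like in practice, here is a minimal sketch of an SRCNN-style network (the 9-1-5 conv layout from the SRCNN paper linked further down), assuming PyTorch; the dataset of pixel-art crops and its loader are hypothetical:

    # Minimal SRCNN-style upscaler sketch (PyTorch assumed available).
    # Input: a bicubically upscaled low-res crop; target: the original crop.
    import torch
    import torch.nn as nn

    class TinySRCNN(nn.Module):
        def __init__(self):
            super().__init__()
            self.body = nn.Sequential(
                nn.Conv2d(3, 64, kernel_size=9, padding=4),  # feature extraction
                nn.ReLU(inplace=True),
                nn.Conv2d(64, 32, kernel_size=1),             # non-linear mapping
                nn.ReLU(inplace=True),
                nn.Conv2d(32, 3, kernel_size=5, padding=2),   # reconstruction
            )

        def forward(self, x):
            return self.body(x)

    model = TinySRCNN()
    opt = torch.optim.Adam(model.parameters(), lr=1e-4)
    loss_fn = nn.MSELoss()

    # for lowres, highres in pixel_art_loader:  # hypothetical DataLoader
    #     opt.zero_grad()
    #     loss = loss_fn(model(lowres), highres)
    #     loss.backward()
    #     opt.step()

Specializing the training set (here, pixel art rather than photos or anime) is exactly what makes these models content-specific.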
- http://www.ifp.illinois.edu/~jyang29/papers/chap1.pdf
* Image super-resolution: Historical overview and future challenges
- http://www.robots.ox.ac.uk/~vgg/publications/papers/pickup08...
* Machine Learning in Multi-frame Image Super-resolution
- http://www.cs.huji.ac.il/~peleg/papers/icpr90-SuperResolutio...
- http://www.wisdom.weizmann.ac.il/~vision/SingleImageSR.html
- http://chiranjivi.tripod.com/EDITut.html
- http://www.tecnick.com/pagefiles/appunti/iNEDI_tesi_Nicola_A...
- http://www.eurasip.org/Proceedings/Eusipco/Eusipco2009/conte...
- http://bengal.missouri.edu/~kes25c/
* http://bengal.missouri.edu/~kes25c/nnedi3.zip
* http://forum.doom9.org/showthread.php?t=147695
- http://arxiv.org/pdf/1501.00092v2.pdf
* https://github.com/nagadomi/waifu2x
* http://waifu2x-avisynth.sunnyone.org/
* https://github.com/sunnyone/Waifu2xAvisynth
- http://i-programmer.info/news/192-photography-a-imaging/1010...
* https://github.com/david-gpu/srez
Here's a previous project that attempted this as well.
They won't tell you it's NNs, but it is. Sony distributes a lot of movies; they use their database of movies to train the upscaling models (which are obviously NNs, cf. https://github.com/nagadomi/waifu2x) and then put the chip in the TV.
It's almost equivalent to storing Pride and Prejudice and Zombies in your TV in 4K, and then reproducing it when the TV matches it against whatever is playing.