What does HackerNews think of latent-diffusion?

Show HN: Image Upscaler AI | Jun 2023

There are a lot but the one implemented as LDSR in most stable guis is this one. https://github.com/CompVis/latent-diffusion

upscale wiki is really the place to explore everything image scaling:

https://upscale.wiki/

I replaced all our blog thumbnails using DALL·E 2 | Aug 2022

Expand Context ↕

The latent-diffusion[1] I've got running at home frequently generates stock image watermarks (e.g. "The London Skyline at night in the style of Carboni"[2], images 1, 2, and 6)

[1] https://github.com/CompVis/latent-diffusion [2] https://imgur.com/a/8tOI9QU

Show HN: [NSFW] Diffusion models for porn generation | Jun 2022

FYI if you only have a limited amount of compute, latent diffusion models will converge much faster.

https://github.com/CompVis/latent-diffusion

Asking robots to design stained glass windows | May 2022

Expand Context ↕

I don't have any of the DALL-Es but I do have a couple from github [1], [2] which gave these outputs[3]

[1] https://github.com/nerdyrodent/VQGAN-CLIP [2] https://github.com/CompVis/latent-diffusion [3] https://imgur.com/a/DjQYLUz

Imagen, a text-to-image diffusion model | May 2022

Expand Context ↕

Don't have access to Dall-E 2 or Imagen but I do have [1] and [2] locally and they produced [3] with that prompt.

[1] https://github.com/nerdyrodent/VQGAN-CLIP.git [2] https://github.com/CompVis/latent-diffusion.git [3] https://imgur.com/a/dCPt35K

Imagen, a text-to-image diffusion model | May 2022

Expand Context ↕

The latent-diffusion[1] one I've been playing with is not terrible at drawing legible text but generally awful at actually drawing the text you want (cf. [2]) (or drawing text when you don't want any.)

[1] https://github.com/CompVis/latent-diffusion.git [2] https://imgur.com/a/Sl8YVD5

Imagen, a text-to-image diffusion model | May 2022

Expand Context ↕

latent diffusion model trained on LAION-400M https://github.com/CompVis/latent-diffusion

DALL-E 2 open source implementation | May 2022

Expand Context ↕

An older, but similar and still impressive alternative is available here: https://github.com/CompVis/latent-diffusion

If you have a decent amount of VRAM, you can use it to start generating images with their pre-trained models. They're nowhere near as impressive as DALL-E 2, but they're still pretty damn cool. I don't know what the exact memory requirements are, but I've gotten it to run on a 1080 TI with 11gb.

EDIT: I also tried a 980 with 4GB of RAM a while back, but that failed...so you probably need more than that.

The Weird and Wonderful World of AI Art | Apr 2022

Hi, I'm the author of this post. I hope you all enjoy it! I researched and wrote this back in January, and although the main ideas are still relevant, the landscape of AI art generation has changed quite a bit in just three months. Here are some important new developments:

- DALL-E 2: https://openai.com/dall-e-2/

- Midjourney: https://twitter.com/midjourney

- Laion 5B dataset: https://laion.ai/laion-5b-a-new-era-of-open-large-scale-mult...

- Compvis latent diffusion: https://github.com/CompVis/latent-diffusion

Since the field is moving so quickly, this newsletter is a good way to try to stay on top of things: https://multimodal.art/news

Also I went on Yannic Kilcher's podcast to talk about this! https://www.youtube.com/watch?v=DdkenV-ZdJU&ab_channel=Yanni...

Show HN: an app to create images with AI | Apr 2022

Expand Context ↕

latent difussion models (see: https://github.com/CompVis/latent-diffusion).

we also use our own open-source library https://github.com/thegeniverse/geniverse

Playing with DALL-E 2 | Apr 2022

Expand Context ↕

This isn’t true, the quality of images generated by DALL-E are really good, but they are an incremental improvement and based on a long chain of prior work. See e.g. https://github.com/CompVis/latent-diffusion