What does HackerNews think of latent-diffusion?
High-Resolution Image Synthesis with Latent Diffusion Models
upscale wiki is really the place to explore everything image scaling:
[1] https://github.com/CompVis/latent-diffusion [2] https://imgur.com/a/8tOI9QU
[1] https://github.com/nerdyrodent/VQGAN-CLIP [2] https://github.com/CompVis/latent-diffusion [3] https://imgur.com/a/DjQYLUz
[1] https://github.com/nerdyrodent/VQGAN-CLIP.git [2] https://github.com/CompVis/latent-diffusion.git [3] https://imgur.com/a/dCPt35K
[1] https://github.com/CompVis/latent-diffusion.git [2] https://imgur.com/a/Sl8YVD5
If you have a decent amount of VRAM, you can use it to start generating images with their pre-trained models. They're nowhere near as impressive as DALL-E 2, but they're still pretty damn cool. I don't know what the exact memory requirements are, but I've gotten it to run on a 1080 TI with 11gb.
EDIT: I also tried a 980 with 4GB of RAM a while back, but that failed...so you probably need more than that.
- DALL-E 2: https://openai.com/dall-e-2/
- Midjourney: https://twitter.com/midjourney
- Laion 5B dataset: https://laion.ai/laion-5b-a-new-era-of-open-large-scale-mult...
- Compvis latent diffusion: https://github.com/CompVis/latent-diffusion
Since the field is moving so quickly, this newsletter is a good way to try to stay on top of things: https://multimodal.art/news
Also I went on Yannic Kilcher's podcast to talk about this! https://www.youtube.com/watch?v=DdkenV-ZdJU&ab_channel=Yanni...
we also use our own open-source library https://github.com/thegeniverse/geniverse