Koke_Cacao-VQGAN+CLIP
(05-VQGAN+CLIP)
Using: wikiart_16384 + ViT-B/32 + default parameters
Prompt: A student suffering from his coding homework
The image quality is not as good as that of synthesizers driven by image inputs (for example, style-transfer
models), possibly because the natural language processing pipeline restricts the latent space, or because the
image generator and the text-image model are not trained end-to-end. The style of the generated images becomes
cliché quickly, and the tool gives artists little control over the generated image.
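
To make the "not trained end-to-end" point concrete, here is a toy sketch of the general idea behind text-guided generation: a frozen generator and a frozen image encoder are left untouched, and only the latent code is nudged until the encoded image lines up with a text embedding. Everything here is an assumption for illustration: `W_dec` and `W_enc` are small random matrices standing in for VQGAN and CLIP, `text` is a random vector standing in for the prompt embedding, and `optimize_latent` is a hypothetical name. It is not the notebook's actual code.

```python
import numpy as np


def cosine(a, b):
    """Cosine similarity between two 1-D vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def optimize_latent(seed=0, steps=200, lr=0.1):
    """Toy text-guided search: only the latent z changes, both 'models' stay frozen."""
    rng = np.random.default_rng(seed)
    W_dec = rng.normal(size=(16, 8)) / np.sqrt(8)    # frozen stand-in for the image generator
    W_enc = rng.normal(size=(4, 16)) / np.sqrt(16)   # frozen stand-in for the image encoder
    text = rng.normal(size=4)
    text /= np.linalg.norm(text)                     # stand-in for the prompt embedding (unit length)
    z = rng.normal(size=8)                           # latent code, the only thing we optimize

    def score(latent):
        image = np.tanh(W_dec @ latent)
        return cosine(W_enc @ image, text)

    initial = current = score(z)
    for _ in range(steps):
        image = np.tanh(W_dec @ z)
        emb = W_enc @ image
        norm = np.linalg.norm(emb)
        # Gradient of cos(emb, text) with respect to emb (text has unit length).
        g_emb = text / norm - (emb @ text) * emb / norm**3
        # Backpropagate through the frozen encoder and the frozen generator.
        g_image = W_enc.T @ g_emb
        g_z = W_dec.T @ (g_image * (1.0 - image**2))
        candidate = z + lr * g_z
        candidate_score = score(candidate)
        if candidate_score > current:
            z, current = candidate, candidate_score  # accept steps that improve the match
        else:
            lr *= 0.5                                # otherwise shrink the step size
    return initial, current


if __name__ == "__main__":
    before, after = optimize_latent()
    print(f"similarity before: {before:.3f}, after: {after:.3f}")
```

In this sketch the only signal reaching the latent is a single similarity score, and neither stand-in model is ever adjusted to suit the other. That is one plausible reading of why the real tool offers so little fine-grained control.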