Since OpenAI released CLIP, trained on internet pictures and their nearby text, people have been using it to generate images. In all these methods (CLIP+DALL-E, CLIP+BigGAN, CLIP+FFT, CLIP+VQGAN, CLIP+diffusion), you come up with a text prompt, some algorithm presents its images to CLIP,