DALL-E 2 is a new artificial intelligence algorithm that creates a picture from a short phrase or sentence in less than a minute.
The program offers a significant leap in the quality and realism of text-to-image systems, mimicking specific styles with high accuracy.
But the technology raises questions about what it means to be creative when DALL-E 2 automates so much of the creative process.
The program also has the potential for harm, such as its reliance on stereotypes and possible uses for disinformation.
Copyright: weforum.org – “Give this AI a few words of description and it produces a stunning image – but is it art?”
A picture may be worth a thousand words, but thanks to an artificial intelligence program called DALL-E 2, you can have a professional-looking image with far fewer.
DALL-E 2 is a new neural network algorithm that creates a picture from a short phrase or sentence that you provide. The program, which was announced by the artificial intelligence research laboratory OpenAI in April 2022, hasn’t been released to the public. But a small and growing number of people – myself included – have been given access to experiment with it.
As a researcher studying the nexus of technology and art, I was keen to see how well the program worked. After hours of experimentation, it’s clear that DALL-E – while not without shortcomings – is leaps and bounds ahead of existing image generation technology. It raises immediate questions about how these technologies will change how art is made and consumed. It also raises questions about what it means to be creative when DALL-E 2 seems to automate so much of the creative process itself.
A staggering range of style and subjects
OpenAI researchers built DALL-E 2 from an enormous collection of images with captions. They gathered some of the images online and licensed others.
Using DALL-E 2 looks a lot like searching for an image on the web: you type in a short phrase into a text box, and it gives back six images.
But instead of being culled from the web, the program creates six brand-new images, each of which reflect some version of the entered phrase. (Until recently, the program produced 10 images per prompt.) For example, when some friends and I gave DALL-E 2 the text prompt “cats in devo hats,” it produced 10 images that came in different styles.
— Aaron Hertzmann (@AaronHertzmann) June 9, 2022
Nearly all of them could plausibly pass for professional photographs or drawings. While the algorithm did not quite grasp “Devo hat” – the strange helmets worn by the New Wave band Devo – the headgear in the images it produced came close.[…]
Read more: www.weforum.org