Warm up
—- * * FOR NEW STUDENTS ** ————————————— ————
- What industry do you work in and what is your role?
- What are your responses in your role / position?
- Can you describe to the function of your workplace / company?
- How many departments, how many offices. National or International?
- What are the minimum requirements for employment ie Education or Experience?
- How many opportunities are there to ‘move up the ladder’?
- What is the process for changing job roles ie Interview? Test?
————————————————– —— ——————————————————
- Current projects? Deadlines? Opportunities?
- Anything of interest happening?
————————————————————————————————————
1. DALL-E, a portmanteau of the artist “Salvador Dalí” and the robot “WALL-E,” debuted in January of 2021. It was a limited but fascinating test of AI’s ability to visually represent concepts, from mundane depictions of a mannequin in a flannel shirt to “a giraffe made of turtle” or an illustration of a radish walking a dog. At the time, OpenAI said it would continue to build on the system while examining potential dangers like bias in image generation or the production of misinformation. It’s attempting to address those issues using technical safeguards and a new content policy while also reducing its computing load and pushing forward the basic capabilities of the model.
2. One of the new DALL-E 2 features, inpainting, applies DALL-E’s text-to-image capabilities on a more granular level. Users can start with an existing picture, select an area, and tell the model to edit it. You can block out a painting on a living room wall and replace it with a different picture, for instance, or add a vase of flowers on a coffee table. The model can fill (or remove) objects while accounting for details like the directions of shadows in a room.
3. Another feature, variations, is sort of like an image search tool for pictures that don’t exist. Users can upload a starting image and then create a range of variations similar to it. They can also blend two images, generating pictures that have elements of both. The generated images are 1,024 x 1,024 pixels, a leap over the 256 x 256 pixels the original model delivered.
What are some practical usages of this technology? Which industry could benefit from its usage?
4. We compressed images into a series of words and we just learned to predict what comes next,” says OpenAI research scientist Prafulla Dhariwal. But the word-matching didn’t necessarily capture the qualities humans found most important, and the predictive process limited the realism of the images.
5. CLIP was designed to look at images and summarize their contents the way a human would, and OpenAI iterated on this process to create “unCLIP” — an inverted version that starts with the description and works its way toward an image. DALL-E 2 generates the image using a process called diffusion, which Dhariwal describes as starting with a “bag of dots” and then filling in a pattern with greater and greater detail.
6. DALL-E’s full model was never released publicly, but other developers have honed their own tools that imitate some of its functions over the past year. One of the most popular mainstream applications is Wombo’s Dream mobile app, which generates pictures of whatever users describe in a variety of art styles. OpenAI isn’t releasing any new models today, but developers could use its technical findings to update their own work.
Do you think AI will have an impact on the art world? How do you think the art world will change in the future?
7. OpenAI has implemented some built-in safeguards. The model was trained on data that had some objectionable material weeded out, ideally limiting its ability to produce objectionable content. There’s a watermark indicating the AI-generated nature of the work, although it could theoretically be cropped out. As a preemptive anti-abuse feature, the model also can’t generate any recognizable faces based on a name — even asking for something like the Mona Lisa would apparently return a variant on the actual face from the painting.
8. DALL-E 2 will be testable by vetted partners with some caveats. Users are banned from uploading or generating images that are “not G-rated” and “could cause harm,” including anything involving hate symbols, nudity, obscene gestures, or “major conspiracies or events related to major ongoing geopolitical events.”
9. They must also disclose the role of AI in generating the images, and they can’t serve generated images to other people through an app or website — so you won’t initially see a DALL-E-powered version of something like Dream. But OpenAI hopes to add it to the group’s API toolset later, allowing it to power third-party apps. “Our hope is to keep doing a staged process here, so we can keep evaluating from the feedback we get how to release this technology safely,” says Dhariwal.