دال-إي DALL-E

DALL-E
	Watermark present on DALL-E images
	An image generated by DALL-E 2, from the prompt Teddy bears working on new AI research underwater with 1990s technology
المطوّر	OpenAI
الإطلاق المبدئي	5 يناير 2021; منذ 4 سنين
الإصدار المستقر	DALL-E 3 / 10 أغسطس 2023; منذ 20 شهرًا
النوع	Text-to-image model
الموقع الإلكتروني	labs.openai.com

DALL-E, DALL-E 2, and DALL-E 3 (stylised DALL·E, and pronounced DOLL-E) are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions known as prompts.

The first version of DALL-E was announced in January 2021. In the following year, its successor DALL-E 2 was released. DALL-E 3 was released natively into ChatGPT for ChatGPT Plus and ChatGPT Enterprise customers in October 2023,^[1] with availability via OpenAI's API^[2] and "Labs" platform provided in early November.^[3] Microsoft implemented the model in Bing's Image Creator tool and plans to implement it into their Designer app.^[4]

History and background

DALL-E was revealed by OpenAI in a blog post on 5 January 2021, and uses a version of GPT-3^[5] modified to generate images.

On 6 April 2022, OpenAI announced DALL-E 2, a successor designed to generate more realistic images at higher resolutions that "can combine concepts, attributes, and styles".^[6] On 20 July 2022, DALL-E 2 entered into a beta phase with invitations sent to 1 million waitlisted individuals;^[7] users could generate a certain number of images for free every month and may purchase more.^[8] Access had previously been restricted to pre-selected users for a research preview due to concerns about ethics and safety.^[9]^[10] On 28 September 2022, DALL-E 2 was opened to everyone and the waitlist requirement was removed.^[11] In September 2023, OpenAI announced their latest image model, DALL-E 3, capable of understanding "significantly more nuance and detail" than previous iterations.^[12] In early November 2022, OpenAI released DALL-E 2 as an API, allowing developers to integrate the model into their own applications. Microsoft unveiled their implementation of DALL-E 2 in their Designer app and Image Creator tool included in Bing and Microsoft Edge.^[13] The API operates on a cost-per-image basis, with prices varying depending on image resolution. Volume discounts are available to companies working with OpenAI's enterprise team.^[14]

The software's name is a portmanteau of the names of animated robot Pixar character WALL-E and the Catalan surrealist artist Salvador Dalí.^[15]^[5]

In February 2024, OpenAI began adding watermarks to DALL-E generated images, containing metadata in the C2PA (Coalition for Content Provenance and Authenticity) standard promoted by the Content Authenticity Initiative.^[16]

Technology

The first generative pre-trained transformer (GPT) model was initially developed by OpenAI in 2018,^[17] using a Transformer architecture. The first iteration, GPT-1,^[18] was scaled up to produce GPT-2 in 2019;^[19] in 2020, it was scaled up again to produce GPT-3, with 175 billion parameters.^[20]^[5]^[21]

DALL-E

DALL-E has three components: a discrete VAE, an autoregressive decoder-only Transformer (12 billion parameters) similar to GPT-3, and a CLIP pair of image encoder and text encoder.^[22]

The discrete VAE can convert an image to a sequence of tokens, and conversely, convert a sequence of tokens back to an image. This is necessary as the Transformer does not directly process image data.^[22]

The input to the Transformer model is a sequence of tokenized image caption followed by tokenized image patches. The image caption is in English, tokenized by byte pair encoding (vocabulary size 16384), and can be up to 256 tokens long. Each image is a 256×256 RGB image, divided into 32×32 patches of 4×4 each. Each patch is then converted by a discrete variational autoencoder to a token (vocabulary size 8192).^[22]

DALL-E was developed and announced to the public in conjunction with CLIP (Contrastive Language-Image Pre-training).^[23] CLIP is a separate model based on contrastive learning that was trained on 400 million pairs of images with text captions scraped from the Internet. Its role is to "understand and rank" DALL-E's output by predicting which caption from a list of 32,768 captions randomly selected from the dataset (of which one was the correct answer) is most appropriate for an image.^[24]

A trained CLIP pair is used to filter a larger initial list of images generated by DALL-E to select the image that is closest to the text prompt.^[22]

DALL-E 2

DALL-E 2 uses 3.5 billion parameters, a smaller number than its predecessor.^[22] Instead of an autoregressive Transformer, DALL-E 2 uses a diffusion model conditioned on CLIP image embeddings, which, during inference, are generated from CLIP text embeddings by a prior model.^[22] This is the same architecture as that of Stable Diffusion, released a few months later.

Capabilities

DALL-E can generate imagery in multiple styles, including photorealistic imagery, paintings, and emoji.^[5] It can "manipulate and rearrange" objects in its images,^[5] and can correctly place design elements in novel compositions without explicit instruction. Thom Dunn writing for BoingBoing remarked that "For example, when asked to draw a daikon radish blowing its nose, sipping a latte, or riding a unicycle, DALL-E often draws the handkerchief, hands, and feet in plausible locations."^[25] DALL-E showed the ability to "fill in the blanks" to infer appropriate details without specific prompts, such as adding Christmas imagery to prompts commonly associated with the celebration,^[26] and appropriately placed shadows to images that did not mention them.^[27] Furthermore, DALL-E exhibits a broad understanding of visual and design trends.^{[بحاجة لمصدر]}

DALL-E can produce images for a wide variety of arbitrary descriptions from various viewpoints^[28] with only rare failures.^[15] Mark Riedl, an associate professor at the Georgia Tech School of Interactive Computing, found that DALL-E could blend concepts (described as a key element of human creativity).^[29]^[30]

Its visual reasoning ability is sufficient to solve Raven's Matrices (visual tests often administered to humans to measure intelligence).^[31]^[32]

An image of accurate text generated by DALL-E 3 based on the text prompt "An illustration of an avocado sitting in a therapist's chair, saying 'I just feel so empty inside' with a pit-sized hole in its center. The therapist, a spoon, scribbles notes"

DALL-E 3 follows complex prompts with more accuracy and detail than its predecessors, and is able to generate more coherent and accurate text.^[33]^[12] DALL-E 3 is integrated into ChatGPT Plus.^[12]

Image modification

Two "variations" of Girl With a Pearl Earring generated with DALL-E 2

Given an existing image, DALL-E 2 can produce "variations" of the image as individual outputs based on the original, as well as edit the image to modify or expand upon it. DALL-E 2's "inpainting" and "outpainting" use context from an image to fill in missing areas using a medium consistent with the original, following a given prompt.

For example, this can be used to insert a new subject into an image, or expand an image beyond its original borders.^[34] According to OpenAI, "Outpainting takes into account the image’s existing visual elements — including shadows, reflections, and textures — to maintain the context of the original image."^[35]

Technical limitations

DALL-E 2's language understanding has limits. It is sometimes unable to distinguish "A yellow book and a red vase" from "A red book and a yellow vase" or "A panda making latte art" from "Latte art of a panda".^[36] It generates images of "an astronaut riding a horse" when presented with the prompt "a horse riding an astronaut".^[37] It also fails to generate the correct images in a variety of circumstances. Requesting more than three objects, negation, numbers, and connected sentences may result in mistakes, and object features may appear on the wrong object.^[28] Additional limitations include handling text — which, even with legible lettering, almost invariably results in dream-like gibberish — and its limited capacity to address scientific information, such as astronomy or medical imagery.^[38]

An attempt to generate Japanese text using the prompt "a person pointing at a tanuki, with a speech bubble that says 'これは狸です！'", which results in the text being rendered with nonsensical kanji and kana

Ethical concerns

DALL-E 2's reliance on public datasets influences its results and leads to algorithmic bias in some cases, such as generating higher numbers of men than women for requests that do not mention gender.^[38] DALL-E 2's training data was filtered to remove violent and sexual imagery, but this was found to increase bias in some cases such as reducing the frequency of women being generated.^[39] OpenAI hypothesize that this may be because women were more likely to be sexualized in training data which caused the filter to influence results.^[39] In September 2022, OpenAI confirmed to The Verge that DALL-E invisibly inserts phrases into user prompts to address bias in results; for instance, "black man" and "Asian woman" are inserted into prompts that do not specify gender or race.^[40]

A concern about DALL-E 2 and similar image generation models is that they could be used to propagate deepfakes and other forms of misinformation.^[41]^[42] As an attempt to mitigate this, the software rejects prompts involving public figures and uploads containing human faces.^[43] Prompts containing potentially objectionable content are blocked, and uploaded images are analyzed to detect offensive material.^[44] A disadvantage of prompt-based filtering is that it is easy to bypass using alternative phrases that result in a similar output. For example, the word "blood" is filtered, but "ketchup" and "red liquid" are not.^[45]^[44]

Another concern about DALL-E 2 and similar models is that they could cause technological unemployment for artists, photographers, and graphic designers due to their accuracy and popularity.^[46]^[47] DALL-E 3 is designed to block users from generating art in the style of currently-living artists.^[12]

In 2023 Microsoft pitched the United States Department of Defense to use DALL-E models to train battlefield management system.^[48] In January 2024 OpenAI removed its blanket ban on military and warfare use from its usage policies.^[49]

Reception

Images generated by DALL-E upon the prompt: "an illustration of a baby daikon radish in a tutu walking a dog"

Most coverage of DALL-E focuses on a small subset of "surreal"^[23] or "quirky"^[29] outputs. DALL-E's output for "an illustration of a baby daikon radish in a tutu walking a dog" was mentioned in pieces from Input,^[50] NBC,^[51] Nature,^[52] and other publications.^[5]^[53]^[54] Its output for "an armchair in the shape of an avocado" was also widely covered.^[23]^[30]

ExtremeTech stated "you can ask DALL-E for a picture of a phone or vacuum cleaner from a specified period of time, and it understands how those objects have changed".^[26] Engadget also noted its unusual capacity for "understanding how telephones and other objects change over time".^[27]

According to MIT Technology Review, one of OpenAI's objectives was to "give language models a better grasp of the everyday concepts that humans use to make sense of things".^[23]

Wall Street investors have had a positive reception of DALL-E 2, with some firms thinking it could represent a turning point for a future multi-trillion dollar industry. By mid-2019, OpenAI had already received over $1 billion in funding from Microsoft and Khosla Ventures,^[55]^[56]^[57] and in January 2023, following the launch of DALL-E 2 and ChatGPT, received an additional $10 billion in funding from Microsoft.^[58]

Japan's anime community has had a negative reaction to DALL-E 2 and similar models.^[59]^[60]^[61] Two arguments are typically presented by artists against the software. The first is that AI art is not art because it is not created by a human with intent. "The juxtaposition of AI-generated images with their own work is degrading and undermines the time and skill that goes into their art. AI-driven image generation tools have been heavily criticized by artists because they are trained on human-made art scraped from the web."^[7] The second is the trouble with copyright law and data text-to-image models are trained on. OpenAI has not released information about what dataset(s) were used to train DALL-E 2, inciting concern from some that the work of artists has been used for training without permission. Copyright laws surrounding these topics are inconclusive at the moment.^[8]

After integrating DALL-E 3 into Bing Chat and ChatGPT, Microsoft and OpenAI faced criticism for excessive content filtering, with critics saying DALL-E had been "lobotomized."^[62] The flagging of images generated by prompts such as "man breaks server rack with sledgehammer" was cited as evidence. Over the first days of its launch, filtering was reportedly increased to the point where images generated by some of Bing's own suggested prompts were being blocked.^[62]^[63] TechRadar argued that leaning too heavily on the side of caution could limit DALL-E's value as a creative tool.^[63]

Open-source implementations

Since OpenAI has not released source code for any of the three models, there have been several attempts to create open-source models offering similar capabilities.^[64]^[65] Released in 2022 on Hugging Face's Spaces platform, Craiyon (formerly DALL-E Mini until a name change was requested by OpenAI in June 2022) is an AI model based on the original DALL-E that was trained on unfiltered data from the Internet. It attracted substantial media attention in mid-2022, after its release due to its capacity for producing humorous imagery.^[66]^[67]^[68]

References

^ David, Emilia (20 سبتمبر 2023). "OpenAI releases third version of DALL-E". The Verge (in الإنجليزية الأمريكية). Archived from the original on 20 سبتمبر 2023. Retrieved 21 سبتمبر 2023.
^ "OpenAI Platform". platform.openai.com (in الإنجليزية). Archived from the original on 20 مارس 2023. Retrieved 10 نوفمبر 2023.
^ Niles, Raymond (10 نوفمبر 2023) [Updated this week]. "DALL-E 3 API". OpenAI help Center (in الإنجليزية). Archived from the original on 10 نوفمبر 2023. Retrieved 10 نوفمبر 2023.
^ Mehdi, Yusuf (21 سبتمبر 2023). "Announcing Microsoft Copilot, your everyday AI companion". The Official Microsoft Blog (in الإنجليزية الأمريكية). Archived from the original on 21 سبتمبر 2023. Retrieved 21 سبتمبر 2023.
^ ^أ ^ب ^ت ^ث ^ج ^ح Johnson, Khari (5 يناير 2021). "OpenAI debuts DALL-E for generating images from text". VentureBeat. Archived from the original on 5 يناير 2021. Retrieved 5 يناير 2021.
^ "DALL·E 2". OpenAI (in الإنجليزية الأمريكية). Archived from the original on 6 أبريل 2022. Retrieved 6 يوليو 2022.
^ ^أ ^ب "DALL·E Now Available in Beta". OpenAI (in الإنجليزية). 20 يوليو 2022. Archived from the original on 20 يوليو 2022. Retrieved 20 يوليو 2022.
^ ^أ ^ب Allyn, Bobby (20 يوليو 2022). "Surreal or too real? Breathtaking AI tool DALL-E takes its images to a bigger stage". NPR (in الإنجليزية). Archived from the original on 20 يوليو 2022. Retrieved 20 يوليو 2022.
^ "DALL·E Waitlist". labs.openai.com (in الإنجليزية). Archived from the original on 4 يوليو 2022. Retrieved 6 يوليو 2022.
^ "From Trump Nevermind babies to deep fakes: DALL-E and the ethics of AI art". the Guardian (in الإنجليزية). 18 يونيو 2022. Archived from the original on 6 يوليو 2022. Retrieved 6 يوليو 2022.
^ "DALL·E Now Available Without Waitlist". OpenAI (in الإنجليزية). 28 سبتمبر 2022. Archived from the original on 4 أكتوبر 2022. Retrieved 5 أكتوبر 2022.
^ ^أ ^ب ^ت ^ث "DALL·E 3". OpenAI (in الإنجليزية الأمريكية). Archived from the original on 20 سبتمبر 2023. Retrieved 21 سبتمبر 2023.
^ "DALL·E API Now Available in Public Beta". OpenAI (in الإنجليزية). 3 نوفمبر 2022. Archived from the original on 19 نوفمبر 2022. Retrieved 19 نوفمبر 2022.
^ Wiggers, Kyle (3 نوفمبر 2022). "Now anyone can build apps that use DALL-E 2 to generate images". TechCrunch. Archived from the original on 19 نوفمبر 2022. Retrieved 19 نوفمبر 2022.
^ ^أ ^ب Coldewey, Devin (5 يناير 2021). "OpenAI's DALL-E creates plausible images of literally anything you ask it to". Archived from the original on 6 يناير 2021. Retrieved 5 يناير 2021.
^ Growcoot, Matt (8 فبراير 2024). "AI Images Generated on DALL-E Now Contain the Content Authenticity Tag". PetaPixel (in الإنجليزية). Retrieved 4 أبريل 2024.
^ Radford, Alec; Narasimhan, Karthik; Salimans, Tim; Sutskever, Ilya (11 يونيو 2018). "Improving Language Understanding by Generative Pre-Training" (PDF). OpenAI. p. 12. Archived (PDF) from the original on 26 يناير 2021. Retrieved 23 يناير 2021.
^ "GPT-1 to GPT-4: Each of OpenAI's GPT Models Explained and Compared". 11 أبريل 2023. Archived from the original on 15 أبريل 2023. Retrieved 29 أبريل 2023.
^ Radford, Alec; Wu, Jeffrey; Child, Rewon; et al. (14 فبراير 2019). "Language models are unsupervised multitask learners" (PDF). cdn.openai.com. 1 (8). Archived (PDF) from the original on 6 فبراير 2021. Retrieved 19 ديسمبر 2020.
^ Brown, Tom B.; Mann, Benjamin; Ryder, Nick; et al. (22 يوليو 2020). "Language Models are Few-Shot Learners". arXiv:2005.14165 [cs.CL].
^ Ramesh, Aditya; Pavlov, Mikhail; Goh, Gabriel; et al. (24 فبراير 2021). "Zero-Shot Text-to-Image Generation". arXiv:2102.12092 [cs.LG].
^ ^أ ^ب ^ت ^ث ^ج ^ح Ramesh, Aditya; Dhariwal, Prafulla; Nichol, Alex; Chu, Casey; Chen, Mark (12 أبريل 2022). "Hierarchical Text-Conditional Image Generation with CLIP Latents". arXiv:2204.06125 [cs.CV].
^ ^أ ^ب ^ت ^ث Heaven, Will Douglas (5 يناير 2021). "This avocado armchair could be the future of AI". MIT Technology Review. Archived from the original on 5 يناير 2021. Retrieved 5 يناير 2021.
^ (2021-07-01) "Learning Transferable Visual Models From Natural Language Supervision" in Proceedings of the 38th International Conference on Machine Learning.: 8748–8763, PMLR.
^ Dunn, Thom (10 فبراير 2021). "This AI neural network transforms text captions into art, like a jellyfish Pikachu". BoingBoing. Archived from the original on 22 فبراير 2021. Retrieved 2 مارس 2021.
^ ^أ ^ب Whitwam, Ryan (6 يناير 2021). "OpenAI's 'DALL-E' Generates Images From Text Descriptions". ExtremeTech. Archived from the original on 28 يناير 2021. Retrieved 2 مارس 2021.
^ ^أ ^ب Dent, Steve (6 يناير 2021). "OpenAI's DALL-E app generates images from just a description". Engadget. Archived from the original on 27 يناير 2021. Retrieved 2 مارس 2021.
^ ^أ ^ب Marcus, Gary; Davis, Ernest; Aaronson, Scott (2 مايو 2022). "A very preliminary analysis of DALL-E 2". arXiv:2204.13807 [cs.CV].
^ ^أ ^ب Shead, Sam (8 يناير 2021). "Why everyone is talking about an image generator released by an Elon Musk-backed A.I. lab". CNBC. Archived from the original on 16 يوليو 2022. Retrieved 2 مارس 2021.
^ ^أ ^ب Wakefield, Jane (6 يناير 2021). "AI draws dog-walking baby radish in a tutu". British Broadcasting Corporation. Archived from the original on 2 مارس 2021. Retrieved 3 مارس 2021.
^ Markowitz, Dale (10 يناير 2021). "Here's how OpenAI's magical DALL-E image generator works". TheNextWeb. Archived from the original on 23 فبراير 2021. Retrieved 2 مارس 2021.
^ "DALL·E: Creating Images from Text". OpenAI (in الإنجليزية). 5 يناير 2021. Archived from the original on 27 مارس 2021. Retrieved 13 أغسطس 2022.
^ Edwards, Benj (20 سبتمبر 2023). "OpenAI's new AI image generator pushes the limits in detail and prompt fidelity". Ars Technica (in الإنجليزية الأمريكية). Archived from the original on 21 سبتمبر 2023. Retrieved 21 سبتمبر 2023.
^ Coldewey, Devin (6 أبريل 2022). "New OpenAI tool draws anything, bigger and better than ever". TechCrunch (in الإنجليزية الأمريكية). Archived from the original on 6 مايو 2023. Retrieved 26 نوفمبر 2022.
^ "DALL·E: Introducing Outpainting". OpenAI (in الإنجليزية). 31 أغسطس 2022. Archived from the original on 26 نوفمبر 2022. Retrieved 26 نوفمبر 2022.
^ Saharia, Chitwan; Chan, William; Saxena, Saurabh; et al. (23 مايو 2022). "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding". arXiv:2205.11487 [cs.CV].
^ Marcus, Gary (28 مايو 2022). "Horse rides astronaut". The Road to AI We Can Trust. Archived from the original on 19 يونيو 2022. Retrieved 18 يونيو 2022.
^ ^أ ^ب Strickland, Eliza (14 يوليو 2022). "DALL-E 2's Failures Are the Most Interesting Thing About It". IEEE Spectrum (in الإنجليزية). Archived from the original on 15 يوليو 2022. Retrieved 16 أغسطس 2022.
^ ^أ ^ب "DALL·E 2 Pre-Training Mitigations". OpenAI (in الإنجليزية). 28 يونيو 2022. Archived from the original on 19 يوليو 2022. Retrieved 18 يوليو 2022.
^ James Vincent (29 سبتمبر 2022). "OpenAI's image generator DALL-E is available for anyone to use immediately". The Verge. Archived from the original on 29 سبتمبر 2022. Retrieved 29 سبتمبر 2022.
^ Taylor, Josh (18 يونيو 2022). "From Trump Nevermind babies to deep fakes: DALL-E and the ethics of AI art". The Guardian. Archived from the original on 6 يوليو 2022. Retrieved 2 أغسطس 2022.
^ Knight, Will (13 يوليو 2022). "When AI Makes Art, Humans Supply the Creative Spark". Wired. Archived from the original on 2 أغسطس 2022. Retrieved 2 أغسطس 2022.
^ Rose, Janus (24 يونيو 2022). "DALL-E Is Now Generating Realistic Faces of Fake People". Vice. Archived from the original on 30 يوليو 2022. Retrieved 2 أغسطس 2022.
^ ^أ ^ب OpenAI (19 يونيو 2022). "DALL·E 2 Preview – Risks and Limitations". GitHub. Archived from the original on 2 أغسطس 2022. Retrieved 2 أغسطس 2022.
^ Lane, Laura (1 يوليو 2022). "DALL-E, Make Me Another Picasso, Please". The New Yorker. Archived from the original on 2 أغسطس 2022. Retrieved 2 أغسطس 2022.
^ Goldman, Sharon (26 يوليو 2022). "OpenAI: Will DALL-E 2 kill creative careers?". Archived from the original on 15 أغسطس 2022. Retrieved 16 أغسطس 2022.
^ Blain, Loz (29 يوليو 2022). "DALL-E 2: A dream tool and an existential threat to visual artists". Archived from the original on 17 أغسطس 2022. Retrieved 16 أغسطس 2022.
^ Biddle, Sam (10 أبريل 2024). "Microsoft Pitched OpenAI's DALL-E as Battlefield Tool for U.S. Military". The Intercept.
^ Biddle, Sam (12 يناير 2024). "OpenAI Quietly Deletes Ban on Using ChatGPT for "Military and Warfare"". The Intercept.
^ Kasana, Mehreen (7 يناير 2021). "This AI turns text into surreal, suggestion-driven art". Input. Archived from the original on 29 يناير 2021. Retrieved 2 مارس 2021.
^ Ehrenkranz, Melanie (27 يناير 2021). "Here's DALL-E: An algorithm learned to draw anything you tell it". NBC News. Archived from the original on 20 فبراير 2021. Retrieved 2 مارس 2021.
^ Stove, Emma (5 فبراير 2021). "Tardigrade circus and a tree of life — January's best science images". Nature. Archived from the original on 8 مارس 2021. Retrieved 2 مارس 2021.
^ Knight, Will (26 يناير 2021). "This AI Could Go From 'Art' to Steering a Self-Driving Car". Wired. Archived from the original on 21 فبراير 2021. Retrieved 2 مارس 2021.
^ Metz, Rachel (2 فبراير 2021). "A radish in a tutu walking a dog? This AI can draw it really well". CNN. Archived from the original on 16 يوليو 2022. Retrieved 2 مارس 2021.
^ Leswing, Kif (8 أكتوبر 2022). "Why Silicon Valley is so excited about awkward drawings done by artificial intelligence". CNBC (in الإنجليزية). Archived from the original on 29 يوليو 2023. Retrieved 1 ديسمبر 2022.
^ Etherington, Darrell (22 يوليو 2019). "Microsoft invests $1 billion in OpenAI in new multiyear partnership". TechCrunch (in الإنجليزية الأمريكية). Archived from the original on 22 يوليو 2019. Retrieved 21 سبتمبر 2023.
^ "OpenAI's first VC backer weighs in on generative A.I." Fortune (in الإنجليزية). Archived from the original on 23 أكتوبر 2023. Retrieved 21 سبتمبر 2023.
^ Metz, Cade; Weise, Karen (23 يناير 2023). "Microsoft to Invest $10 Billion in OpenAI, the Creator of ChatGPT". The New York Times (in الإنجليزية الأمريكية). ISSN 0362-4331. Archived from the original on 21 سبتمبر 2023. Retrieved 21 سبتمبر 2023.
^ "AI-generated art sparks furious backlash from Japan's anime community". Rest of World (in الإنجليزية الأمريكية). 27 أكتوبر 2022. Archived from the original on 31 ديسمبر 2022. Retrieved 3 يناير 2023.
^ Roose, Kevin (2 سبتمبر 2022). "An A.I.-Generated Picture Won an Art Prize. Artists Aren't Happy". The New York Times (in الإنجليزية الأمريكية). ISSN 0362-4331. Archived from the original on 31 مايو 2023. Retrieved 3 يناير 2023.
^ Daws, Ryan (15 ديسمبر 2022). "ArtStation backlash increases following AI art protest response". AI News (in الإنجليزية البريطانية). Archived from the original on 3 يناير 2023. Retrieved 3 يناير 2023.
^ ^أ ^ب Corden, Jez (8 أكتوبر 2023). "Bing Dall-E 3 image creation was great for a few days, but now Microsoft has predictably lobotomized it". Windows Central. Archived from the original on 10 أكتوبر 2023. Retrieved 11 أكتوبر 2023.
^ ^أ ^ب Allan, Darren (9 أكتوبر 2023). "Microsoft reins in Bing AI's Image Creator – and the results don't make much sense". TechRadar. Archived from the original on 10 أكتوبر 2023. Retrieved 11 أكتوبر 2023.
^ Sahar Mor, Stripe (16 أبريل 2022). "How DALL-E 2 could solve major computer vision challenges". VentureBeat. Archived from the original on 24 مايو 2022. Retrieved 15 يونيو 2022.
^ "jina-ai/dalle-flow". Jina AI. 17 يونيو 2022. Archived from the original on 17 يونيو 2022. Retrieved 17 يونيو 2022.
^ Carson, Erin (14 يونيو 2022). "Everything to Know About Dall-E Mini, the Mind-Bending AI Art Creator". CNET. Archived from the original on 15 يونيو 2022. Retrieved 15 يونيو 2022.
^ Schroeder, Audra (9 يونيو 2022). "AI program DALL-E mini prompts some truly cursed images". Daily Dot. Archived from the original on 10 يونيو 2022. Retrieved 15 يونيو 2022.
^ Diaz, Ana (15 يونيو 2022). "People are using DALL-E mini to make meme abominations — like pug Pikachu". Polygon. Archived from the original on 15 يونيو 2022. Retrieved 15 يونيو 2022.

External links

Ramesh, Aditya; Pavlov, Mikhail; Goh, Gabriel; Gray, Scott; Voss, Chelsea; Radford, Alec; Chen, Mark; Sutskever, Ilya (26 فبراير 2021). "Zero-Shot Text-to-Image Generation". arXiv:2102.12092 [cs.CV].. The original report on DALL-E.
DALL-E 3 System Card
DALL-E 3 paper by OpenAI
DALL-E 2 website
Craiyon website

قالب:Generative AI قالب:Artificial intelligence navbox

[David-2023-1] David, Emilia (20 سبتمبر 2023). "OpenAI releases third version of DALL-E". The Verge (in الإنجليزية الأمريكية). Archived from the original on 20 سبتمبر 2023. Retrieved 21 سبتمبر 2023.

[platform.openai.com-2] "OpenAI Platform". platform.openai.com (in الإنجليزية). Archived from the original on 20 مارس 2023. Retrieved 10 نوفمبر 2023.

[Niles-2023-3] Niles, Raymond (10 نوفمبر 2023) [Updated this week]. "DALL-E 3 API". OpenAI help Center (in الإنجليزية). Archived from the original on 10 نوفمبر 2023. Retrieved 10 نوفمبر 2023.

[Mehdi-2023-4] Mehdi, Yusuf (21 سبتمبر 2023). "Announcing Microsoft Copilot, your everyday AI companion". The Official Microsoft Blog (in الإنجليزية الأمريكية). Archived from the original on 21 سبتمبر 2023. Retrieved 21 سبتمبر 2023.

[vb-5] أ ^ب ^ت ^ث ^ج ^ح Johnson, Khari (5 يناير 2021). "OpenAI debuts DALL-E for generating images from text". VentureBeat. Archived from the original on 5 يناير 2021. Retrieved 5 يناير 2021.

[OpenAI-2-6] "DALL·E 2". OpenAI (in الإنجليزية الأمريكية). Archived from the original on 6 أبريل 2022. Retrieved 6 يوليو 2022.

[OpenAI-2022b-7] أ ^ب "DALL·E Now Available in Beta". OpenAI (in الإنجليزية). 20 يوليو 2022. Archived from the original on 20 يوليو 2022. Retrieved 20 يوليو 2022.

[Allyn-2022-8] أ ^ب Allyn, Bobby (20 يوليو 2022). "Surreal or too real? Breathtaking AI tool DALL-E takes its images to a bigger stage". NPR (in الإنجليزية). Archived from the original on 20 يوليو 2022. Retrieved 20 يوليو 2022.

[labs.openai.com-9] "DALL·E Waitlist". labs.openai.com (in الإنجليزية). Archived from the original on 4 يوليو 2022. Retrieved 6 يوليو 2022.

[Guardian-2022-10] "From Trump Nevermind babies to deep fakes: DALL-E and the ethics of AI art". the Guardian (in الإنجليزية). 18 يونيو 2022. Archived from the original on 6 يوليو 2022. Retrieved 6 يوليو 2022.

[OpenAI-2022c-11] "DALL·E Now Available Without Waitlist". OpenAI (in الإنجليزية). 28 سبتمبر 2022. Archived from the original on 4 أكتوبر 2022. Retrieved 5 أكتوبر 2022.

[OpenAI-12] أ ^ب ^ت ^ث "DALL·E 3". OpenAI (in الإنجليزية الأمريكية). Archived from the original on 20 سبتمبر 2023. Retrieved 21 سبتمبر 2023.

[OpenAI-2022d-13] "DALL·E API Now Available in Public Beta". OpenAI (in الإنجليزية). 3 نوفمبر 2022. Archived from the original on 19 نوفمبر 2022. Retrieved 19 نوفمبر 2022.

[Wiggers-2022-14] Wiggers, Kyle (3 نوفمبر 2022). "Now anyone can build apps that use DALL-E 2 to generate images". TechCrunch. Archived from the original on 19 نوفمبر 2022. Retrieved 19 نوفمبر 2022.

[tc-15] أ ^ب Coldewey, Devin (5 يناير 2021). "OpenAI's DALL-E creates plausible images of literally anything you ask it to". Archived from the original on 6 يناير 2021. Retrieved 5 يناير 2021.

[16] Growcoot, Matt (8 فبراير 2024). "AI Images Generated on DALL-E Now Contain the Content Authenticity Tag". PetaPixel (in الإنجليزية). Retrieved 4 أبريل 2024.

[Radford-2018-17] Radford, Alec; Narasimhan, Karthik; Salimans, Tim; Sutskever, Ilya (11 يونيو 2018). "Improving Language Understanding by Generative Pre-Training" (PDF). OpenAI. p. 12. Archived (PDF) from the original on 26 يناير 2021. Retrieved 23 يناير 2021.

[GPT-2023-18] "GPT-1 to GPT-4: Each of OpenAI's GPT Models Explained and Compared". 11 أبريل 2023. Archived from the original on 15 أبريل 2023. Retrieved 29 أبريل 2023.

[Radford-2019-19] Radford, Alec; Wu, Jeffrey; Child, Rewon; et al. (14 فبراير 2019). "Language models are unsupervised multitask learners" (PDF). cdn.openai.com. 1 (8). Archived (PDF) from the original on 6 فبراير 2021. Retrieved 19 ديسمبر 2020.

[Brown-2020-20] Brown, Tom B.; Mann, Benjamin; Ryder, Nick; et al. (22 يوليو 2020). "Language Models are Few-Shot Learners". arXiv:2005.14165 [cs.CL].

[dallepaper-21] Ramesh, Aditya; Pavlov, Mikhail; Goh, Gabriel; et al. (24 فبراير 2021). "Zero-Shot Text-to-Image Generation". arXiv:2102.12092 [cs.LG].

[Ramesh-2022-22] أ ^ب ^ت ^ث ^ج ^ح Ramesh, Aditya; Dhariwal, Prafulla; Nichol, Alex; Chu, Casey; Chen, Mark (12 أبريل 2022). "Hierarchical Text-Conditional Image Generation with CLIP Latents". arXiv:2204.06125 [cs.CV].

[Heaven-2021-23] أ ^ب ^ت ^ث Heaven, Will Douglas (5 يناير 2021). "This avocado armchair could be the future of AI". MIT Technology Review. Archived from the original on 5 يناير 2021. Retrieved 5 يناير 2021.

[24] (2021-07-01) "Learning Transferable Visual Models From Natural Language Supervision" in Proceedings of the 38th International Conference on Machine Learning.: 8748–8763, PMLR.

[boing-25] Dunn, Thom (10 فبراير 2021). "This AI neural network transforms text captions into art, like a jellyfish Pikachu". BoingBoing. Archived from the original on 22 فبراير 2021. Retrieved 2 مارس 2021.

[extreme-26] أ ^ب Whitwam, Ryan (6 يناير 2021). "OpenAI's 'DALL-E' Generates Images From Text Descriptions". ExtremeTech. Archived from the original on 28 يناير 2021. Retrieved 2 مارس 2021.

[engadget-27] أ ^ب Dent, Steve (6 يناير 2021). "OpenAI's DALL-E app generates images from just a description". Engadget. Archived from the original on 27 يناير 2021. Retrieved 2 مارس 2021.

[Marcus-2022-28] أ ^ب Marcus, Gary; Davis, Ernest; Aaronson, Scott (2 مايو 2022). "A very preliminary analysis of DALL-E 2". arXiv:2204.13807 [cs.CV].

[cnbc-29] أ ^ب Shead, Sam (8 يناير 2021). "Why everyone is talking about an image generator released by an Elon Musk-backed A.I. lab". CNBC. Archived from the original on 16 يوليو 2022. Retrieved 2 مارس 2021.

[bbc-30] أ ^ب Wakefield, Jane (6 يناير 2021). "AI draws dog-walking baby radish in a tutu". British Broadcasting Corporation. Archived from the original on 2 مارس 2021. Retrieved 3 مارس 2021.

[dale-31] Markowitz, Dale (10 يناير 2021). "Here's how OpenAI's magical DALL-E image generator works". TheNextWeb. Archived from the original on 23 فبراير 2021. Retrieved 2 مارس 2021.

[OpenAI-2021-32] "DALL·E: Creating Images from Text". OpenAI (in الإنجليزية). 5 يناير 2021. Archived from the original on 27 مارس 2021. Retrieved 13 أغسطس 2022.

[Edwards-2023-33] Edwards, Benj (20 سبتمبر 2023). "OpenAI's new AI image generator pushes the limits in detail and prompt fidelity". Ars Technica (in الإنجليزية الأمريكية). Archived from the original on 21 سبتمبر 2023. Retrieved 21 سبتمبر 2023.

[Coldewey-2022-34] Coldewey, Devin (6 أبريل 2022). "New OpenAI tool draws anything, bigger and better than ever". TechCrunch (in الإنجليزية الأمريكية). Archived from the original on 6 مايو 2023. Retrieved 26 نوفمبر 2022.

[OpenAI-2022-35] "DALL·E: Introducing Outpainting". OpenAI (in الإنجليزية). 31 أغسطس 2022. Archived from the original on 26 نوفمبر 2022. Retrieved 26 نوفمبر 2022.

[Saharia-2022-36] Saharia, Chitwan; Chan, William; Saxena, Saurabh; et al. (23 مايو 2022). "Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding". arXiv:2205.11487 [cs.CV].

[Marcus-2022a-37] Marcus, Gary (28 مايو 2022). "Horse rides astronaut". The Road to AI We Can Trust. Archived from the original on 19 يونيو 2022. Retrieved 18 يونيو 2022.

[Strickland-2022-38] أ ^ب Strickland, Eliza (14 يوليو 2022). "DALL-E 2's Failures Are the Most Interesting Thing About It". IEEE Spectrum (in الإنجليزية). Archived from the original on 15 يوليو 2022. Retrieved 16 أغسطس 2022.

[OpenAI-2022a-39] أ ^ب "DALL·E 2 Pre-Training Mitigations". OpenAI (in الإنجليزية). 28 يونيو 2022. Archived from the original on 19 يوليو 2022. Retrieved 18 يوليو 2022.

[Vincent-2022-40] James Vincent (29 سبتمبر 2022). "OpenAI's image generator DALL-E is available for anyone to use immediately". The Verge. Archived from the original on 29 سبتمبر 2022. Retrieved 29 سبتمبر 2022.

[Taylor-41] Taylor, Josh (18 يونيو 2022). "From Trump Nevermind babies to deep fakes: DALL-E and the ethics of AI art". The Guardian. Archived from the original on 6 يوليو 2022. Retrieved 2 أغسطس 2022.

[Knight-2022-42] Knight, Will (13 يوليو 2022). "When AI Makes Art, Humans Supply the Creative Spark". Wired. Archived from the original on 2 أغسطس 2022. Retrieved 2 أغسطس 2022.

[vice-43] Rose, Janus (24 يونيو 2022). "DALL-E Is Now Generating Realistic Faces of Fake People". Vice. Archived from the original on 30 يوليو 2022. Retrieved 2 أغسطس 2022.

[docs-44] أ ^ب OpenAI (19 يونيو 2022). "DALL·E 2 Preview – Risks and Limitations". GitHub. Archived from the original on 2 أغسطس 2022. Retrieved 2 أغسطس 2022.

[Lane-2022-45] Lane, Laura (1 يوليو 2022). "DALL-E, Make Me Another Picasso, Please". The New Yorker. Archived from the original on 2 أغسطس 2022. Retrieved 2 أغسطس 2022.

[Goldman-2022-46] Goldman, Sharon (26 يوليو 2022). "OpenAI: Will DALL-E 2 kill creative careers?". Archived from the original on 15 أغسطس 2022. Retrieved 16 أغسطس 2022.

[Blain-2022-47] Blain, Loz (29 يوليو 2022). "DALL-E 2: A dream tool and an existential threat to visual artists". Archived from the original on 17 أغسطس 2022. Retrieved 16 أغسطس 2022.

[48] Biddle, Sam (10 أبريل 2024). "Microsoft Pitched OpenAI's DALL-E as Battlefield Tool for U.S. Military". The Intercept.

[49] Biddle, Sam (12 يناير 2024). "OpenAI Quietly Deletes Ban on Using ChatGPT for "Military and Warfare"". The Intercept.

[input-50] Kasana, Mehreen (7 يناير 2021). "This AI turns text into surreal, suggestion-driven art". Input. Archived from the original on 29 يناير 2021. Retrieved 2 مارس 2021.

[nbc-51] Ehrenkranz, Melanie (27 يناير 2021). "Here's DALL-E: An algorithm learned to draw anything you tell it". NBC News. Archived from the original on 20 فبراير 2021. Retrieved 2 مارس 2021.

[nature-52] Stove, Emma (5 فبراير 2021). "Tardigrade circus and a tree of life — January's best science images". Nature. Archived from the original on 8 مارس 2021. Retrieved 2 مارس 2021.

[Knight-2021-53] Knight, Will (26 يناير 2021). "This AI Could Go From 'Art' to Steering a Self-Driving Car". Wired. Archived from the original on 21 فبراير 2021. Retrieved 2 مارس 2021.

[cnn-54] Metz, Rachel (2 فبراير 2021). "A radish in a tutu walking a dog? This AI can draw it really well". CNN. Archived from the original on 16 يوليو 2022. Retrieved 2 مارس 2021.

[Leswing-2022-55] Leswing, Kif (8 أكتوبر 2022). "Why Silicon Valley is so excited about awkward drawings done by artificial intelligence". CNBC (in الإنجليزية). Archived from the original on 29 يوليو 2023. Retrieved 1 ديسمبر 2022.

[Etherington-2019-56] Etherington, Darrell (22 يوليو 2019). "Microsoft invests $1 billion in OpenAI in new multiyear partnership". TechCrunch (in الإنجليزية الأمريكية). Archived from the original on 22 يوليو 2019. Retrieved 21 سبتمبر 2023.

[Fortune-57] "OpenAI's first VC backer weighs in on generative A.I." Fortune (in الإنجليزية). Archived from the original on 23 أكتوبر 2023. Retrieved 21 سبتمبر 2023.

[Metz-2023-58] Metz, Cade; Weise, Karen (23 يناير 2023). "Microsoft to Invest $10 Billion in OpenAI, the Creator of ChatGPT". The New York Times (in الإنجليزية الأمريكية). ISSN 0362-4331. Archived from the original on 21 سبتمبر 2023. Retrieved 21 سبتمبر 2023.

[Rest_of_World-2022-59] "AI-generated art sparks furious backlash from Japan's anime community". Rest of World (in الإنجليزية الأمريكية). 27 أكتوبر 2022. Archived from the original on 31 ديسمبر 2022. Retrieved 3 يناير 2023.

[Roose-2022-60] Roose, Kevin (2 سبتمبر 2022). "An A.I.-Generated Picture Won an Art Prize. Artists Aren't Happy". The New York Times (in الإنجليزية الأمريكية). ISSN 0362-4331. Archived from the original on 31 مايو 2023. Retrieved 3 يناير 2023.

[Daws-2022-61] Daws, Ryan (15 ديسمبر 2022). "ArtStation backlash increases following AI art protest response". AI News (in الإنجليزية البريطانية). Archived from the original on 3 يناير 2023. Retrieved 3 يناير 2023.

[Corden-2023-62] أ ^ب Corden, Jez (8 أكتوبر 2023). "Bing Dall-E 3 image creation was great for a few days, but now Microsoft has predictably lobotomized it". Windows Central. Archived from the original on 10 أكتوبر 2023. Retrieved 11 أكتوبر 2023.

[TechRadar-63] أ ^ب Allan, Darren (9 أكتوبر 2023). "Microsoft reins in Bing AI's Image Creator – and the results don't make much sense". TechRadar. Archived from the original on 10 أكتوبر 2023. Retrieved 11 أكتوبر 2023.

[Mor-2022-64] Sahar Mor, Stripe (16 أبريل 2022). "How DALL-E 2 could solve major computer vision challenges". VentureBeat. Archived from the original on 24 مايو 2022. Retrieved 15 يونيو 2022.

[Jina-2022-65] "jina-ai/dalle-flow". Jina AI. 17 يونيو 2022. Archived from the original on 17 يونيو 2022. Retrieved 17 يونيو 2022.

[CNETmini-66] Carson, Erin (14 يونيو 2022). "Everything to Know About Dall-E Mini, the Mind-Bending AI Art Creator". CNET. Archived from the original on 15 يونيو 2022. Retrieved 15 يونيو 2022.

[DailyDotmini-67] Schroeder, Audra (9 يونيو 2022). "AI program DALL-E mini prompts some truly cursed images". Daily Dot. Archived from the original on 10 يونيو 2022. Retrieved 15 يونيو 2022.

[Polygonmini-68] Diaz, Ana (15 يونيو 2022). "People are using DALL-E mini to make meme abominations — like pug Pikachu". Polygon. Archived from the original on 15 يونيو 2022. Retrieved 15 يونيو 2022.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]

[58]

[59]

[60]

[61]

[62]

[63]

[64]

[65]

[66]

[67]

[68]

أظهر v t e OpenAI
Products	ChatGPT DALL-E GitHub Copilot OpenAI Five Triton
Foundation models	OpenAI Codex GPT-2 GPT-3 GPT-4
Related	AI Dungeon Auto-GPT "Deep Learning" LangChain Microsoft 365 Copilot Microsoft Bing
Category Commons