Generative artificial intelligence (generative AI, GenAI, or GAI) is

artificial intelligence Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by animals and humans. Example tasks in which this is done include speech re ...

capable of generating text, images or other data using generative models, often in response to prompts. Generative AI models

learn Learning is the process of acquiring new understanding, knowledge, behaviors, skills, values, attitudes, and preferences. The ability to learn is possessed by humans, animals, and some machines; there is also evidence for some kind of learnin ...

the patterns and structure of their input training data and then generate new data that has similar characteristics. Improvements in

transformer A transformer is a passive component that transfers electrical energy from one electrical circuit to another circuit, or multiple circuits. A varying current in any coil of the transformer produces a varying magnetic flux in the transformer' ...

-based

deep Deep or The Deep may refer to: Places United States * Deep Creek (Appomattox River tributary), Virginia * Deep Creek (Great Salt Lake), Idaho and Utah * Deep Creek (Mahantango Creek tributary), Pennsylvania * Deep Creek (Mojave River tributary), C ...

neural networks A neural network is a network or circuit of biological neurons, or, in a modern sense, an artificial neural network, composed of artificial neurons or nodes. Thus, a neural network is either a biological neural network, made up of biological ...

enabled an

AI boom The AI boom, or AI spring, is the ongoing period of rapid Progress in artificial intelligence, progress in the field of artificial intelligence. Prominent examples include AlphaFold, protein folding prediction and Generative artificial intellig ...

of generative AI systems in the early 2020s. These include

large language model A large language model (LLM) is a language model consisting of a neural network with many parameters (typically billions of weights or more), trained on large quantities of unlabelled text using self-supervised learning. LLMs emerged around 2018 an ...

(LLM) chatbots such as

ChatGPT ChatGPT (Generative Pre-trained Transformer) is a chatbot launched by OpenAI in November 2022. It is built on top of OpenAI's GPT-3 family of large language models, and is fine-tuned (an approach to transfer learning) with both supervised and ...

Copilot In aviation, the first officer (FO), also called co-pilot, is the pilot who is second-in-command of the aircraft to the captain, who is the legal commander. In the event of incapacitation of the captain, the first officer will assume command of ...

Bard In Celtic cultures, a bard is a professional story teller, verse-maker, music composer, oral historian and genealogist, employed by a patron (such as a monarch or chieftain) to commemorate one or more of the patron's ancestors and to praise t ...

, and LLaMA, and

text-to-image A text-to-image model is a machine learning model which takes as input a natural language description and produces an image matching that description. Such models began to be developed in the mid-2010s, as a result of advances in deep neural netwo ...

artificial intelligence art systems such as Stable Diffusion, Midjourney, and DALL-E. Companies such as OpenAI, Anthropic,

Microsoft Microsoft Corporation is an American multinational technology corporation producing computer software, consumer electronics, personal computers, and related services headquartered at the Microsoft Redmond campus located in Redmond, Washing ...

Google Google LLC () is an American multinational technology company focusing on search engine technology, online advertising, cloud computing, computer software, quantum computing, e-commerce, artificial intelligence, and consumer electronics. ...

, and Baidu as well as numerous smaller firms have developed generative AI models. Generative AI has uses across a wide range of industries, including software development, healthcare, finance, entertainment, customer service, sales and marketing, art, writing, fashion, and product design. However, concerns have been raised about the potential misuse of generative AI such as

cybercrime A cybercrime is a crime that involves a computer or a computer network.Moore, R. (2005) "Cyber crime: Investigating High-Technology Computer Crime," Cleveland, Mississippi: Anderson Publishing. The computer may have been used in committing the ...

, and the use of fake news or

deepfake Deepfakes (a portmanteau of "deep learning" and "fake") are synthetic media in which a person in an existing image or video is replaced with someone else's likeness. While the act of creating fake content is not new, deepfakes leverage powerful ...

s to deceive or manipulate people or take a mass amount of jobs from real humans.

History

The academic discipline of artificial intelligence was established at a research

workshop Beginning with the Industrial Revolution era, a workshop may be a room, rooms or building which provides both the area and tools (or machinery) that may be required for the manufacture or repair of manufactured goods. Workshops were the only ...

held at

Dartmouth College Dartmouth College (; ) is a private research university in Hanover, New Hampshire. Established in 1769 by Eleazar Wheelock, it is one of the nine colonial colleges chartered before the American Revolution. Although founded to educate Native A ...

in 1956 and has experienced several waves of advancement and optimism in the decades since. Since its inception, researchers in the field have raised philosophical and ethical arguments about the nature of the human mind and the consequences of creating artificial beings with human-like intelligence; these issues have previously been explored by

myth Myth is a folklore genre consisting of Narrative, narratives that play a fundamental role in a society, such as foundational tales or Origin myth, origin myths. Since "myth" is widely used to imply that a story is not Objectivity (philosophy), ...

fiction Fiction is any creative work, chiefly any narrative work, portraying individuals, events, or places that are imaginary, or in ways that are imaginary. Fictional portrayals are thus inconsistent with history, fact, or plausibility. In a traditi ...

and

philosophy Philosophy (from , ) is the systematized study of general and fundamental questions, such as those about existence, reason, knowledge, values, mind, and language. Such questions are often posed as problems to be studied or resolved. Some ...

since antiquity. The concept of automated art dates back at least to the automata of ancient Greek civilization, where inventors such as Daedalus and Hero of Alexandria were described as having designed machines capable of writing text, generating sounds, and playing music. The tradition of creative automatons has flourished throughout history, exemplified by Maillardet's automaton created in the early 1800s. Artificial Intelligence is an idea that has been captivating society since the mid-20th century. It began with science fiction familiarizing the world with the concept but the idea wasn't fully seen in the scientific manner until Alan Turing, a polymath, was curious about the feasibility of the concept. Turing's groundbreaking 1950 paper, " Computing Machinery and Intelligence," posed fundamental questions about machine reasoning similar to human intelligence, significantly contributing to the conceptual groundwork of AI. The development of AI was not very rapid at first because of the high costs and the fact that computers were not able to store commands. This changed during the 1956 Dartmouth Summer Research Project on AI where there was an inspiring call for AI research which led it to be a landmark event as it set the precedent for two decades of rapid advancements in the field. Since the founding of AI in the 1950s, artists and researchers have used artificial intelligence to create artistic works. By the early 1970s, Harold Cohen was creating and exhibiting generative AI works created by

AARON According to Abrahamic religions, Aaron ''′aharon'', ar, هارون, Hārūn, Greek (Septuagint): Ἀαρών; often called Aaron the priest ()., group="note" ( or ; ''’Ahărōn'') was a prophet, a high priest, and the elder brother of ...

, the computer program Cohen created to generate paintings.

Markov chain A Markov chain or Markov process is a stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally, this may be thought of as, "What happe ...

s have long been used to model natural languages since their development by Russian mathematician

Andrey Markov Andrey Andreyevich Markov, first name also spelled "Andrei", in older works also spelled Markoff) (14 June 1856 – 20 July 1922) was a Russian mathematician best known for his work on stochastic processes. A primary subject of his research lat ...

in the early 20th century. Markov published his first paper on the topic in 1906, and analyzed the pattern of vowels and consonants in the novel '' Eugeny Onegin'' using Markov chains. Once a Markov chain is learned on a text corpus, it can then be used as a probabilistic text generator. The field of machine learning often uses

statistical models A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of sample data (and similar data from a larger population). A statistical model represents, often in considerably idealized form, ...

, including generative models, to model and predict data. Beginning in the late 2000s, the emergence of

deep learning Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. De ...

drove progress and research in

image classification Computer vision is an interdisciplinary scientific field that deals with how computers can gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to understand and automate tasks that the hum ...

, speech recognition,

natural language processing Natural language processing (NLP) is an interdisciplinary subfield of linguistics, computer science, and artificial intelligence concerned with the interactions between computers and human language, in particular how to program computers to pro ...

and other tasks.

Neural network A neural network is a network or circuit of biological neurons, or, in a modern sense, an artificial neural network, composed of artificial neurons or nodes. Thus, a neural network is either a biological neural network, made up of biological ...

s in this era were typically trained as discriminative models, due to the difficulty of generative modeling. In 2014, advancements such as the variational autoencoder and generative adversarial network produced the first practical deep neural networks capable of learning generative models, as opposed to discriminative ones, for complex data such as images. These deep generative models were the first to output not only class labels for images but also entire images. In 2017, the Transformer network enabled advancements in generative models compared to older Long-Short Term Memory models, leading to the first generative pre-trained transformer (GPT), known as GPT-1, in 2018. This was followed in 2019 by GPT-2 which demonstrated the ability to generalize unsupervised to many different tasks as a Foundation model. In 2021, the release of DALL-E, a transformer-based pixel generative model, followed by Midjourney and Stable Diffusion marked the emergence of practical high-quality artificial intelligence art from natural language prompts. In March 2023, GPT-4 was released. A team from Microsoft Research argued that "it could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system". Other scholars have disputed that GPT-4 reaches this threshold, calling generative AI "still far from reaching the benchmark of ‘general human intelligence’" as of 2023.

Modalities

A generative AI system is constructed by applying

unsupervised ''Unsupervised'' is an American adult animated sitcom created by David Hornsby, Rob Rosell, and Scott Marder which ran on FX from January 19 to December 20, 2012. The show was created, and for the most part, written by David Hornsby, Scott Marder ...

or self-supervised machine learning to a data set. The capabilities of a generative AI system depend on the modality or type of the data set used. Generative AI can be either ''unimodal'' or ''multimodal''; unimodal systems take only one type of input, whereas multimodal systems can take more than one type of input. For example, one version of OpenAI's GPT-4 accepts both text and image inputs.

Text

Generative AI systems trained on words or word tokens include GPT-3, LaMDA, LLaMA,

BLOOM Bloom or blooming may refer to: Science and technology Biology * Bloom, one or more flowers on a flowering plant * Algal bloom, a rapid increase or accumulation in the population of algae in an aquatic system * Jellyfish bloom, a collective n ...

, GPT-4, Gemini and others (see List of large language models). They are capable of

, machine translation, and natural language generation and can be used as foundation models for other tasks. Data sets include

BookCorpus BookCorpus (also sometimes referred to as the Toronto Book Corpus) is a dataset consisting of the text of around 7,000 self-published books scraped from the indie ebook distribution website Smashwords. It was the main corpus used to train the i ...

, Wikipedia, and others (see

List of text corpora Text corpora (singular: ''text corpus'') are large and structured sets of texts, which have been systematically collected. Text corpora are used by corpus linguists and within other branches of linguistics for statistical analysis, hypothesis testi ...

Code

In addition to

natural language In neuropsychology, linguistics, and philosophy of language, a natural language or ordinary language is any language that has evolved naturally in humans through use and repetition without conscious planning or premeditation. Natural languages ...

text, large language models can be trained on programming language text, allowing them to generate source code for new computer programs. Examples include OpenAI Codex.

Images

Producing high-quality visual art is a prominent application of generative AI. Generative AI systems trained on sets of images with text captions include

Imagen ''Imagen'' is a Spanish language monthly women's fashion magazine published in San Juan, Puerto Rico. Profile ''Imagen'' was founded in 1986. The magazine is printed monthly by Casiano Communications. The headquarters is in San Juan. It is Puer ...

, DALL-E, Midjourney, Adobe Firefly, Stable Diffusion and others (see Artificial intelligence art, Generative art, and Synthetic media). They are commonly used for

generation and neural style transfer. Datasets include LAION-5B and others (See List of datasets in computer vision and image processing).

Audio

Generative AI can also be trained extensively on audio clips to produce natural-sounding speech synthesis and text-to-speech capabilities, exemplified by ElevenLabs' context-aware synthesis tools or Meta Platform's Voicebox. Generative AI systems such as MusicLM and MusicGen can also be trained on the audio waveforms of recorded music along with text annotations, in order to generate new musical samples based on text descriptions such as ''a calming violin melody backed by a distorted guitar riff''.

Video

Generative AI trained on annotated video can generate temporally-coherent video clips. Examples include Gen-1 and Gen-2 by Runway and Make-A-Video by Meta Platforms.

Molecules

Generative AI systems can be trained on sequences of

amino acids Amino acids are organic compounds that contain both amino and carboxylic acid functional groups. Although hundreds of amino acids exist in nature, by far the most important are the alpha-amino acids, which comprise proteins. Only 22 alpha am ...

or molecular representations such as SMILES representing DNA or proteins. These systems, such as AlphaFold, are used for protein structure prediction and

drug discovery In the fields of medicine, biotechnology and pharmacology, drug discovery is the process by which new candidate medications are discovered. Historically, drugs were discovered by identifying the active ingredient from traditional remedies or by ...

. Datasets include various biological datasets.

Robotics

Generative AI can also be trained on the motions of a robotic system to generate new trajectories for motion planning or navigation. For example, UniPi from Google Research uses prompts like ''"pick up blue bowl"'' or ''"wipe plate with yellow sponge"'' to control movements of a robot arm. Multimodal "vision-language-action" models such as Google's RT-2 can perform rudimentary reasoning in response to user prompts and visual input, such as picking up a toy dinosaur when given the prompt ''pick up the extinct animal'' at a table filled with toy animals and other objects.

Planning

The terms generative AI planning or generative planning were used in the 1980s and 1990s to refer to

AI planning AI is artificial intelligence, intellectual ability in machines and robots. Ai, AI or A.I. may also refer to: Animals * Ai (chimpanzee), an individual experimental subject in Japan * Ai (sloth) or the pale-throated sloth, northern Amazonian ma ...

systems, especially computer-aided process planning, used to generate sequences of actions to reach a specified goal. Generative AI planning systems used

symbolic AI In artificial intelligence, symbolic artificial intelligence is the term for the collection of all methods in artificial intelligence research that are based on high-level symbolic (human-readable) representations of problems, logic and search. S ...

methods such as state space search and constraint satisfaction and were a "relatively mature" technology by the early 1990s. They were used to generate crisis action plans for military use, process plans for manufacturing and decision plans such as in prototype autonomous spacecraft.

Data

Generative AI systems are often used to develop

synthetic data Synthetic data is information that's artificially generated rather than produced by real-world events. Typically created using algorithms, synthetic data can be deployed to validate mathematical models and to train machine learning models. Data g ...

as an alternative to data produced by real-world events. Such data can be deployed to validate mathematical models and to train machine learning models while preserving user privacy, including for structured data. The approach is not limited to text generation; image generation has been employed to train computer vision models.

Computer aided design

Artificially intelligent

computer-aided design Computer-aided design (CAD) is the use of computers (or ) to aid in the creation, modification, analysis, or optimization of a design. This software is used to increase the productivity of the designer, improve the quality of design, improve c ...

(CAD) can use text-to-3D, image-to-3D, and video-to-3D to automate

3D modeling In 3D computer graphics, 3D modeling is the process of developing a mathematical coordinate-based representation of any surface of an object (inanimate or living) in three dimensions via specialized software by manipulating edges, vertices, an ...

. Ai CAD libraries could also be developed using linked open data of

schematics A schematic, or schematic diagram, is a designed representation of the elements of a system using abstract, graphic symbols rather than realistic pictures. A schematic usually omits all details that are not relevant to the key information the sc ...

and

diagram A diagram is a symbolic representation of information using visualization techniques. Diagrams have been used since prehistoric times on walls of caves, but became more prevalent during the Enlightenment. Sometimes, the technique uses a three- ...

s. Ai CAD assistants are used as tools to help streamline workflow.

Software and hardware

Generative AI models are used to power chatbot products such as

programming tools A programming tool or software development tool is a computer program that software developers use to create, debug, maintain, or otherwise support other programs and applications. The term usually refers to relatively simple programs, that can ...

such as

GitHub Copilot GitHub Copilot is a cloud-based artificial intelligence tool developed by GitHub and OpenAI to assist users of Visual Studio Code, Visual Studio, Neovim, and JetBrains integrated development environments (IDEs) by autocompleting code. Currently ...

products such as Midjourney, and text-to-video products such as Runway Gen-2. Generative AI features have been integrated into a variety of existing commercially available products such as Microsoft Office,

Google Photos Google Photos is a photo sharing and storage service developed by Google. It was announced in May 2015 and spun off from Google+, the company's former social network. As of June 1, 2021, in its free tier, any newly uploaded photo and video c ...

, and

Adobe Photoshop Adobe Photoshop is a raster graphics editor developed and published by Adobe Inc. for Microsoft Windows, Windows and macOS. It was originally created in 1988 by Thomas Knoll, Thomas and John Knoll. Since then, the software has become the indu ...

. Many generative AI models are also available as open-source software, including Stable Diffusion and the LLaMA language model. Smaller generative AI models with up to a few billion parameters can run on

smartphones A smartphone is a portable computer device that combines mobile telephone and computing functions into one unit. They are distinguished from feature phones by their stronger hardware capabilities and extensive mobile operating systems, which ...

, embedded devices, and personal computers. For example, LLaMA-7B (a version with 7 billion parameters) can run on a Raspberry Pi 4 and one version of Stable Diffusion can run on an iPhone 11. Larger models with tens of billions of parameters can run on

laptop A laptop, laptop computer, or notebook computer is a small, portable personal computer (PC) with a screen and alphanumeric keyboard. Laptops typically have a clam shell form factor with the screen mounted on the inside of the upper li ...

or desktop computers. To achieve an acceptable speed, models of this size may require accelerators such as the GPU chips produced by NVIDIA and AMD or the Neural Engine included in Apple silicon products. For example, the 65 billion parameter version of LLaMA can be configured to run on a desktop PC. The advantages of running generative AI locally include protection of

privacy Privacy (, ) is the ability of an individual or group to seclude themselves or information about themselves, and thereby express themselves selectively. The domain of privacy partially overlaps with security, which can include the concepts of a ...

and intellectual property, and avoidance of

rate limiting In computer networks, rate limiting is used to control the rate of requests sent or received by a network interface controller. It can be used to prevent DoS attacks and limit web scraping. Research indicates flooding rates for one zombie machine ...

and censorship. The subreddit r/LocalLLaMA in particular focuses on using consumer-grade gaming

graphics card A graphics card (also called a video card, display card, graphics adapter, VGA card/VGA, video adapter, display adapter, or mistakenly GPU) is an expansion card which generates a feed of output images to a display device, such as a computer moni ...

s through such techniques as compression. That forum is one of only two sources Andrej Karpathy trusts for language model benchmarks. Yann LeCun has advocated open-source models for their value to vertical applications and for improving

AI safety AI is artificial intelligence, intellectual ability in machines and robots. Ai, AI or A.I. may also refer to: Animals * Ai (chimpanzee), an individual experimental subject in Japan * Ai (sloth) or the pale-throated sloth, northern Amazonian mam ...

. Language models with hundreds of billions of parameters, such as GPT-4 or PaLM, typically run on

datacenter A data center (American English) or data centre (British English)See spelling differences. is a building, a dedicated space within a building, or a group of buildings used to house computer systems and associated components, such as telecommunic ...

computers equipped with arrays of GPUs (such as NVIDIA's H100) or AI accelerator chips (such as Google's TPU). These very large models are typically accessed as cloud services over the Internet. In 2022, the

United States New Export Controls on Advanced Computing and Semiconductors to China Effective October 7, 2022, the United States of America implemented new export controls targeting the People's Republic of China's (PRC) ability to access and develop advanced computing and semiconductor manufacturing items. The new export control ...

imposed restrictions on exports to China of GPU and AI accelerator chips used for generative AI. Chips such as the NVIDIA A800 and the

Biren Technology Shanghai Biren Intelligent Technology Co. () is a Chinese fabless semiconductor design company. The company was founded in 2019 by Lingjie Xu and others, all of whom were previously employed at NVIDIA or Alibaba. Biren has advertised two gene ...

BR104 were developed to meet the requirements of the sanctions. There is free software on the market capable of recognizing text generated by generative artificial intelligence (such as GPTZero), as well as images, audio or video coming from it. Despite claims of accuracy, both free and paid AI text detectors have frequently produced false positives, mistakenly accusing students of submitting AI-generated work.

Law and regulation

In the United States, a group of companies including OpenAI, Alphabet, and Meta signed a voluntary agreement with the White House in July 2023 to watermark AI-generated content. In October 2023, Executive Order 14110 applied the Defense Production Act to require all US companies to report information to the federal government when training large AI models. In the European Union, the proposed

Artificial Intelligence Act The Artificial Intelligence Act (AI Act) is a proposed regulation by the European Commission which aims to introduce a common regulatory and legal framework for artificial intelligence. Its scope encompasses all sectors (except for military), and t ...

includes requirements to disclose copyrighted material used to train generative AI systems, and to label any AI-generated output as such. In China, the Interim Measures for the Management of Generative AI Services introduced by the Cyberspace Administration of China regulates any public-facing generative AI. It includes requirements to watermark generated images or videos, regulations on training data and label quality, restrictions on personal data collection, and a guideline that generative AI must "adhere to socialist core values".

Copyright

Training with copyrighted content

Generative AI systems such as

and Midjourney are trained on large, publicly available datasets that include copyrighted works. AI developers have argued that such training is protected under fair use, while copyright holders have argued that it infringes their rights. Proponents of fair use training have argued that it is a

transformative use In United States copyright law, transformative use or transformation is a type of fair use that builds on a copyrighted work in a different manner or for a different purpose from the original, and thus does not infringe its holder's copyright. Tr ...

and does not involve making copies of copyrighted works available to the public. Critics have argued that image generators such as Midjourney can create nearly-identical copies of some copyrighted images, and that generative AI programs compete with the content they are trained on. As of 2024, several lawsuits related to the use of copyrighted material in training are ongoing.

Getty Images Getty Images Holdings, Inc. is an American visual media company and is a supplier of stock images, editorial photography, video and music for business and consumers, with a library of over 477 million assets. It targets three markets— creative ...

has sued Stability AI over the use of its images to train Stable diffusion. Both the

Authors Guild The Authors Guild is America's oldest and largest professional organization for writers and provides advocacy on issues of free expression and copyright protection. Since its founding in 1912 as the Authors League of America, it has counted among ...

and The New York Times have sued

and OpenAI over the use of their works to train

Copyright of AI-generated content

A separate question is whether AI-generated works can qualify for copyright protection. The

United States Copyright Office The United States Copyright Office (USCO), a part of the Library of Congress, is a United States government body that maintains records of copyright registration, including a copyright catalog. It is used by copyright title searchers who are ...

has ruled that works created by artificial intelligence without any human input cannot be copyrighted, because they lack human authorship. However, the office has also begun taking public input to determine if these rules need to be refined for generative AI.

Concerns

The development of generative AI has raised concerns from governments, businesses, and individuals, resulting in protests, legal actions, calls to pause AI experiments, and actions by multiple governments. In a July 2023 briefing of the United Nations Security Council, Secretary-General

António Guterres António Manuel de Oliveira Guterres ( , ; born 30 April 1949) is a Portuguese politician and diplomat. Since 2017, he has served as secretary-general of the United Nations, the ninth person to hold this title. A member of the Portuguese Socia ...

stated "Generative AI has enormous potential for good and evil at scale", that AI may "turbocharge global development" and contribute between $10 and $15 trillion to the global economy by 2030, but that its malicious use "could cause horrific levels of death and destruction, widespread trauma, and deep psychological damage on an unimaginable scale".

Job losses

From the early days of the development of AI, there have been arguments put forward by ELIZA creator Joseph Weizenbaum and others about whether tasks that can be done by computers actually should be done by them, given the difference between computers and humans, and between quantitative calculations and qualitative, value-based judgements. In April 2023, it was reported that image generation AI has resulted in 70% of the jobs for video game illustrators in China being lost. In July 2023, developments in generative AI contributed to the

2023 Hollywood labor disputes From May 2 to November 9, 2023, a series of long labor disputes within the film and television industries of the United States took place, mainly focused on the strikes of the Writers Guild of America and SAG-AFTRA. It was the second time two H ...

. Fran Drescher, president of the Screen Actors Guild, declared that "artificial intelligence poses an

existential threat A global catastrophic risk or a doomsday scenario is a hypothetical future event that could damage human well-being on a global scale, even endangering or destroying modern civilization. An event that could cause human extinction or permanen ...

to creative professions" during the

2023 SAG-AFTRA strike From July 14 to November 9, 2023, the American actors' union SAG-AFTRA (Screen Actors Guild – American Federation of Television and Radio Artists) was on strike over a labor dispute with the Alliance of Motion Picture and Television Producer ...

. Voice generation AI has been seen as a potential challenge to the voice acting sector. The intersection of AI and employment concerns among underrepresented groups globally remains a critical facet. While AI promises efficiency enhancements and skill acquisition, concerns about job displacement and biased recruiting processes persist among these groups, as outlined in surveys by Fast Company. To leverage AI for a more equitable society, proactive steps encompass mitigating biases, advocating transparency, respecting privacy and consent, and embracing diverse teams and ethical considerations. Strategies involve redirecting policy emphasis on regulation, inclusive design, and education's potential for personalized teaching to maximize benefits while minimizing harms.

Racial and Gender Bias

Generative AI models can reflect and amplify any

cultural bias Cultural bias is the phenomenon of interpreting and judging phenomena by standards inherent to one's own culture. The phenomenon is sometimes considered a problem central to social and human sciences, such as economics, psychology, anthropology, ...

present in the underlying data. For example, a language model might assume that doctors and judges are male, and that secretaries or nurses are female, if those biases are common in the training data. Similarly, an image model prompted with the text "a photo of a CEO" might disproportionately generate images of white male CEOs, if trained on a racially biased data set. A number of methods for mitigating bias have been attempted, such as altering input prompts and reweighting training data.

Deepfakes

Deepfakes (a portmanteau of "deep learning" and "fake") are AI-generated media that take a person in an existing image or video and replace them with someone else's likeness using artificial neural networks. Deepfakes have garnered widespread attention and concerns for their uses in deepfake celebrity pornographic videos, revenge porn, fake news,

hoax A hoax is a widely publicized falsehood so fashioned as to invite reflexive, unthinking acceptance by the greatest number of people of the most varied social identities and of the highest possible social pretensions to gull its victims into pu ...

es, health disinformation, and

financial fraud In law, fraud is intent (law), intentional deception to secure unfair or unlawful gain, or to deprive a victim of a legal right. Fraud can violate Civil law (common law), civil law (e.g., a fraud victim may sue the fraud perpetrator to avoid t ...

. This has elicited responses from both industry and government to detect and limit their use.

Audio deepfakes

Instances of users abusing software to generate controversial statements in the vocal style of celebrities, public officials, and other famous individuals have raised ethical concerns over voice generation AI. In response, companies such as ElevenLabs have stated that they would work on mitigating potential abuse through safeguards and

identity verification An identity verification service is used by businesses to ensure that users or customers provide information that is associated with the identity of a real person. The service may verify the authenticity of physical identity documents such as a driv ...

. Concerns and fandom have spawned from AI generated music. The same software used to clone voices has been used on famous musicians' voices to create songs that mimic their voices, gaining both tremendous popularity and criticism. Similar techniques have also been used to create improved quality or full-length versions of songs that have been leaked or have yet to be released. Generative AI has also been used to create new digital artist personalities, with some of these receiving enough attention to receive record deals at major labels. The developers of these virtual artists have also faced their fair share of criticism for their personified programs, including backlash for "dehumanizing" an artform, and also creating artists which create unrealistic or immoral appeals to their audiences.

Cybercrime

Generative AI's ability to create realistic fake content has been exploited in numerous types of cybercrime, including phishing scams. Deepfake video and audio have been used to create disinformation and fraud. Former Google fraud czar Shuman Ghosemajumder has predicted that while deepfake videos initially created a stir in the media, they would soon become commonplace, and as a result, more dangerous. Additionally, large-language models and other forms of text-generation AI have been at a broad scale to create fake reviews on e-commerce websites to boost ratings. Cybercriminals have created large language models focused on fraud, including WormGPT and FraudGPT. Recent research done in 2023 has revealed that generative AI has weaknesses that can be manipulated by criminals to extract harmful information bypassing ethical safeguards. The study presents example attacks done on ChatGPT including Jailbreaks and

reverse psychology Reverse psychology is a technique involving the assertion of a belief or behavior that is opposite to the one desired, with the expectation that this approach will encourage the subject of the persuasion to do what is actually desired. This techniqu ...

. Additionally, malicious individuals can use ChatGPT for

social engineering Social engineering may refer to: * Social engineering (political science), a means of influencing particular attitudes and social behaviors on a large scale * Social engineering (security), obtaining confidential information by manipulating and/or ...

attacks and phishing attacks, revealing the harmful side of these technologies.

Misuse in journalism

In January 2023, ''Futurism.com'' broke the story that

CNET ''CNET'' (short for "Computer Network") is an American media website that publishes reviews, news, articles, blogs, podcasts, and videos on technology and consumer electronics globally. ''CNET'' originally produced content for radio and televi ...

had been using an undisclosed internal AI tool to write at least 77 of its stories; after the news broke, CNET posted corrections to 41 of the stories. In April 2023, the German tabloid ''

Die Aktuelle ''Die Aktuelle'' is a German language weekly women's magazine published in Essen, Germany. History and profile ''Die Aktuelle'' has been published since 1979. The magazine is part of Funke Mediengruppe. It is published by Gong Verlag on a weekly ...

'' published a fake AI-generated interview with former racing driver Michael Schumacher, who had not made any public appearances since 2013 after sustaining a brain injury in a skiing accident. The story included two possible disclosures: the cover included the line "deceptively real", and the interview included an acknowledgment at the end that it was AI-generated. The editor-in-chief was fired shortly thereafter amid the controversy. Other outlets that have published articles whose content and/or byline have been confirmed or suspected to be created by generative AI models – often with false content, errors, and/or non-disclosure of generative AI use - include NewsBreak, outlets owned by Arena Group ( Sports Illustrated, TheStreet,

Men's Journal ''Men's Journal'' is an American monthly men's lifestyle magazine focused on outdoor recreation and comprising editorials on the outdoors, environmental issues, health and fitness, style and fashion, and gear. It was founded in 1992 by Jann Wenne ...

B&H Photo B&H Photo Video (also known as B&H Photo and B&H and B&H Foto & Electronics Corporation) is an American photo and video equipment retailer founded in 1973, based in Manhattan, New York City. B&H conducts business primarily through online e-com ...

, outlets owned by

Gannett Gannett Co., Inc. () is an American mass media holding company headquartered in McLean, Virginia, in the Washington, D.C., metropolitan area.The Columbus Dispatch,

Reviewed Review is an evaluation of a publication, product, service, company, or other object or idea. An article about or a compilation of reviews may itself be called a review. Review may also refer to: Evaluation processes *Book review, a description ...

MSN MSN (meaning Microsoft Network) is a web portal and related collection of Internet services and apps for Windows and mobile devices, provided by Microsoft and launched on August 24, 1995, alongside the release of Windows 95. The Microsoft Net ...

News Corp News Corporation, stylized as News Corp, is an American mass media and publishing company headquartered in Midtown Manhattan, New York City. The second incarnation of the News Corporation (1980–2013), original News Corporation, it was formed ...

, outlets owned by G/O Media (

Gizmodo ''Gizmodo'' ( ) is a design, technology, science and science fiction website. It was originally launched as part of the Gawker Media network run by Nick Denton, and runs on the Kinja platform. ''Gizmodo'' also includes the subsite ''io9'', whic ...

, Jalopnik,

A.V. Club ''The A.V. Club'' is an American online newspaper and entertainment website featuring reviews, interviews, and other articles that examine films, music, television, books, games, and other elements of pop-culture media. ''The A.V. Club'' was cre ...

), The Irish Times, outlets owned by

Red Ventures Red Ventures is an American media company, which owns and operates brands such as Lonely Planet, CNET, ZDNet, The Points Guy, Healthline and Bankrate. Red Ventures focuses on sites that dispense news, advice, and reviews. The company's corporat ...

(

Bankrate Bankrate, LLC is a consumer financial services company based in New York City. Bankrate.com, perhaps its best-known brand, is a personal finance website. As of November 8, 2017, it became a subsidiary of Red Ventures through an acquisition. Hist ...

), and

BuzzFeed BuzzFeed, Inc. is an American Internet media, news and entertainment company with a focus on digital media. Based in New York City, BuzzFeed was founded in 2006 by Jonah Peretti and John S. Johnson III to focus on tracking viral content. Ken ...

. In response to potential pitfalls around the use and misuse of generative AI in journalism, outlets such as Wired, The Associated Press and The Guardian have published guidelines around how they plan to use and not use generative AI in their work.

References

{{Reflist Artificial intelligence Artificial neural networks Deep learning Machine learning