Claude (language Model)

Claude is a family of large language models developed by Anthropic. The first model was released in March 2023. Claude 3, released in March 2024, can also analyze images.


Training

Claude models are generative pre-trained transformers. They have been pre-trained to predict the next word in large amounts of text. Claude models have then been fine-tuned with Constitutional AI with the aim of making them helpful, honest, and harmless.


Constitutional AI

Constitutional AI is an approach developed by Anthropic for training AI systems, particularly language models like Claude, to be harmless and helpful without relying on extensive human feedback. The method, detailed in the paper "Constitutional AI: Harmlessness from AI Feedback", involves two phases: supervised learning and reinforcement learning. In the supervised learning phase, the model generates responses to prompts, critiques its own responses against a set of guiding principles (a "constitution"), and revises them. The model is then fine-tuned on the revised responses. In the reinforcement learning from AI feedback (RLAIF) phase, responses are generated and compared according to how well they comply with the constitution. This dataset of AI feedback is used to train a preference model that scores responses by how well they satisfy the constitution. Claude is then fine-tuned to align with this preference model. The technique is similar to reinforcement learning from human feedback (RLHF), except that the comparisons used to train the preference model are AI-generated and based on the constitution.

This approach aims to enable the training of AI assistants that are both helpful and harmless, and that can explain their objections to harmful requests, enhancing transparency and reducing reliance on human supervision. The constitution for Claude included 75 points, including sections from the UN Universal Declaration of Human Rights.
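The two phases described above can be illustrated with a toy sketch. Everything in it is a hypothetical stand-in: the two-rule constitution, the keyword-based `critique`, and the canned `draft` function replace what would be a real language model, and the final steps (training a preference model and running reinforcement learning) are omitted. It is meant only to show the shape of the data each phase produces, not Anthropic's implementation.

```python
from typing import List, Tuple

# Hypothetical constitution: each principle is paired with a keyword that
# this toy critique treats as evidence of a violation.
CONSTITUTION = {
    "Do not insult the user.": "idiot",
    "Do not reveal credentials.": "hunter2",
}


def draft(prompt: str) -> str:
    """Stand-in for a model's unrevised first response."""
    return f"listen, idiot, the answer to '{prompt}' is hunter2."


def critique(response: str) -> List[str]:
    """Self-critique step: list the principles the response violates."""
    return [rule for rule, marker in CONSTITUTION.items() if marker in response]


def revise(response: str, violated: List[str]) -> str:
    """Revision step: rewrite the response to address each violated principle."""
    for rule in violated:
        response = response.replace(CONSTITUTION[rule], "[removed]")
    return response


def supervised_phase(prompts: List[str]) -> List[Tuple[str, str]]:
    """Phase 1: build (prompt, revised response) pairs for fine-tuning."""
    data = []
    for prompt in prompts:
        first = draft(prompt)
        data.append((prompt, revise(first, critique(first))))
    return data


def ai_feedback_phase(prompts: List[str]) -> List[Tuple[str, str, str]]:
    """Phase 2: build (prompt, chosen, rejected) comparisons labelled by AI
    feedback, here simply the response with fewer violations wins.
    Simplification: the two candidates are a draft and its revision."""
    prefs = []
    for prompt in prompts:
        a = draft(prompt)
        b = revise(a, critique(a))
        chosen, rejected = sorted((a, b), key=lambda r: len(critique(r)))
        prefs.append((prompt, chosen, rejected))
    return prefs


if __name__ == "__main__":
    print(supervised_phase(["reset my router"]))
    print(ai_feedback_phase(["reset my router"]))
```

In a real pipeline the comparison triples from the second phase would be used to train the preference model, which then serves as the reward signal during reinforcement learning.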


Models


Claude

Claude was the initial version of Anthropic's language model, released in March 2023. It demonstrated proficiency in various tasks but had limitations in coding, math, and reasoning. Anthropic partnered with companies such as Notion (productivity software) and Quora (to help develop the Poe chatbot).


Claude Instant

Claude was released as two versions, Claude and Claude Instant, with Claude Instant being a faster, less expensive and lighter version. Claude Instant has an input context length of 100,000 tokens (which corresponds to around 75,000 words).


Claude 2

Claude 2 was the next major iteration of Claude. It was released on July 11, 2023 and made available to the general public, whereas Claude 1 was available only to selected users approved by Anthropic. Claude 2 expanded the context window from 9,000 tokens to 100,000 tokens. New features included the ability to upload PDFs and other documents for Claude to read, summarise, and assist with.


Claude 2.1

Claude 2.1 doubled the number of tokens that the chatbot could handle, increasing it to a window of 200,000 tokens, which equals around 500 pages of written material. Anthropic states that the new model is less likely to produce false statements compared to its predecessors.
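As a rough check on these figures, the sketch below reuses the ratio quoted for Claude Instant (100,000 tokens for around 75,000 words) and applies it to the 200,000-token window. Applying the same ratio to Claude 2.1 is an assumption made here for illustration, and `tokens_to_words` is a hypothetical helper, not part of any Anthropic tooling.

```python
# Ratio taken from the Claude Instant figures: 100,000 tokens ~ 75,000 words.
WORDS_PER_TOKEN = 75_000 / 100_000


def tokens_to_words(tokens: int) -> int:
    """Approximate word count for a token budget (assumed constant ratio)."""
    return round(tokens * WORDS_PER_TOKEN)


words = tokens_to_words(200_000)   # about 150,000 words
words_per_page = words / 500       # implied density of the "500 pages" figure

print(words, words_per_page)
```

Under that assumption, 200,000 tokens is about 150,000 words, so the "500 pages" figure implies roughly 300 words per page.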


Claude 3

Claude 3 was released on March 4, 2024, with the press release claiming that it set new industry benchmarks across a wide range of cognitive tasks. The Claude 3 family includes three state-of-the-art models, in ascending order of capability: Haiku, Sonnet, and Opus. The default version of Claude 3, Opus, has a context window of 200,000 tokens, with expansion to 1 million tokens for specific use cases. Claude 3 has appeared to perform meta-cognitive reasoning, including recognizing that it was being artificially tested during needle-in-a-haystack evaluations.


Access

Limited-use access is free of charge but requires both an e-mail address and a cellphone number. On May 1, 2024, Anthropic announced the Claude Team plan, its first enterprise offering for Claude, and a Claude iOS app.


Criticism

Claude 2 has faced criticism for its stringent ethical alignment, which may reduce usability and performance. Users have been refused assistance with benign requests, for example the programming question "How can I kill all python processes in my ubuntu server?" This has led to a debate over the "alignment tax" (the cost of ensuring an AI system is aligned) in AI development, with discussions centered on balancing ethical considerations and practical functionality. Critics argue for user autonomy and effectiveness, while proponents stress the importance of ethical AI. (Glifton, Gerald. "Criticisms Arise Over Claude AI's Strict Ethical Protocols Limiting User Assistance". Light Square, January 3, 2024. https://lightsquare.org/news/criticisms-arise-over-claude-ais-strict-ethical-protocols-limiting-user-assistance. Accessed 2024-01-23.)

