Foundation Model
Definition
Foundation model is a term coined by Stanford's Center for Research on Foundation Models to describe large AI models like GPT-4, Claude, and DALL-E that are (1) trained at scale on massive, diverse datasets, (2) capable of being adapted to many downstream tasks without task-specific training from scratch, and (3) the basis upon which many AI applications are built. The 'foundation' metaphor emphasizes that these models serve as the underlying base for a variety of downstream applications—similar to a foundation in construction. Foundation models span modalities (text, image, audio, code) and include LLMs, vision-language models, embedding models, and generative image models.
Why It Matters
Foundation models are the infrastructure layer of the AI application era. The ability to start from a general-purpose, capable model and adapt it via prompting or fine-tuning dramatically reduces the cost and expertise needed to build AI-powered features. Before foundation models, each AI application required training a specialized model from scratch—data collection, labeling, training, and evaluation for every use case. Foundation models move AI from bespoke to commodity: organizations can access state-of-the-art capabilities through an API and customize for their domain with minimal ML expertise. For 99helpers, foundation models (specifically LLMs and embedding models) are the building blocks of every AI feature on the platform.
How It Works
Foundation models achieve their breadth through pre-training data diversity and scale. A text foundation model trained on books, websites, code, scientific papers, and conversations develops generalized capabilities across all these domains. Fine-tuning narrows the model's behavior toward a specific task, style, or domain while preserving the broad linguistic and reasoning capabilities from pre-training. Prompting elicits specific behaviors from the foundation model without modifying its weights. The emergence of high-quality open-weight foundation models (Llama, Mistral, Falcon) has democratized AI development: teams can download model weights and fine-tune them on private data without sending data to external providers.
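The prompting-based adaptation described above can be sketched in a few lines. This is a minimal illustration, not a real integration: `call_model` is a hypothetical stand-in for an API call to a hosted foundation model, and the task prompts are made up for the example.

```python
# Sketch: adapting ONE foundation model to several downstream tasks via
# prompting alone -- no weight updates. `call_model` is a hypothetical
# placeholder for a real chat-completion API call.

def call_model(system_prompt: str, user_input: str) -> str:
    # Placeholder: a real implementation would send both strings to a
    # hosted foundation model behind an API.
    return f"[{system_prompt}] {user_input}"

# Each "application" is the same base model plus a different instruction.
TASKS = {
    "summarize": "Summarize the following text in one sentence.",
    "classify": "Label the sentiment of the text as positive or negative.",
    "translate": "Translate the text into French.",
}

def run_task(task: str, text: str) -> str:
    return call_model(TASKS[task], text)
```

Swapping the instruction changes the application while the underlying model stays untouched, which is exactly why one foundation model can power many products.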
Foundation Model — One Base, Many Applications
Real-World Example
99helpers is built on several foundation models working in concert: a text embedding foundation model (text-embedding-3-small) converts knowledge base content and user queries to vectors for semantic search; a chat LLM (Claude or GPT-4o) generates responses using retrieved context; an optional image embedding model enables multimodal knowledge base search. None of these capabilities are built from scratch—each uses a pre-trained foundation model accessed via API. This foundation-model-powered architecture delivers capabilities that would have required years of ML research and infrastructure investment just five years earlier.
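The retrieval step in the architecture above can be sketched without any external service. The ranking logic below is the real pattern; `embed` is a toy bag-of-words stand-in for an embedding foundation model such as text-embedding-3-small, which would return dense float vectors instead.

```python
# Sketch of embedding-based semantic search: embed knowledge-base entries
# and the user query, then rank by cosine similarity. `embed` is a toy
# word-count stand-in for a real embedding model.
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy "embedding": a sparse word-count vector.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

kb = [
    "How to reset your password",
    "Billing and invoice questions",
    "Password security best practices",
]
top = retrieve("I forgot my password", kb)
# The retrieved snippets would then be inserted into the chat LLM's prompt.
```

In the real pipeline, the two password-related entries would be retrieved and passed as context to the chat model, which generates the final answer.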
Common Mistakes
- ✕ Treating all foundation models as equivalent—capabilities, safety properties, context windows, and costs vary enormously across models; selection matters.
- ✕ Conflating foundation model with LLM—LLMs are the most prominent foundation models, but the category includes embedding models, vision models, audio models, and multimodal models.
- ✕ Assuming foundation model capabilities are frozen—foundation models are retrained and updated; API behaviors can change with new versions, requiring regression testing.
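The regression-testing point in the last bullet can be made concrete: pin an exact model version and keep a small golden set of prompts whose responses must keep satisfying simple invariants. This is an illustrative sketch; the model name and cases are assumptions, and `call_model` is injected so the check runs against any provider (or a stub, as below).

```python
# Sketch: guarding against silent foundation-model updates. Pin a dated
# model version and assert invariants over a small golden prompt set.
# Model name and golden cases are illustrative, not prescriptive.

PINNED_MODEL = "gpt-4o-2024-08-06"  # pin an exact dated version, not a floating alias

GOLDEN_CASES = [
    # (prompt, predicate the response must satisfy)
    ("What is 2 + 2?", lambda r: "4" in r),
    ("Reply with exactly the word OK.", lambda r: r.strip() == "OK"),
]

def check_regression(call_model) -> list[str]:
    """Return the prompts whose responses no longer satisfy their invariant."""
    failures = []
    for prompt, ok in GOLDEN_CASES:
        if not ok(call_model(prompt)):
            failures.append(prompt)
    return failures

# Stubbed run (a real check would call the pinned model via its API):
stub = {"What is 2 + 2?": "2 + 2 = 4", "Reply with exactly the word OK.": "OK"}
assert check_regression(lambda p: stub[p]) == []
```

Running this check whenever the provider announces a model update, or on a schedule, turns "the model changed under us" from a production surprise into a failed test.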
Related Terms
Large Language Model (LLM)
A large language model is a neural network trained on vast amounts of text that learns to predict and generate human-like text, enabling tasks like answering questions, writing, translation, and code generation.
Pre-Training
Pre-training is the foundational phase of LLM development where the model learns language understanding and world knowledge by predicting the next token across vast text corpora, before any task-specific optimization.
Fine-Tuning
Fine-tuning adapts a pre-trained LLM to a specific task or domain by continuing training on a smaller, curated dataset, improving performance on targeted use cases while preserving general language capabilities.
Open-Source LLM
An open-source LLM is a language model with publicly available weights that anyone can download, run locally, fine-tune, and deploy without per-query licensing fees, enabling private deployment and customization.
Multimodal LLM
A multimodal LLM can process and reason over multiple input types—text, images, audio, video, or documents—extending language model capabilities beyond pure text to enable vision, document understanding, and cross-modal reasoning.