Large Language Model
LLM
ELI5 — The Vibe Check
The full name for an LLM — a really, really big AI trained on mountains of text. 'Large' means billions of parameters, 'Language' means it deals with text, and 'Model' means it's a mathematical thing that makes predictions. It's the thing that writes your emails now.
Real Talk
Large Language Models are neural networks, almost always transformer-based, trained on internet-scale text datasets to predict the next token. At sufficient scale they show capabilities often described as emergent, including in-context learning, multi-step reasoning, and zero-shot task performance. Examples include GPT-4, Claude, Gemini, and Llama.
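To make the "model that makes predictions" idea concrete, here is a toy sketch of next-word prediction built from simple word-pair counts. It is not a transformer and not how any real LLM is implemented; the corpus and the function names (train_bigram, next_word_probs, predict_next) are made up for illustration. A real LLM swaps the count table for a neural network with billions of parameters, but the job is the same: given what came before, score what comes next.

```python
from collections import Counter, defaultdict

# Tiny made-up corpus (an assumption for illustration);
# real LLMs train on vastly more text.
CORPUS = "the cat sat on the mat . the cat ate the fish ."


def train_bigram(text):
    """Count how often each word follows each other word."""
    words = text.split()
    counts = defaultdict(Counter)
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def next_word_probs(counts, prev):
    """Turn raw follow-up counts into a probability distribution."""
    followers = counts.get(prev, Counter())
    total = sum(followers.values())
    return {word: n / total for word, n in followers.items()}


def predict_next(counts, prev):
    """Pick the single most likely next word, or None if prev is unseen."""
    followers = counts.get(prev)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


model = train_bigram(CORPUS)
print(next_word_probs(model, "the"))
print(predict_next(model, "the"))
```

Here "the" is followed by "cat" twice and by "mat" and "fish" once each, so "cat" gets the highest probability. Always picking the top word is called greedy decoding; real systems often sample from the distribution instead.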
When You'll Hear This
"Large language models have changed how we build products." / "The large language model was fine-tuned on our support data."
Related Terms
ChatGPT
ChatGPT is the app that made AI mainstream — it's the iPhone moment for artificial intelligence.
Fine-tuning
Fine-tuning is like taking a smart graduate student who knows everything and then sending them to a specialist bootcamp.
GPT (Generative Pre-trained Transformer)
GPT is the family of AI models from OpenAI that kicked off the LLM revolution. GPT-3 made everyone's jaw drop, GPT-4 made jaws stay dropped.
LLM (Large Language Model)
An LLM is a humongous AI that read basically the entire internet and learned to predict what words come next, really really well.
Transformer
The Transformer is THE architecture behind most modern AI. ChatGPT, Claude, Whisper: all transformers under the hood. The key innovation?