Plain-English definitions for every AI term you'll encounter
A
Algorithm
A step-by-step set of instructions that a computer follows to solve a problem or complete a task.
A
Artificial Intelligence (AI)
Technology that enables computers to perform tasks that typically require human intelligence, such as understanding language, recognizing images, and making decisions.
A
Attention Mechanism
A neural network component that allows the model to focus on relevant parts of the input when generating output — the key innovation behind Transformers.
B
Backpropagation
An algorithm for training neural networks that calculates how much each weight contributed to the error and adjusts them accordingly.
B
BERT
Bidirectional Encoder Representations from Transformers — a Google language model that reads text from both directions, greatly improving language understanding tasks.
B
Bias (AI)
When an AI system produces unfair or skewed outcomes, usually because the training data reflects historical inequalities or prejudices.
B
Big Data
Extremely large datasets that are too complex for traditional data processing tools. Characterized by volume, velocity, and variety.
C
Chatbot
An AI program that simulates conversation with users, typically through text or voice. Examples: customer service bots, Siri, Alexa.
C
Classification
A machine learning task where the model assigns inputs to discrete categories (e.g., spam/not spam, cat/dog).
C
Claude
An AI assistant developed by Anthropic, known for being helpful, harmless, and honest.
C
CNN (Convolutional Neural Network)
A neural network architecture designed for processing visual data like images, using filters to detect patterns.
C
Computer Vision
The field of AI that enables computers to interpret and understand visual information from images and videos.
C
Context Window
The maximum amount of text (measured in tokens) that a language model can consider at one time.
D
Data Augmentation
Artificially expanding training data by applying transformations (flipping, rotating, cropping) to existing examples.
D
Dataset
A collection of data used to train, validate, or test a machine learning model.
D
Deep Learning
A subset of machine learning using neural networks with many layers (hence "deep") to automatically learn complex patterns from large data.
E
Embedding
A dense numerical vector representation of data (text, images) where similar items are close together in vector space.
E
Epoch
One complete pass through the entire training dataset during model training.
F
Federated Learning
A training approach where models learn from data across multiple devices or institutions without the raw data ever leaving its source.
F
Fine-tuning
Further training a pre-trained model on a specific dataset to adapt it for a particular task.
G
GAN (Generative Adversarial Network)
A neural network architecture where two networks (generator and discriminator) compete to produce realistic synthetic data.
G
Generative AI
AI that can create new content including text, images, music, code, and video by learning patterns from training data.
G
GPT
Generative Pre-trained Transformer — a series of large language models by OpenAI capable of generating human-like text.
G
Gradient Descent
An optimization algorithm that minimizes the model's loss by iteratively adjusting weights in the direction that reduces error.
H
Hallucination
When an AI model generates confident but factually incorrect information. A significant challenge in deploying LLMs safely.
H
Hyperparameter
A setting that controls the training process (learning rate, batch size) rather than a value learned from data.
I
Inference
The process of using a trained model to make predictions on new, unseen data.
L
Large Language Model (LLM)
A massive AI model trained on vast text data that can understand and generate human language. Examples: GPT-4, Claude, Gemini.
L
Latent Space
The compressed internal representation that a neural network learns, where similar concepts cluster together.
L
Learning Rate
A hyperparameter that controls how much the model's weights are adjusted with each training step.
L
LIDAR
Light Detection And Ranging — a sensor that measures distance using laser pulses, used in robotics and self-driving cars.
L
Loss Function
A mathematical formula that measures how wrong the model's predictions are — training aims to minimize this value.
M
Machine Learning
A branch of AI where systems learn from data and improve their performance over time without being explicitly programmed.
M
MCP (Model Context Protocol)
An open standard by Anthropic that lets AI models connect to external tools and data sources in a standardized way.
M
Multi-modal AI
AI that can process multiple types of data (text, images, audio, video) together.
N
Natural Language Processing (NLP)
The field of AI focused on enabling computers to understand, interpret, and generate human language.
N
Neural Network
A computing system loosely inspired by the human brain, consisting of interconnected nodes that process information in layers.
O
Overfitting
When a model performs very well on training data but poorly on new data because it memorized examples instead of learning patterns.
P
Parameter
A numerical value in a neural network that is learned during training. Modern LLMs have billions or trillions of parameters.
P
Policy (RL)
In reinforcement learning, the strategy the agent uses to decide which action to take in each state.
P
Pre-training
Training a model on a large general dataset before fine-tuning it for specific tasks.
P
Prompt
The input text or instruction given to an AI model to guide its output.
P
Prompt Engineering
The skill of crafting effective prompts to get desired outputs from AI models.
R
RAG (Retrieval-Augmented Generation)
A technique where an LLM retrieves relevant documents from a database before generating a response, reducing hallucinations.
R
Regression
A machine learning task that predicts continuous numerical values (e.g., house prices, temperature).
R
Reinforcement Learning
A type of machine learning where an agent learns by taking actions in an environment and receiving rewards or penalties.
R
RLHF
Reinforcement Learning from Human Feedback — using human ratings of AI outputs to train more helpful and safe models.
R
RNN (Recurrent Neural Network)
A neural network designed for sequential data where outputs from previous steps feed back as inputs, enabling memory.
R
Robotics
The engineering discipline that designs, builds, and programs robots — physical machines that sense, process, and act in the world.
S
Semantic Search
Search that understands the meaning of a query, not just keywords — powered by embeddings and neural networks.
S
SLAM
Simultaneous Localization and Mapping — a robotics technique for building a map of an environment while tracking position within it.
S
Supervised Learning
A type of ML where the model learns from labeled training data — each example has an input and a correct answer.
T
Token
A unit of text (word, sub-word, or character) that language models process. "Hello world" = 2 tokens.
T
Training
The process of adjusting a model's parameters using data so it can make accurate predictions.
T
Transfer Learning
Reusing a pre-trained model on a new but related task, saving time and data requirements.
T
Transformer
A neural network architecture introduced in 2017 that uses attention mechanisms. The foundation of all modern LLMs.
U
Underfitting
When a model is too simple to capture the patterns in data, performing poorly on both training and new data.
U
Unsupervised Learning
ML where the model finds patterns in unlabeled data without predefined correct answers. Used for clustering and anomaly detection.
V
Vector Database
A database optimized for storing and searching high-dimensional embeddings — essential for semantic search and RAG systems.
X
XP (Experience Points)
Points you earn on AiMyGuru by completing quizzes. Higher scores earn more XP and unlock better badge ranks.
Z
Zero-shot Learning
The ability of a model to handle tasks it was never explicitly trained on, using only its general knowledge.