Glossary

Certification exam glossary

The cloud, security, AI, privacy, data, and DevOps terms that recur across exam objectives, each defined once in plain English and linked to the certifications that test it. Browse the full certification catalogue for practice.

AI & Machine Learning

Large Language Model (LLM)

A neural network trained on vast amounts of text to predict the next token, which lets it generate and reason over natural language. LLMs are the engine behind chat assistants, summarisation, and retrieval systems.

Tested in:AIF-C01 NCA-GENL PMLE

Foundation Model

A large model pre-trained on broad data that can be adapted to many downstream tasks through prompting or fine-tuning, rather than being built for a single purpose.

Tested in:AIF-C01 NCA-GENL

Retrieval-Augmented Generation (RAG)

A technique that grounds a language model's output in documents fetched at query time from an external knowledge source. It reduces hallucination and lets the model use current or private data it was never trained on.

Tested in:AIF-C01 NCA-GENL PMLE

Prompt Engineering

The practice of structuring the input to a generative model - instructions, context, and examples - to steer it toward accurate, well-formatted output, without changing the model's weights.

Tested in:AIF-C01 NCA-GENL

Fine-Tuning

Continuing the training of a pre-trained model on a smaller, task-specific dataset so it adapts to a domain or style. It changes the model's weights, unlike prompting or retrieval.

Tested in:AIF-C01 NCA-GENL PMLE

Embedding

A numeric vector that represents the meaning of text, an image, or other data, so that similar items sit close together in vector space. Embeddings power semantic search and retrieval-augmented generation.

Tested in:AIF-C01 NCA-GENL PMLE

Hallucination

Output from a generative model that is fluent and confident but factually wrong or unsupported by its source. Grounding techniques such as retrieval-augmented generation and human review reduce it.

Tested in:AIF-C01 NCA-GENL AIGP

Inference

Running a trained model on new input to produce a prediction or generation, as opposed to training, which produces the model. Inference cost and latency are the main operational concerns once a model ships.

Tested in:AIF-C01 NCA-AIIO NCA-ADS PMLE

Token

The unit of text a language model reads and generates, roughly a word fragment. Context windows, pricing, and rate limits are all measured in tokens.

Tested in:AIF-C01 NCA-GENL

Cloud

Shared Responsibility Model

The division of security duties between the cloud provider, who secures the infrastructure, and the customer, who secures their data, identities, and configuration. Where the line falls depends on the service model (IaaS, PaaS, SaaS).

Tested in:CLF-C02 AZ-900 SC-900 SAA-C03

Identity and Access Management (IAM)

The control plane that governs who can authenticate and what they are authorised to do. It is how least-privilege access is enforced in every cloud platform.

Tested in:AZ-900 AZ-104 CLF-C02 SAA-C03 SC-300

Serverless

A model where code runs in response to events without the user provisioning or managing servers; the provider scales capacity automatically and bills per execution. AWS Lambda and Azure Functions are the canonical examples.

Tested in:SAA-C03 DVA-C02 AZ-900 CLF-C02

Security

Zero Trust

A security model that treats no user, device, or network as inherently trustworthy; every request is authenticated, authorised, and encrypted regardless of where it originates. 'Never trust, always verify' is the guiding principle.

Tested in:SC-900 SC-100 SY0-701

Principle of Least Privilege

Granting each user, service, or process only the permissions it needs to do its job, and no more. It limits the blast radius of a compromised credential.

Tested in:SY0-701 SC-300 CISSP CISM SC-900

Multi-Factor Authentication (MFA)

Requiring two or more independent proofs of identity - something you know, have, or are - before granting access. It is the single most effective control against credential theft.

Tested in:SC-900 SY0-701 SC-300 CISSP

Privacy & Governance

General Data Protection Regulation (GDPR)

The European Union law governing how the personal data of EU residents is collected, processed, and protected, with significant fines for non-compliance. It applies to any organisation handling that data, wherever the organisation is based.

Tested in:CIPP-E AIGP

Personally Identifiable Information (PII)

Any data that can identify a specific individual, directly or in combination with other data, such as a name, email, or device identifier. Protecting it is the core obligation of most privacy laws.

Tested in:CIPP-US CIPP-E AIGP

Data

ETL vs ELT

ETL extracts data, transforms it, then loads it into a warehouse; ELT loads raw data first and transforms it inside the warehouse. ELT suits modern cloud warehouses with cheap, scalable compute.

Tested in:DP-900 DP-700 COF-C03 PDE

DevOps

CI/CD

Continuous integration merges and tests code changes frequently; continuous delivery and deployment automate releasing those changes to production. Together they shorten the path from commit to live with fewer manual steps.

Tested in:AZ-400 DOP-C02 GH-200 GH-500

Blue-Green Deployment

Running two identical production environments and switching traffic from the old (blue) to the new (green) once it is verified. It enables near-zero-downtime releases and instant rollback.

Tested in:AZ-400 DOP-C02 SAA-C03