CONCEPT Cited by 2 sources

Trusted Execution Environment (TEE)¶

Definition¶

A Trusted Execution Environment (TEE) is a hardware-enforced isolated execution context whose contents — memory, register state, and (with modern extensions) accelerator state — are inaccessible to the host OS, hypervisor, and cloud-operator control plane. A TEE typically provides:

Memory confidentiality + integrity — contents encrypted + authenticated by the CPU; the host cannot read or silently tamper.
Execution isolation — the TEE runs outside the hypervisor's trust boundary.
Hardware root of trust — the CPU produces a signed attestation of the loaded binary's measurement that a remote verifier can check.

TEEs come in several shapes — enclave-style (e.g. Intel SGX: an application-level protected memory region inside a normal process), VM-style (e.g. AMD SEV-SNP, Intel TDX: the protected boundary is a whole virtual machine; see CVMs), and emerging accelerator-side (e.g. NVIDIA Hopper Confidential Computing mode on GPUs).

What TEEs are for¶

TEEs address the "data in use" gap: classical cryptography protects data at rest (disk encryption) and in transit (TLS), but once the data is decrypted for computation, it sits as plaintext in process memory — visible to a privileged attacker who compromises the host OS, hypervisor, or datacentre operator. TEEs close that gap by letting code run over plaintext inside a boundary the host cannot see into.

Why TEEs are load-bearing for private AI inference¶

Server-side LLM inference over private user content has historically required the user to accept that the inference provider can see the plaintext. WhatsApp Private Processing is the canonical wiki instance of using a TEE to run large-model inference while preserving end-to-end encryption: the device encrypts the request with an ephemeral key bound to a specific CVM whose binary digest has been attested against a published ledger; only the device and that CVM can decrypt.

What TEEs are NOT¶

TEEs do not by themselves:

Prove code correctness — attestation proves which binary is running, not that it does what it claims. Pair with transparency logs, open-source, and third-party audit.
Defeat side-channel attacks — TEE research repeatedly demonstrates speculative-execution, timing, and power side-channels. Defence-in-depth is required.
Defeat physical attacks on the host — encrypted DRAM closes many, not all. Meta explicitly notes this residual risk and layers physical datacentre controls on top.
Solve confidentiality of the application's own logic — a buggy app inside a TEE can still leak data. Containerisation, log-filtering, and minimised input surfaces still apply.
Eliminate the need for defence-in-depth — the TEE is one layer; everything above, below, and beside it still needs hardening.

Composition with the rest of the stack¶

In the Private Processing architecture, the TEE is the inner-most trust boundary, composed with:

Remote attestation + RA-TLS — gate the client session on a verified TEE binary digest.
Verifiable transparency log of acceptable digests — so the what is auditable, not just the whether.
OHTTP + third-party relay — route-level non-targetability so the operator cannot map a user to a specific TEE host.
Anonymous credentials — application-level authentication without identification.
Log-filtering egress pipeline — observability without content leakage.

TEE as one defense layer, not the only one¶

The 2026-05-27 Google zero-trust-aggregation post canonicalises a structural framing of TEE that the WhatsApp Private Processing instance hinted at but did not load-bear:

"With this solution, confidentiality no longer relies entirely on hardware protection. The cryptographic layer ensures that individual raw data is never exposed or reconstructed in any server memory — not even within the hardware-protected perimeters."

Translation: a TEE alone is not the load-bearing privacy mechanism when the deployment is long-lived and side-channels are an ongoing risk class. The structural justification is TEE side-channel vulnerabilities — "new side-channel vulnerabilities are expected to be discovered." The architectural response in the Google design is cryptography-plus-TEE defense in depth: a cryptographic confidentiality layer underneath the TEE, with the TEE providing attestation transparency (which binary is running) rather than serving as the only confidentiality boundary.

Two distinct production patterns emerge across wiki instances:

Pattern	TEE's role	Wiki canonical instance
patterns/tee-for-private-ai-inference	Sole confidentiality boundary; data is plaintext inside the TEE	systems/whatsapp-private-processing
patterns/cryptography-plus-tee-defense-in-depth	Auditability + defense-in-depth backstop; data is encrypted even inside the TEE	systems/google-confidential-federated-analytics

The TEE-for-private-AI-inference pattern works for inference workloads where the model needs to operate on plaintext (no homomorphic-inference substrate is currently practical at LLM scale). The cryptography-plus-TEE pattern works for aggregation / federated-analytics workloads where the math admits ciphertext-aggregation primitives like secure aggregation — there's no need to ever decrypt individuals.

Seen in¶

sources/2025-04-30-meta-building-private-processing-for-ai-tools-on-whatsapp — TEE (specifically a CVM + Confidential-Compute-mode GPU) is the core primitive underneath Private Processing. Canonical wiki instance of TEE-for-private-AI-inference.
sources/2026-05-27-google-private-analytics-via-zero-trust-aggregation — TEE composed with cryptographic secure-aggregation in Google Confidential Federated Analytics. Canonical wiki instance of cryptography-plus-TEE defense in depth — TEE provides auditability + backstop, not the only confidentiality boundary. Specifically cites Intel TDX and AMD SEV-SNP as the TEE substrate options, and cites SNPeek + TDXray as the side-channel risk class motivating the composition.

concepts/confidential-computing — the industry-wide posture that TEEs instantiate.
concepts/remote-attestation — the proof-of-binary-identity mechanism.
concepts/ra-tls — the TLS composition that turns attestation into a session-gate.
concepts/blast-radius — the TEE bounds blast radius on host compromise.
concepts/defense-in-depth — TEEs are one layer, not a panacea.
concepts/tee-side-channel-vulnerability — the persistent risk class motivating composition with cryptographic privacy layers.
concepts/secure-aggregation — the cryptographic primitive that composes with TEE in the federated-analytics direction.
concepts/federated-analytics — the workload class where cryptography-plus-TEE composition is most natural.
systems/cvm-confidential-virtual-machine — the VM-granularity realisation.
systems/intel-tdx — Intel's VM-TEE substrate.
systems/amd-sev-snp — AMD's VM-TEE substrate.
systems/google-confidential-federated-analytics — canonical cryptography-plus-TEE production instance.
patterns/tee-for-private-ai-inference — TEE-as-only-confidentiality-boundary pattern.
patterns/cryptography-plus-tee-defense-in-depth — TEE-as-backstop pattern.
patterns/attestation-before-session-key-release — how TEEs are gated.