// the search engineering layer

Focus on results, not infra.

Layer takes on your team's other jobs — caching, transforms, embedding, cost, and ops.

Free for 30 days. No credit card, no sales call.

deployment-map layer-pro

╔════════════╗      ╔════════════╗          ╔═══ retrieval system ═══════════════════╗
║ generated  ║░     ║ layer      ║░         ║                                        ║░
║ clients    ║◀────▶║ gateway    ║◀──API───▶║  ┏━━━━━━━━━┓  ┏━━━━━━━━━┓              ║░
║            ║░     ║            ║░         ║  ┃ ANN     ┃  ┃ BM25    ┃              ║░
╚════════════╝░     ╚═════╤══════╝░         ║  ┗━━━━━━━━━┛  ┗━━━━━━━━━┛              ║░
 ░░░░░░░░░░░░░░      ░░░░░│░░░░░░░░         ║                                        ║░
                          │                 ╚════════════════════════════════════════╝░
╔════════════╗      ╔═════▼══════╗           ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
║ layer      ║░     ║ layer      ║░
║ dashboard  ║◀────▶║ operator   ║░         ╔═══ kubernetes api ═════════════════════╗
║            ║░     ║ + runtime  ║◀────────▶║                                        ║░
╚════════════╝░     ╚═════╤══════╝░         ║ RBAC · transforms · agents · cache     ║░
 ░░░░░░░░░░░░░░      ░░░░░│░░░░░░░░         ║ telemetry · cost · hosted operations   ║░
                          ▼                 ║                                        ║░
                   ┏━━━━━━━━━━━━━━┓         ╚════════════════════════════════════════╝░
                   ┃ Object Store ┃          ░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░░
                   ┃ Bucket (S3)  ┃
                   ┗━━━━━━━━━━━━━━┛

# layer run

Experiment faster in production.

Ever needed to backfill your production data? With Layer that's as easy as building a Docker container: Layer handles the compute and backfills as much of your index as you specify.

$ layer run -f udf.yaml --index products

✓ submitted product-tags
→ watching   142 rows · 0 failed · 8 rows/s
→ watching 1,284 rows · 0 failed · 11 rows/s
→ watching 4,510 rows · 0 failed · 13 rows/s
✓ complete · 12,840 rows · 23s · 0 failed

You build and push your container to the configured registry. Layer handles queueing and scaling semantics for you, while you track progress. No Kubernetes experience necessary.

# jobs to be done

Your search team's jobs to be done.

Search teams own many of the most complex and expensive jobs in your org's data platform. Most of them have nothing to do with making search better for users.

// ship embeddings

Ship Python. Layer runs the GPU pool.

Building CUDA images, writing Kubernetes autoscalers, managing Spark: this is the time sink every search team underestimates, and managed services trade one kind of pain for another. Layer collapses it: declare a Python UDF and Layer runs the work on CPU or GPU, scaling pods and nodes between bursts.
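As a rough illustration of what "declare a Python UDF" could look like, the sketch below is a plain function that maps one document to a tagged document, plus a small batch runner that counts failures instead of aborting. The names `tag_product` and `run_batch` and the document fields are hypothetical; Layer's actual UDF interface lives in the docs and may differ.

```python
from typing import Iterable


# Hypothetical UDF: the name, signature, and document fields are
# illustrative assumptions, not Layer's documented interface.
def tag_product(doc: dict) -> dict:
    """Return the document with a derived 'tags' field."""
    title = doc.get("title", "").lower()
    # Keep distinct words longer than three characters as tags.
    tags = sorted({word for word in title.split() if len(word) > 3})
    return {**doc, "tags": tags}


def run_batch(docs: Iterable[dict]) -> tuple[list[dict], int]:
    """Apply the UDF to each row, counting failures instead of aborting."""
    out, failed = [], 0
    for doc in docs:
        try:
            out.append(tag_product(doc))
        except Exception:
            failed += 1
    return out, failed


rows, failed = run_batch([{"id": 1, "title": "Red Canvas Sneaker"}])
```

Keeping the UDF a pure function of one document is what makes it easy to fan out across pods: any row can be retried on its own.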

Read the Docs

// stay consistent

Track every state change your index makes.

Keeping the index in sync with source data usually means hand-rolled watchers and event hooks glued together by the team that wrote them. Layer ships the operator: it scans the index for consistency, watermarks state changes, and rolls up facets your application can read directly.
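One way to picture a watermark-based consistency scan: remember the highest source version already verified, and compare only rows newer than that. This is a minimal sketch, assuming each source row carries a monotonically increasing version number; `find_drift` and its parameters are hypothetical names, not Layer's operator code.

```python
def find_drift(source_rows, indexed_versions, watermark):
    """Return (stale_ids, candidate_watermark).

    source_rows: iterable of (doc_id, version) from the source of truth.
    indexed_versions: dict of doc_id -> version currently in the index.
    watermark: highest version already verified as consistent.
    """
    stale = []
    candidate = watermark
    for doc_id, version in source_rows:
        if version <= watermark:
            continue  # verified by an earlier scan
        if indexed_versions.get(doc_id, -1) < version:
            stale.append(doc_id)  # index is missing this doc or is behind
        candidate = max(candidate, version)
    return stale, candidate


source = [("a", 1), ("b", 2), ("c", 3)]
index = {"a": 1, "b": 1}
stale, candidate = find_drift(source, index, watermark=1)
```

In this sketch the caller should commit the candidate watermark only after the stale documents have been re-indexed; committing earlier would make the next scan skip them.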

Read the Docs

// serve fetches

A doc cache deep enough to forget about.

Whether you need a near-bottomless queue for building your pipeline or a pull-through cache for serving full datasets, your search system needs O(1) reads and writes. Layer ships a production-hardened document cache tuned for high-throughput fetches.
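For readers new to the term, a pull-through cache answers hits from memory and, on a miss, fetches from the origin and stores the result. The sketch below is an in-process LRU version with average O(1) reads and writes. `PullThroughCache` and its `fetch` callback are hypothetical, and Layer's document cache is a separate, production system that this toy does not represent.

```python
from collections import OrderedDict


class PullThroughCache:
    """LRU pull-through cache: misses are fetched from the origin and stored."""

    def __init__(self, fetch, capacity=1024):
        self._fetch = fetch          # callable: key -> document (origin lookup)
        self._capacity = capacity
        self._items = OrderedDict()  # hash map that also tracks recency order
        self.hits = 0
        self.misses = 0

    def get(self, key):
        if key in self._items:
            self._items.move_to_end(key)     # mark as most recently used
            self.hits += 1
            return self._items[key]
        self.misses += 1
        doc = self._fetch(key)               # pull through to the origin
        self._items[key] = doc
        if len(self._items) > self._capacity:
            self._items.popitem(last=False)  # evict the least recently used
        return doc


origin_calls = []


def fetch_from_origin(key):
    origin_calls.append(key)  # record each trip to the origin
    return {"id": key}


cache = PullThroughCache(fetch_from_origin, capacity=2)
for key in ["a", "a", "b", "c", "a"]:
    cache.get(key)
```

With a capacity of two, the final read of `"a"` goes back to the origin because `"b"` and `"c"` pushed it out.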

Read the Docs

// see search

Metrics, traces, clickstream, alerts — without the plumbing.

Observability in 2026 has plenty of options and still demands plumbing. Layer bundles clickstream from the doc cache and operational metrics from the gateway into an opinionated dashboard, backed by a PromQL-compatible time-series store.

Read the Docs

// scope access

Scoped access without writing the auth proxy yourself.

Today every search team inside a multi-tenant product writes the auth proxy themselves: scope credentials to namespaces, gate the write paths, ship audit events somewhere security will accept. Layer ships scoped credentials, per-namespace RBAC, and an audit feed — the pattern your security team always asks for, as a primitive.
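The pattern described above, reduced to its core: a credential carries a set of allowed actions per namespace, every decision is checked against it, and every decision is recorded. This is a minimal sketch with hypothetical names (`Credential`, `authorize`, `audit_log`); it is not Layer's RBAC model or API.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Credential:
    subject: str
    # namespace -> set of allowed actions, e.g. {"read"} or {"read", "write"}
    scopes: dict = field(default_factory=dict)


audit_log = []  # stand-in for an audit feed


def authorize(cred, namespace, action):
    """Allow the action only if the credential is scoped to it; audit either way."""
    allowed = action in cred.scopes.get(namespace, set())
    audit_log.append(
        {
            "subject": cred.subject,
            "namespace": namespace,
            "action": action,
            "allowed": allowed,
        }
    )
    return allowed


cred = Credential("svc-search", {"tenant-a": {"read"}})
can_read = authorize(cred, "tenant-a", "read")
can_write = authorize(cred, "tenant-a", "write")
other_tenant = authorize(cred, "tenant-b", "read")
```

Denied requests are logged as well as allowed ones, since the denials are usually what a security review wants to see.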

Read the Docs

// track cost

Know exactly how much you're spending on search.

Today "what does search cost us per million docs" is a question nobody can answer in under a week. AWS line items live in one bill, turbopuffer in another, GPU pool minutes nowhere obvious. Layer pulls every line item into one invoice and derives the unit metrics (cost per million docs, cost per TiB indexed, cost per query), which update as you scrub the timeframe.
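The unit metrics are simple division once the line items sit in one place. The sketch below shows the arithmetic only; the function name `unit_costs` and every number in the example are made up for illustration and are not Layer pricing or output.

```python
TIB = 2**40  # bytes in one tebibyte


def unit_costs(line_items, docs_indexed, bytes_indexed, queries):
    """Derive unit metrics from a combined bill for one timeframe."""
    total = sum(line_items.values())
    return {
        "total_usd": total,
        "usd_per_million_docs": total / (docs_indexed / 1_000_000),
        "usd_per_tib_indexed": total / (bytes_indexed / TIB),
        "usd_per_query": total / queries,
    }


# Made-up figures for one month.
costs = unit_costs(
    line_items={"aws": 1200.0, "turbopuffer": 600.0, "gpu_pool": 200.0},
    docs_indexed=50_000_000,
    bytes_indexed=4 * TIB,
    queries=10_000_000,
)
# costs["usd_per_million_docs"] is 40.0 and costs["usd_per_tib_indexed"] is 500.0
```

Changing the timeframe only changes the inputs, so the same function serves a day, a month, or a quarter.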

Read the Docs

# vector systems

Built for turbopuffer.

Layer runs against turbopuffer today — one operator surface in front of the store your team already chose, wire-compatible with the client you already use.

# demos

See Layer working, live.

Real apps built on Layer, each making a gateway decision visible in the UI. Explore the demos.

shop: An end-to-end storefront: pipelines, CLIP search, facets, autoscaling.
chart: Clinical search scored against real relevance judgments.
hybrid-text: Hybrid text fusion proven with qrels — no embeddings, no GPU.
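For context on the hybrid-text demo: one common way to fuse two lexical rankings without embeddings is reciprocal rank fusion (RRF), and qrels (relevance judgments) let you score the result. The page does not say which fusion method the demo uses, so treat this as a generic sketch; `rrf`, `precision_at_k`, and the sample rankings are illustrative assumptions.

```python
def rrf(rankings, k=60):
    """Reciprocal rank fusion: sum 1 / (k + rank) over every input ranking."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; break ties by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def precision_at_k(ranked, qrels, k):
    """Fraction of the top-k results judged relevant in the qrels."""
    return sum(1 for d in ranked[:k] if qrels.get(d, 0) > 0) / k


title_ranking = ["d1", "d2", "d3"]  # e.g. BM25 over titles
body_ranking = ["d3", "d1", "d4"]   # e.g. BM25 over body text
fused = rrf([title_ranking, body_ranking])

qrels = {"d1": 1, "d3": 1}          # judged-relevant documents
p_at_2 = precision_at_k(fused, qrels, k=2)
```

Documents that rank well in both lists (`d1`, `d3`) rise above those that appear in only one, which is the effect the qrels then confirm or refute.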

# how to get started

A trial key gets you the full install path.

Sign up with a work email and Layer sends a signed trial key. Install typically takes around 15 minutes with Terraform and Helm.

Signup: Use the trial form and receive an emailed key for the install.
Helm value: Set license.token or reference an existing Kubernetes Secret.
Lifecycle: Layer degrades gracefully after your trial if you decide not to proceed.

# start a trial

Start your 30-day free trial.

Request your key and use it with a real turbopuffer-backed search workload. No credit card or sales call necessary.

// trial fit

1–3 person search team carrying a real retrieval workload and willing to provide feedback
using, or seriously considering, turbopuffer for retrieval
3–5 TB+ of managed source data
no CMEK requirement

The trial key and install instructions arrive by email.

# what's in the box

hev layer is a BYOC product installed with Terraform and Helm. Read the Docs.

Gateway: Rust gateway that stays wire-compatible with your turbopuffer path while adding the read-path machinery your client doesn't have.
Kube Operator: Kubernetes operator owning index consistency, snapshots, and per-workload autoscaling.
Dashboard: Operator console for click-ops and fin-ops — namespaces, snapshots, jobs, cost in one place.
Clients: Python, Go, and TypeScript SDKs for teams bringing Layer into existing search code.
License: Trial and commercial keys install through Helm, expose health through /v2/license, and degrade through licensed, grace, and floor states.
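The three license states can be read as a simple function of time. The sketch below assumes that "grace" is a fixed window after expiry and "floor" is whatever follows it. The 14-day window and the names `LicenseState` and `license_state` are assumptions for illustration, not documented Layer behavior.

```python
from datetime import datetime, timedelta, timezone
from enum import Enum


class LicenseState(Enum):
    LICENSED = "licensed"
    GRACE = "grace"
    FLOOR = "floor"


def license_state(now, expires_at, grace=timedelta(days=14)):
    """Map the current time to a state; the grace length is an assumption."""
    if now <= expires_at:
        return LicenseState.LICENSED
    if now <= expires_at + grace:
        return LicenseState.GRACE
    return LicenseState.FLOOR


expiry = datetime(2026, 3, 1, tzinfo=timezone.utc)
during = license_state(datetime(2026, 2, 20, tzinfo=timezone.utc), expiry)
after = license_state(datetime(2026, 3, 5, tzinfo=timezone.utc), expiry)
later = license_state(datetime(2026, 4, 1, tzinfo=timezone.utc), expiry)
```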

See the docs for the full SBOM.