Introducing Chroma Cloud

Chroma Cloud is a hosted, production-grade deployment of Chroma built from everything we learned shipping the single-node engine, re-architected into a distributed system for scale. Under the hood, it’s a cloud-native, distributed implementation of Chroma, but the API stays the same, so you can move from single-node to cloud without rewriting your app. Chroma Cloud runs across AWS and GCP and uses a distributed vector index to enable massive scale without provisioning or tuning infrastructure.
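To make the "same API" point concrete, here is a minimal sketch of switching between a local client and a cloud client while the application code stays untouched. It assumes the `chromadb` Python package and its `PersistentClient` and `CloudClient` constructors. The environment variable names, the `./chroma_data` path, and the `docs` collection name are illustrative assumptions, not values taken from this post.

```python
import os


def cloud_settings(env=os.environ):
    """Collect Chroma Cloud credentials from the environment, if present.

    The variable names below are assumptions made for this sketch.
    """
    api_key = env.get("CHROMA_API_KEY")
    if not api_key:
        return None
    return {
        "api_key": api_key,
        "tenant": env.get("CHROMA_TENANT"),
        "database": env.get("CHROMA_DATABASE"),
    }


def make_client(env=os.environ, local_path="./chroma_data"):
    """Return a cloud client when credentials exist, else a local one."""
    import chromadb  # imported lazily so cloud_settings() has no dependency

    settings = cloud_settings(env)
    if settings is None:
        return chromadb.PersistentClient(path=local_path)
    return chromadb.CloudClient(**settings)


def index_and_search(client):
    """Application code: written once, used with either client type."""
    collection = client.get_or_create_collection(name="docs")
    collection.add(
        ids=["a", "b"],
        documents=["vector databases", "write-ahead logs"],
    )
    return collection.query(
        query_texts=["how are writes made durable?"],
        n_results=1,
    )
```

Calling `index_and_search(make_client())` runs against local storage when no API key is set and against Chroma Cloud when one is, which is the migration path described above.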

Distributed indexing

Chroma Cloud uses an implementation of SPANN combined with SPFresh for our distributed vector index, enabling indexes to scale horizontally. Chroma Cloud’s strong consistency is driven by wal3, our object-storage backed write-ahead log.

In practice, this means you can:

Ingest large collections without babysitting index operations.
Query with low latency while your workload scales across a distributed infrastructure.
Rely on a production-grade indexing path that’s tuned and operated for you (so you don’t have to become an indexing expert).
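For readers unfamiliar with SPANN-style indexes, the core idea can be sketched in a few lines: vectors are grouped into posting lists around centroids, and a query only scans the posting lists whose centroids are closest to it. The toy below is an illustration of that idea only, not Chroma Cloud’s implementation. The class name, the fixed centroids, and the `n_probe` parameter are assumptions, and it leaves out everything that makes the real system work at scale (posting lists on object storage, the incremental rebalancing that SPFresh adds, and wal3).

```python
import math


def l2(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class ClusteredIndex:
    """Toy centroid + posting-list index (hypothetical, for illustration)."""

    def __init__(self, centroids):
        self.centroids = list(centroids)
        self.postings = [[] for _ in self.centroids]  # one list per centroid

    def add(self, doc_id, vector):
        # Assign the vector to the posting list of its nearest centroid.
        nearest = min(
            range(len(self.centroids)),
            key=lambda i: l2(vector, self.centroids[i]),
        )
        self.postings[nearest].append((doc_id, vector))

    def search(self, query, k=1, n_probe=1):
        # Probe only the n_probe closest posting lists, then rank exactly.
        order = sorted(
            range(len(self.centroids)),
            key=lambda i: l2(query, self.centroids[i]),
        )
        candidates = [item for i in order[:n_probe] for item in self.postings[i]]
        candidates.sort(key=lambda item: l2(query, item[1]))
        return [doc_id for doc_id, _ in candidates[:k]]


index = ClusteredIndex(centroids=[(0.0, 0.0), (10.0, 10.0)])
index.add("a", (0.5, 0.2))
index.add("b", (1.0, 1.5))
index.add("c", (9.0, 9.5))
index.add("d", (10.5, 10.0))

result = index.search((9.2, 9.4), k=2, n_probe=1)
```

Because each posting list can be stored and scanned independently, this layout is what lets an index of this kind spread across machines instead of living in one node’s memory.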

Dense, sparse, full-text, and metadata search

Modern search and retrieval isn’t one-size-fits-all, so Chroma Cloud supports a variety of index types:

Dense vector search for semantic similarity over embeddings.
BM25 and SPLADE for keyword-style search to capture exact term matches.
Full-text search (FTS) and regex on documents when you need lexical matching over raw text.
Metadata filtering with inverted indexes on metadata values to narrow results efficiently.
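To show how two of these index types work together, the sketch below narrows candidates with an inverted index over metadata values and then ranks the survivors with BM25. It is a standalone illustration in plain Python, not Chroma’s query API. The sample documents, the `search` helper, and the `k1` and `b` defaults are assumptions chosen for the example.

```python
import math
from collections import Counter, defaultdict

# Hypothetical corpus: doc_id -> (text, metadata)
DOCS = {
    "d1": ("chroma stores embeddings", {"lang": "en"}),
    "d2": ("write ahead log on object storage", {"lang": "en"}),
    "d3": ("chroma cloud log", {"lang": "de"}),
}

TOKENS = {doc_id: text.split() for doc_id, (text, _) in DOCS.items()}

# Inverted index on metadata: (key, value) -> set of matching doc ids.
META_INDEX = defaultdict(set)
for doc_id, (_, metadata) in DOCS.items():
    for key, value in metadata.items():
        META_INDEX[(key, value)].add(doc_id)


def bm25(query, doc_id, k1=1.5, b=0.75):
    """Score one document against a query with the standard BM25 formula."""
    n_docs = len(TOKENS)
    avg_len = sum(len(tokens) for tokens in TOKENS.values()) / n_docs
    term_counts = Counter(TOKENS[doc_id])
    score = 0.0
    for term in query.split():
        tf = term_counts[term]
        if tf == 0:
            continue
        doc_freq = sum(1 for tokens in TOKENS.values() if term in tokens)
        idf = math.log(1 + (n_docs - doc_freq + 0.5) / (doc_freq + 0.5))
        length_norm = 1 - b + b * len(TOKENS[doc_id]) / avg_len
        score += idf * tf * (k1 + 1) / (tf + k1 * length_norm)
    return score


def search(query, where):
    """Filter by one metadata (key, value) pair, then rank by BM25."""
    candidates = META_INDEX.get(where, set())
    return sorted(candidates, key=lambda d: bm25(query, d), reverse=True)


ranked = search("chroma log", where=("lang", "en"))
```

Here `d3` matches both query terms but is excluded by the metadata filter before any scoring happens, which is why filtering through an inverted index keeps the ranking step cheap.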

Collection forking

Chroma Cloud Forking lets you duplicate a collection quickly without paying the time and cost of re-indexing. Forked collections share underlying data with the parent and only incur additional storage cost for incremental changes, keeping forks fast and cost-efficient.

This is ideal for:

Versioned datasets (e.g., codebases by commit/tag)
A/B testing retrieval settings and prompts against the “same” corpus
Multi-tenant or per-customer variants without duplicating everything upfront
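The cost model behind forking (shared data plus incremental changes) is easiest to see with a small copy-on-write sketch. This is a conceptual illustration only and makes no claim about how Chroma Cloud stores forks. The `Collection` class here is a hypothetical stand-in, not the client library’s collection type.

```python
class Collection:
    """Toy copy-on-write collection (hypothetical, for illustration)."""

    def __init__(self, name, layers=()):
        self.name = name
        self._shared = tuple(layers)  # frozen layers, shared by reference
        self._delta = {}              # this collection's own writes

    def upsert(self, doc_id, document):
        self._delta[doc_id] = document

    def delete(self, doc_id):
        self._delta[doc_id] = None    # tombstone hides older versions

    def get(self, doc_id):
        # Newest data wins: own delta first, then shared layers, newest first.
        for layer in (self._delta, *reversed(self._shared)):
            if doc_id in layer:
                return layer[doc_id]
        return None

    def fork(self, name):
        # Freeze current writes into a shared layer; nothing is copied.
        if self._delta:
            self._shared = self._shared + (self._delta,)
            self._delta = {}
        return Collection(name, self._shared)

    def own_records(self):
        """Number of records stored only for this collection."""
        return len(self._delta)


main = Collection("main")
main.upsert("1", "alpha")
main.upsert("2", "beta")

experiment = main.fork("experiment")
experiment.upsert("2", "beta-v2")  # only this change costs extra storage
main.upsert("3", "gamma")          # later parent writes stay out of the fork
```

After these calls the fork still reads record `1` from the shared layer, sees its own version of record `2`, and holds exactly one record of its own, which mirrors the "pay only for incremental changes" behavior described above.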

Dashboard

Chroma Cloud includes a robust web-based dashboard for day-to-day operations. Collaborate with your team, inspect and validate collection contents, and monitor usage and billing, all in one place.

Collection configuration persists inside Chroma’s dashboard, where you can test and debug queries directly
View performance metrics like p50, p90 and p99 query latencies as well as request and storage volumes
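If the percentile labels are unfamiliar: p50 is the latency that half of all requests stay at or under, p90 covers 90% of requests, and p99 covers 99%. The sketch below computes them with the nearest-rank method over made-up sample latencies. The dashboard may use a different estimator, so treat this as a way to read the numbers rather than a description of how they are produced.

```python
def percentile(samples, p):
    """Nearest-rank percentile: smallest value covering p% of the samples."""
    ordered = sorted(samples)
    # Integer ceiling of p * n / 100, which avoids floating-point rounding.
    rank = (p * len(ordered) + 99) // 100
    return ordered[max(rank, 1) - 1]


# Hypothetical query latencies in milliseconds.
latencies_ms = [12, 15, 11, 14, 90, 13, 16, 12, 250, 14]

summary = {f"p{p}": percentile(latencies_ms, p) for p in (50, 90, 99)}
```

With these samples the median sits near the typical request, while p90 and p99 surface the two slow outliers, which is why tail percentiles are the ones to watch for regressions.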

Insights and metrics

In the dashboard, you can track key operational signals for your Cloud DBs and collections, including:

Usage and activity so you can spot spikes, trends, and regressions.
Storage and data growth to understand how collections are evolving as you ingest more data.
Billing and spend visibility tied to usage so teams can stay on top of costs.

CLI

The Chroma CLI is deeply integrated with Chroma Cloud so you can manage workflows from your terminal.

Authenticate with chroma login, then use per-team profiles that persist API keys and tenant IDs locally
Manage databases and explore data in collections with a built-in collection browser
Copy local data to and from Chroma Cloud: use chroma copy to copy one or more collections (or --all) from a local Chroma server into a Chroma Cloud DB.

Pricing

Chroma Cloud is usage-based: you pay for what you use as you scale up (and scale down).

No minimums: get started with production-grade infrastructure without upfront commitments.
Predictable, transparent metering so teams can track and manage spend over time.

If you’re building anything from a prototype to a mission-critical production workload, Chroma Cloud is the fastest path to reliable, scalable retrieval, powered by the open-source Chroma engine, fully managed for you.
