ClickHouse Table per Tenant in Production

Clickhouse table per tenant in production · Anantha Kumaran

02 Jun 2026

At work, we made a decision to go with a table per tenant approach and we have been running that setup for a couple of years now. I thought this would be the right time to share what we have learned and what works well.

Why?

The first question is: why would you want to create a table per tenant? In our case, we allow users to define their own event attributes with their own types. There are two main ways to handle this: keep all the attributes in a JSON field, or create a table per tenant. ClickHouse’s JSON support has been getting better and better, but when we ran the benchmark two years ago, a table per tenant was way better from multiple perspectives: less storage thanks to better compression, and much lower query latency. Back then JSON was mostly stored as a string, so you had to load and parse the whole blob rather than just the columns you needed. I will not spend more time here on why we made this decision. This post is more about what you need to be aware of and how to handle things if you decide to go down this route.
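To make the idea concrete, here is a minimal sketch of how a per-tenant table could be generated from user-defined attributes. It is not our actual schema: the `events_<tenant>` naming, the `event_time` and `event_name` base columns, the type mapping and the sort key are all assumptions made for illustration.

```python
import re

# Hypothetical mapping from user-facing attribute types to ClickHouse column types.
ATTRIBUTE_TYPES = {
    "string": "String",
    "integer": "Int64",
    "float": "Float64",
    "boolean": "Bool",
    "timestamp": "DateTime64(3)",
}

# Base columns assumed to exist on every tenant table (illustrative only).
RESERVED_COLUMNS = {"event_time", "event_name"}

# Conservative identifier rule so tenant and attribute names are safe to embed in DDL.
IDENTIFIER = re.compile(r"[a-z][a-z0-9_]{0,62}")


def create_table_sql(tenant_id: str, attributes: dict) -> str:
    """Build a CREATE TABLE statement for one tenant's events."""
    if not IDENTIFIER.fullmatch(tenant_id):
        raise ValueError(f"unsafe tenant id: {tenant_id!r}")

    columns = [
        "`event_time` DateTime64(3)",
        "`event_name` LowCardinality(String)",
    ]
    for name, kind in attributes.items():
        if not IDENTIFIER.fullmatch(name) or name in RESERVED_COLUMNS:
            raise ValueError(f"unsafe attribute name: {name!r}")
        if kind not in ATTRIBUTE_TYPES:
            raise ValueError(f"unknown attribute type: {kind!r}")
        # Nullable because a tenant-defined attribute may be missing on some events.
        columns.append(f"`{name}` Nullable({ATTRIBUTE_TYPES[kind]})")

    body = ",\n    ".join(columns)
    # No PARTITION BY clause: each tenant table stays a single partition.
    return (
        f"CREATE TABLE IF NOT EXISTS `events_{tenant_id}` (\n    {body}\n)\n"
        "ENGINE = MergeTree\n"
        "ORDER BY (event_name, event_time)"
    )


if __name__ == "__main__":
    print(create_table_sql("acme", {"plan": "string", "seats": "integer"}))
```

Each attribute becomes a real typed column, which is what gives you per-column compression and lets a query read only the columns it needs.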

Table vs Partition vs Part

+-----------------------------------------+
|                  table                  |
|   +---------------------------------+   |
|   |           partition1            |   |
|   | +--------+ +-------+ +-------+  |   |
|   | | part1  | | part2 | | part3 |  |   |
|   | +--------+ +-------+ +-------+  |   |
|   +---------------------------------+   |
|   +---------------------------------+   |
|   |           partition2            |   |
|   | +--------+ +-------+ +-------+  |   |
|   | | part1  | | part2 | | part3 |  |   |
|   | +--------+ +-------+ +-------+  |   |
|   +---------------------------------+   |
+-----------------------------------------+

A ClickHouse table is made of multiple partitions, and each partition is made of multiple parts. ClickHouse creates a new part per insert, so there are usually multiple parts per partition and multiple partitions per table. Most of the complexity comes from having too many parts. ClickHouse recommends no more than 5k tables, 50k partitions and 100k parts.

The number of tables doesn’t really matter; what you need to watch is the total part count. You can have just 10 tables and still hit the too-many-parts problem if your partitions are not set up carefully.

Parts are immutable

In ClickHouse, parts are immutable. Every insert creates a new part. In the background, ClickHouse merges parts together, and 5 to 20 parts per partition is considered normal. If the merge process can’t keep up with the rate of inserts, the part count keeps growing until inserts are slowed down and eventually rejected with a “too many parts” error.
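One way to check whether merges are keeping up is to count active parts per partition from the `system.parts` table. The sketch below is only an illustration: the threshold of 20 is taken from the "5 to 20 is normal" rule of thumb above, and the helper name is made up. The rows would come from whatever ClickHouse client you already use.

```python
# Counts active parts (those not yet replaced by a merge) per partition.
PARTS_PER_PARTITION_SQL = """
SELECT database, table, partition, count() AS active_parts
FROM system.parts
WHERE active
GROUP BY database, table, partition
ORDER BY active_parts DESC
"""


def partitions_falling_behind(rows, max_parts=20):
    """Return the (database, table, partition, active_parts) rows above max_parts.

    `rows` is the result of running PARTS_PER_PARTITION_SQL.
    """
    return [row for row in rows if row[3] > max_parts]


if __name__ == "__main__":
    # Hand-written sample rows, standing in for a real query result.
    sample = [
        ("default", "events_a", "tuple()", 7),
        ("default", "events_b", "tuple()", 42),
    ]
    for row in partitions_falling_behind(sample):
        print("merge backlog:", row)
```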

ClickHouse gives you two ways to insert data. The first is batch insert, where you handle the batching yourself. This is atomic and durable. The second is asynchronous insert, where ClickHouse buffers rows in memory at the partition level and flushes them every n seconds. There is a risk of data loss if the server crashes and you don’t wait for the flush.

If you go with asynchronous insert, you will likely need to tune the following settings:

async_insert_busy_timeout_ms = "30000" # 30 seconds
async_insert_max_data_size = "104857600" # 100 MiB
async_insert_max_query_number = "1000"

The right values depend on many factors. These settings affect how long it takes for new data to show up in read queries and, if you wait for the flush, how long it takes to get an ack for an insert. Set up proper monitoring for async inserts.
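A rough way to reason about these three settings is that the buffer is flushed as soon as any one of the limits is reached. The sketch below is a simplified model of that rule for building intuition, not the server's actual logic; the class and function names are invented for this example, and the defaults mirror the values shown above.

```python
from dataclasses import dataclass


@dataclass
class AsyncInsertLimits:
    """Illustrative container mirroring the three settings discussed above."""
    busy_timeout_ms: int = 30_000        # async_insert_busy_timeout_ms
    max_data_size: int = 104_857_600     # async_insert_max_data_size, in bytes
    max_query_number: int = 1_000        # async_insert_max_query_number


def should_flush(limits, buffered_bytes, buffered_queries, ms_since_first_insert):
    """Simplified model: flush when any single limit has been reached."""
    return (
        ms_since_first_insert >= limits.busy_timeout_ms
        or buffered_bytes >= limits.max_data_size
        or buffered_queries >= limits.max_query_number
    )


if __name__ == "__main__":
    limits = AsyncInsertLimits()
    # A small, young buffer keeps waiting; an old one is flushed on timeout.
    print(should_flush(limits, 1_024, 10, 5_000))
    print(should_flush(limits, 1_024, 10, 30_000))
```

For a low-traffic tenant the timeout is usually the limit that fires, so it sets the worst-case delay before a row becomes visible to reads.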

Number of parts

Your main goal should be to keep the number of parts under control. The total number of parts per server is one of the main things to monitor closely, and it’s surprisingly easy to let it get out of hand. In most cases, you should avoid partitioning the per-tenant tables. Since you already have a lot of tables, adding partitions on top multiplies the part count and makes things worse much faster.
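The multiplication is easy to see with some back-of-the-envelope arithmetic. The numbers below are illustrative assumptions, not our production figures: 10k tenant tables, 10 parts per partition (inside the normal 5 to 20 range), and either no partitioning or monthly partitions with a year of data.

```python
def estimated_parts(tables, partitions_per_table, parts_per_partition):
    """Rough total part count, assuming every table looks the same."""
    return tables * partitions_per_table * parts_per_partition


if __name__ == "__main__":
    # A table without a partition key has a single partition.
    unpartitioned = estimated_parts(10_000, 1, 10)
    # Monthly partitions with 12 months of data retained.
    monthly = estimated_parts(10_000, 12, 10)

    print(unpartitioned)  # 100000
    print(monthly)        # 1200000
```

Under these assumptions the unpartitioned setup already sits at the recommended 100k parts, and monthly partitions push it to 12 times that.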

Server startup time

ClickHouse loads all tables during startup. If you have a lot of tables, like 10k+, it can take multiple minutes. At peak, our servers were taking more than 5 minutes to start. There is a server setting called async_load_databases that controls whether tables are loaded asynchronously. At first glance, async load might look like a great idea, but it usually doesn’t work very well. It’s likely that you will be running more than one ClickHouse server and doing rolling restarts. If you use async_load_databases, the server will immediately announce to the world that it’s up and ready to serve requests. But if you send it a request for a table that is not yet loaded, the request will take a long time to finish. If you have any reasonable system load, this will usually end up causing a mini incident.
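The rolling-restart concern can be sketched as a loop that moves on to the next server only once the restarted one can really serve queries. Everything here is hypothetical: `restart` and `is_ready` are placeholders for whatever your deployment tooling provides, and the timeout and polling interval are arbitrary. The important assumption is that `is_ready` means "tables are loaded", not just "the port is open".

```python
import time


def rolling_restart(servers, restart, is_ready, timeout_s=600.0, poll_s=5.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Restart servers one at a time, waiting for each to become ready.

    `restart(server)` and `is_ready(server)` are caller-supplied callables.
    Raises TimeoutError if a server is not ready within `timeout_s` seconds.
    """
    for server in servers:
        restart(server)
        deadline = clock() + timeout_s
        while not is_ready(server):
            if clock() >= deadline:
                raise TimeoutError(f"{server} not ready after {timeout_s}s")
            sleep(poll_s)


if __name__ == "__main__":
    # Fake callables so the sketch runs without a real cluster.
    polls_left = {"ch-1": 2, "ch-2": 0}

    def fake_restart(server):
        print("restarting", server)

    def fake_is_ready(server):
        if polls_left[server] > 0:
            polls_left[server] -= 1
            return False
        return True

    rolling_restart(["ch-1", "ch-2"], fake_restart, fake_is_ready,
                    sleep=lambda seconds: None)
    print("done")
```

If readiness is reported before the tables are loaded, this loop happily moves on and takes down the next server while the first one is still slow, which is exactly the failure mode described above.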

It is best to load the tables synchronously and let the other servers handle the requests while it’s restarting. If you run it on Kubernetes, it also makes sense to have a...
