Why Your Vercel Functions Are Stuck in One Region
Vercel Functions support 19 regions. You can deploy a Next.js API route to São Paulo, Tokyo, or Frankfurt. The compute layer is global.
In practice, most Vercel projects deploy functions to whichever region matches their database. PostgreSQL on us-east-1? Functions go to iad1. Database in eu-west-1? Functions go to dub1. The Vercel docs say it directly: "configure your function region to match your database region." Compute follows data.
This means the global serverless infrastructure that Vercel built goes unused. A user in Singapore hits your API, the request flies to Virginia, queries a database sitting a millisecond away, and flies back. The function is fast. The network is slow. Your P95 latency is 400ms+ and there's nothing in your code to fix it.
Fluid compute is great, but not global
Vercel's original attempt at solving this was Edge Functions: run JavaScript globally, in the region closest to the user. The idea was right. The execution was limited. The edge runtime lacked full Node.js compatibility, had a restricted API surface, and couldn't hold persistent TCP connections to databases. A function running closer to the user still had to cross the ocean to reach PostgreSQL.
Vercel moved away from Edge Functions and built Fluid compute. Fluid compute is genuinely good infrastructure: full Node.js compatibility, standards-based, persistent connections, scale to one, longer execution times. The engineering is a clear improvement over the edge runtime.
But Fluid compute's default model runs your functions "closer to where your data lives." The direction changed. Instead of distributing compute to users, the default optimizes for colocating compute with the database. The data layer problem that limited Edge Functions was never addressed. It just stopped being the focus.
The connection tax
Even when your function is colocated with your database, serverless PostgreSQL connections are expensive.
Each new connection requires:
- TCP handshake: 1 round trip
- TLS negotiation: 1-2 round trips (TLS 1.3 is 1 RTT; TLS 1.2 is 2)
- PostgreSQL startup message: 1 round trip
- Authentication (SCRAM-SHA-256): 2 round trips (client-first, server-challenge, client-response, server-final)
Five to six round trips before your first query executes. When function and database are in the same region, each round trip is sub-millisecond, so this costs ~5ms. Tolerable.
When they're in different regions, the math breaks. us-east-1 to ap-southeast-1 (Singapore) is ~150ms per round trip. That's 750ms+ of connection overhead before a single row is returned. Your API endpoint takes 800ms to serve 50 rows that PostgreSQL produces in 2ms.
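The arithmetic above reduces to a one-line helper. A sketch with illustrative round-trip times (the RTT figures are examples, not measurements):

```typescript
// Estimate connection-setup cost before the first query executes.
// roundTrips is 5 (TLS 1.3) or 6 (TLS 1.2), per the handshake breakdown above.
function connectionOverheadMs(rttMs: number, roundTrips: number): number {
  return rttMs * roundTrips;
}

// Same region: ~1ms round trips.
const sameRegion = connectionOverheadMs(1, 5); // ~5ms, tolerable
// us-east-1 to ap-southeast-1: ~150ms round trips.
const crossRegion = connectionOverheadMs(150, 5); // ~750ms before the first row

console.log({ sameRegion, crossRegion });
```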
Vercel Functions can reuse connections across invocations when instances stay warm. But this is an optimization, not a guarantee. Under load, new instances spin up with cold connections. During traffic spikes, most connections are new.
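The usual pattern for connection reuse is to hoist the pool out of the handler so warm instances keep it alive. A minimal sketch, where `makePool` stands in for your driver's pool constructor (not a real Vercel or pg API):

```typescript
// Cache the pool on globalThis so a warm function instance reuses
// already-open connections instead of paying the handshake again.
type Pool = { query: (sql: string) => Promise<unknown> };

function getPool(makePool: () => Pool): Pool {
  const g = globalThis as { __pool?: Pool };
  if (!g.__pool) {
    g.__pool = makePool(); // cold instance: pays connection setup once
  }
  return g.__pool; // warm instance: free
}
```

Under a traffic spike, every newly spun-up instance still starts cold, which is why this remains an optimization rather than a guarantee.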
Connection exhaustion
The problem compounds under concurrency. Each function invocation opens its own connection to PostgreSQL. 200 concurrent users means 200 simultaneous connections. Most managed PostgreSQL instances (RDS, Cloud SQL, Supabase) cap at 100-500 connections.
Serverless compute scales to thousands of instances. PostgreSQL was designed for a fixed number of long-lived clients. The mismatch is architectural: you hit "too many connections" errors during the exact moments your application needs to scale.
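A back-of-envelope check makes the mismatch concrete. PostgreSQL reserves a few slots for superusers (`superuser_reserved_connections`, default 3), so ordinary clients get slightly fewer than `max_connections`:

```typescript
// Slots actually available to application clients.
function availableSlots(maxConnections: number, reserved: number = 3): number {
  return maxConnections - reserved;
}

// Without pooling, each concurrent invocation holds its own connection.
function willExhaust(concurrent: number, maxConnections: number): boolean {
  return concurrent > availableSlots(maxConnections);
}

willExhaust(200, 100); // 200 invocations vs 97 usable slots: exhausted
willExhaust(200, 500); // fine, until traffic doubles again
```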
The solutions that exist lock you in
Cloudflare Hyperdrive is the closest thing to a real solution. It does connection pooling and query caching at the wire protocol level, globally distributed, with support for any PostgreSQL database. The engineering is solid. But it only works from Cloudflare Workers. If you're on Vercel, Lambda, Railway, or anything else, it's not available.
Neon's serverless driver replaces TCP with WebSocket/HTTP connections, eliminating the 4-RTT handshake. But it only works with Neon databases. If you're on RDS, Cloud SQL, Supabase, or self-hosted PostgreSQL, it's not an option. It also doesn't solve the global problem: your queries still travel to whichever single region Neon hosts your database.
Prisma Accelerate provides connection pooling and caching, but requires the Prisma ORM. Teams using Drizzle, Knex, Kysely, or raw pg can't use it.
Self-hosted PgBouncer works with any client, but runs in a single region, doesn't cache queries, and adds operational overhead you're using serverless to avoid.
Each approach either locks you into a compute platform, a database provider, a specific ORM, or a single region. None of them solve the actual problem: making your data layer globally reachable without vendor lock-in.
The missing layer
The problem was never that compute couldn't be global. Vercel solved that. The problem is that PostgreSQL is a single-region service, and everything between your function and your database (connection setup, query execution, result transfer) pays the cost of that distance.
The fix is a proxy layer that:
- Runs in multiple regions, close to where functions execute
- Maintains persistent, authenticated TLS connections to the upstream database
- Pools connections so thousands of function invocations share a small number of database connections
- Caches read query results regionally so repeated reads never cross the ocean
PgBeam is that layer. It speaks the PostgreSQL wire protocol natively. No SDK, no ORM adapter, no driver swap. You change one environment variable:
```bash
# Before: direct to RDS in us-east-1
DATABASE_URL=postgresql://user:pass@mydb.us-east-1.rds.amazonaws.com:5432/mydb

# After: through PgBeam
DATABASE_URL=postgresql://user:pass@abc.gw.pgbeam.app:5432/mydb
```

Your application code stays identical:
```ts
// Prisma
const users = await prisma.user.findMany();

// Drizzle
const users = await db.select().from(usersTable);

// Raw pg
const { rows } = await pool.query("SELECT * FROM users");
```

Warm connections change the math
Latency-based DNS resolves your PgBeam hostname to the nearest proxy node. When a Vercel Function connects, it completes the PostgreSQL handshake locally in single-digit milliseconds. The proxy already holds open, authenticated TLS connections to your upstream database. The expensive cross-region connection setup is paid once per upstream pool connection and shared across thousands of function invocations.
This matters even for writes and cache misses. A function in Mumbai (bom1) querying a database in us-east-1:
- Direct: 5-6 round trips at ~180ms each for connection setup alone. Total with query: ~1,484ms.
- Through PgBeam (cache miss): local connection to PgBeam Mumbai (~5ms), query forwarded over PgBeam's existing upstream connection. Total: ~332ms.
- Through PgBeam (cache hit): served from PgBeam's Mumbai cache. Total: ~10ms.
Even without caching, warm upstream connections cut Mumbai-to-Virginia latency by 4.5x. With caching, it's 148x. These are real measurements from 20 Vercel regions.
PgBeam eliminates connection setup overhead, not the speed of light. Writes and cache misses from Mumbai still cross the ocean, but they do it over an already-open connection instead of establishing a new one from scratch on every request.
How the pool works
PgBeam's default mode is transaction pooling. A function acquires an upstream connection for the duration of a transaction (or a single query outside a transaction), then returns it. 200 concurrent function invocations share 10-20 upstream connections, keeping your database's connection count stable during traffic spikes.
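The multiplexing described above is essentially a counting semaphore over upstream connections. A miniature sketch (not PgBeam's implementation; pool size and the `work` callback are illustrative):

```typescript
// Transaction pooling in miniature: many concurrent clients share a small
// fixed set of upstream "connections" via a counting semaphore.
class UpstreamPool {
  private inUse = 0;
  private waiters: Array<() => void> = [];
  constructor(private size: number) {}

  async withConnection<T>(work: () => Promise<T>): Promise<T> {
    if (this.inUse >= this.size) {
      // All upstream connections busy: queue until one is released.
      await new Promise<void>((resolve) => this.waiters.push(resolve));
    }
    this.inUse++;
    try {
      return await work(); // connection held only for this transaction
    } finally {
      this.inUse--;
      this.waiters.shift()?.(); // wake the next queued client, if any
    }
  }
}
```

With a pool of 10-20, 200 concurrent invocations queue briefly instead of opening 200 upstream connections.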
When a connection returns to the pool, PgBeam checks whether the session was modified: SET statements, PREPARE, temporary tables, LISTEN/NOTIFY. Clean sessions go back immediately. Dirty sessions get a DISCARD ALL before returning. This matters because most serverless queries are stateless CRUD operations that don't touch session state. If your application relies on prepared statements or session variables, session mode is also available, where each client gets a dedicated upstream connection for the lifetime of the session.
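The dirtiness check can be sketched as a classifier over the statements a session ran. The real proxy tracks this at the wire-protocol level; this keyword scan is only illustrative:

```typescript
// Statement shapes that leave state on a session, making the connection
// unsafe to hand to another client without a reset first.
const SESSION_MODIFYING: RegExp[] = [
  /^\s*SET\b/i,          // session variables
  /^\s*PREPARE\b/i,      // named prepared statements
  /^\s*CREATE\s+TEMP/i,  // TEMP or TEMPORARY tables
  /^\s*LISTEN\b/i,       // LISTEN/NOTIFY subscriptions
];

function dirtiesSession(sql: string): boolean {
  return SESSION_MODIFYING.some((re) => re.test(sql));
}

function resetStatement(sql: string): string | null {
  // Clean sessions return to the pool as-is; dirty ones are reset first.
  return dirtiesSession(sql) ? "DISCARD ALL" : null;
}
```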
The pool maintains separate connection groups per credential pair. Multi-tenant setups where different services use different database users work without configuration.
If the upstream database goes down, a circuit breaker trips after consecutive connection failures and backs off exponentially. New connections fail immediately instead of hanging for TCP timeouts. When the database recovers, a probe connection succeeds and the circuit closes.
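The breaker described above reduces to a small state machine. The threshold and backoff base here are illustrative, not PgBeam's actual values:

```typescript
// Minimal circuit breaker: open after N consecutive failures, back off
// exponentially, let one probe through once the backoff has elapsed.
class CircuitBreaker {
  private failures = 0;
  private openedAt = 0;
  constructor(private threshold = 5, private baseBackoffMs = 1000) {}

  canAttempt(now: number): boolean {
    if (this.failures < this.threshold) return true; // circuit closed
    const backoff = this.baseBackoffMs * 2 ** (this.failures - this.threshold);
    return now - this.openedAt >= backoff; // probe allowed after backoff
  }

  recordSuccess(): void {
    this.failures = 0; // probe succeeded: close the circuit
  }

  recordFailure(now: number): void {
    this.failures++;
    if (this.failures >= this.threshold) this.openedAt = now;
  }
}
```

While open, `canAttempt` returns false immediately, so clients fail fast instead of hanging on TCP timeouts.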
Query caching for reads that don't need to cross regions
PgBeam caches query results at the proxy layer. When enabled, repeated reads are served from a regional cache without hitting the upstream database.
You control caching per query with SQL comments. Product catalogs and feature flags get cached. Transactional queries always hit the database. The cache uses stale-while-revalidate: when an entry expires, the first request triggers a background refresh while still serving the stale result. This avoids thundering herd on popular queries.
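Stale-while-revalidate can be sketched as a TTL map where an expired entry is still served while the first stale read triggers a refresh. A simplified model with an injected clock (the `refresh` hook stands in for the upstream query):

```typescript
// TTL cache with stale-while-revalidate: expired entries are still served,
// but the first read past the TTL kicks off a background refresh, so only
// one request pays for revalidation (no thundering herd).
type Entry<T> = { value: T; storedAt: number };

class SwrCache<T> {
  private entries = new Map<string, Entry<T>>();
  constructor(private ttlMs: number) {}

  get(key: string, now: number, refresh: () => Promise<T>): T | undefined {
    const entry = this.entries.get(key);
    if (!entry) return undefined; // true miss: caller queries upstream
    if (now - entry.storedAt > this.ttlMs) {
      // Stale: serve the old value, refresh in the background.
      refresh().then((value) => this.entries.set(key, { value, storedAt: now }));
    }
    return entry.value;
  }

  set(key: string, value: T, now: number): void {
    this.entries.set(key, { value, storedAt: now });
  }
}
```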
The cache is TTL-based. A write in one region does not immediately invalidate caches in other regions. If you cache a query with a 60-second TTL, reads in other regions may return stale data for up to 60 seconds after a write. For queries where this matters, don't cache them.
Global routing unlocks global functions
PgBeam runs in 6 regions today: us-east-1, us-west-2, eu-west-1, ap-south-1, ap-southeast-1, and ap-northeast-1. These 6 were chosen to maximize global coverage across Vercel's 19 function regions. Latency-based DNS routes each connection to the nearest proxy.
This is what makes global Vercel Functions practical. Deploy your functions to multiple regions. Each function connects to the nearest PgBeam node. Cached reads resolve locally. Writes and cache misses route through PgBeam's warm upstream connections to the origin database, still faster than a cold connection from the function directly.
Real numbers from our benchmark page, where 20 Vercel Functions (one per Vercel region) query a PostgreSQL database in us-east-1:
| Vercel region | Direct | PgBeam (miss) | PgBeam (hit) | Proxy region |
|---|---|---|---|---|
| iad1 Washington, D.C. | 222ms | 159ms | 13ms | us-east-1 |
| pdx1 Portland | 581ms | 214ms | 11ms | us-west-2 |
| dub1 Dublin | 646ms | 217ms | 10ms | eu-west-1 |
| hnd1 Tokyo | 1,186ms | 279ms | 11ms | ap-northeast-1 |
| bom1 Mumbai | 1,484ms | 332ms | 10ms | ap-south-1 |
| sin1 Singapore | 1,782ms | 377ms | 10ms | ap-southeast-1 |
| gru1 São Paulo | 1,003ms | 553ms | 74ms | us-east-1 |
| syd1 Sydney | 1,566ms | 1,019ms | 567ms | ap-southeast-1 |
Regions with a nearby PgBeam proxy node (top 6) see the largest gains. Regions without one (São Paulo, Sydney) still benefit from connection pooling but the cache-hit latency reflects the distance to the nearest proxy. All 20 regions are available on the benchmark page.
Your function is no longer stuck in iad1. The database is still in one region, but the data layer is global.
No lock-in
PgBeam works with any PostgreSQL database, any ORM, and any compute platform. If you move from Vercel to Lambda, or from RDS to Supabase, the connection string stays the same. No SDK to remove, no driver to swap, no code to change.
Try it
Sign up at dash.pgbeam.com, add your database, swap the host in your .env, and deploy. Check the live benchmarks to see latency numbers from 20 Vercel regions running against a real PostgreSQL database.