⚙ Engineering Deep-Dive

Scalable Seat-Booking System

A production-grade distributed system solving the double-booking problem under high concurrency, with load tests, race-condition proofs, and architecture diagrams.

- 16.6× cache speedup (8.9 ms vs 148 ms)
- 62.5% cache hit rate (Redis layer, TTL 300s)
- 37 RPS peak throughput (80 concurrent users)
- 0 / 50 double-bookings (50-user race-condition test)

🎯 The Problem

Seat booking is a classic distributed systems problem: when two users click "Book" on the same seat at the same millisecond, a naive implementation will issue two INSERT statements that both succeed, resulting in a double-booking. This is not a hypothetical edge case; it is the default behaviour of any system that reads availability before writing a reservation.

The challenge is to guarantee exactly-once booking under arbitrary concurrency without serialising every request through a single lock, which would destroy throughput. This system solves it at the database layer using a composite UNIQUE(event_id, seat_id) constraint on the tickets table, so the database itself becomes the arbiter.

⚡ Key insight
No application-level mutex needed. The DB constraint turns a race into first-write-wins with automatic rollback and an HTTP 409 for the losers: atomic and correct under any load.
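
The mechanism can be demonstrated in isolation. A minimal sketch using SQLite's in-memory database (the production system uses PostgreSQL, but the UNIQUE semantics are the same):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE tickets (
        id       INTEGER PRIMARY KEY,
        event_id INTEGER NOT NULL,
        seat_id  INTEGER NOT NULL,
        UNIQUE (event_id, seat_id)  -- the arbiter: first write wins
    )
""")

# First booking for (event 1, seat 42) succeeds.
conn.execute("INSERT INTO tickets (event_id, seat_id) VALUES (1, 42)")

# A concurrent second booking for the same seat violates the constraint.
try:
    conn.execute("INSERT INTO tickets (event_id, seat_id) VALUES (1, 42)")
except sqlite3.IntegrityError:
    print("409 Conflict: seat already booked")
```

Exactly one row for the seat survives, no matter how many writers race.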
๐Ÿ—

System Architecture

Seven containerised services communicate over a shared Docker network. The API tier is stateless (all shared state lives in Postgres and Redis), so horizontal scaling is as simple as adding more API replicas behind a load balancer.

System Architecture Diagram
| Service  | Image             | Port         | Role                              |
|----------|-------------------|--------------|-----------------------------------|
| api      | FastAPI + Uvicorn | 8000         | HTTP API, JWT auth, booking logic |
| db       | PostgreSQL 15     | 5434 → 5432  | Primary data store                |
| redis    | Redis 7           | 6380 → 6379  | Cache + metrics counters          |
| rabbitmq | RabbitMQ 3.13     | 5672 / 15672 | Message broker                    |
| worker   | Celery 5.4        | n/a          | Async email notifications         |
| mailpit  | Mailpit           | 8025 / 1025  | Dev email capture                 |
| test-db  | PostgreSQL 15     | 5433 → 5432  | Isolated test database            |

🔄 Request Flow

Every POST /api/v1/bookings call passes through six distinct layers before a 201 is returned, and three additional async steps fire after the response is sent to the client.

Request Flow Swimlane Diagram
🔒 Where the lock happens
db.flush() materialises the INSERT inside the open transaction. PostgreSQL enforces UNIQUE(event_id, seat_id) at flush time (not commit time), so conflicts surface immediately and the transaction is rolled back before any commit overhead.

🧪 Concurrency Proof

The test fires 50 simultaneous booking requests for a single seat using asyncio.gather, the closest approximation to a real thundering-herd scenario in a test suite.
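
The real test drives the live API over HTTP; here is a self-contained sketch of the same shape, with a dict standing in for the UNIQUE constraint (dict.setdefault is atomic within a single event loop, so exactly one coroutine wins):

```python
import asyncio

# 50 coroutines race to claim one seat. In the real suite each book() is an
# httpx request against the API; here a dict plays the database's role.
claimed: dict[tuple[int, int], int] = {}

async def book(user_id: int, event_id: int = 1, seat_id: int = 7) -> int:
    await asyncio.sleep(0)  # yield control so tasks genuinely interleave
    winner = claimed.setdefault((event_id, seat_id), user_id)
    return 201 if winner == user_id else 409

async def race() -> list[int]:
    return await asyncio.gather(*(book(u) for u in range(50)))

statuses = asyncio.run(race())
print(statuses.count(201), statuses.count(409))  # expect: 1 49
```

The shape of the result matches the proof below: one 201, forty-nine 409s.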

Concurrency Proof: 50-user race result
- 1 × HTTP 201: successful booking
- 49 × HTTP 409: conflict, seat taken

The constraint that makes this work:

# backend/app/models/booking.py
from sqlalchemy import Column, ForeignKey, Integer, UniqueConstraint

# Base is the project's SQLAlchemy declarative base

class Ticket(Base):
    __tablename__ = "tickets"

    id         = Column(Integer, primary_key=True)
    booking_id = Column(Integer, ForeignKey("bookings.id"))
    event_id   = Column(Integer, ForeignKey("events.id"))
    seat_id    = Column(Integer, ForeignKey("seats.id"))

    __table_args__ = (
        # one row per (event, seat): a second INSERT for the same seat fails
        UniqueConstraint("event_id", "seat_id", name="_event_seat_uc"),
    )

The service layer catches the database integrity error and converts it to a 409:

# backend/app/services/booking_service.py
from fastapi import HTTPException
from sqlalchemy.exc import IntegrityError

try:
    db.flush()          # triggers UNIQUE constraint check
    db.commit()
    return booking
except IntegrityError:
    db.rollback()       # discard the losing transaction
    raise HTTPException(status_code=409, detail="Seat already booked")
Concurrency Mechanism Timeline

⚡ Performance & Caching

A Redis read-through cache sits in front of every GET /events/:id and event-listing query. On a cache miss the result is stored with a 300-second TTL, and atomic counters (Redis INCR) track hits, misses, and latency for the live /api/v1/metrics endpoint.

Performance Dashboard (4-panel chart)
- 8.9 ms cached response (Redis hit)
- 148 ms DB response (cache miss)
- 16.6× speedup (cache vs. DB)

Cache instrumentation (simplified):

# backend/app/services/cache_service.py
import json
import time

# redis: an async client (redis.asyncio.Redis) configured at startup

async def get_event(event_id: int) -> dict | None:
    t0 = time.monotonic()
    data = await redis.get(f"event:{event_id}")
    latency_ms = (time.monotonic() - t0) * 1000

    if data:
        await redis.incr("metrics:cache_hits")
        await redis.incrbyfloat("metrics:cache_ms_total", latency_ms)
        return json.loads(data)

    await redis.incr("metrics:cache_misses")
    return None  # caller fetches from DB and populates cache
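
The caller side of the read-through (populate on miss) is not shown above. Here is a self-contained sketch of that pattern, with a plain dict standing in for Redis and a hard-coded fake_db standing in for Postgres; the real code calls redis.set(key, payload, ex=300) to get the TTL:

```python
import asyncio
import json

# In-memory stand-ins for Redis and Postgres, for illustration only.
store: dict[str, str] = {}
fake_db = {1: {"id": 1, "name": "Concert"}}
hits = misses = 0  # the real service uses Redis INCR counters

async def get_event_cached(event_id: int) -> dict:
    global hits, misses
    key = f"event:{event_id}"
    if key in store:                   # cache hit
        hits += 1
        return json.loads(store[key])
    misses += 1                        # cache miss: go to the DB
    event = fake_db[event_id]
    store[key] = json.dumps(event)     # populate; real code adds ex=300 (TTL)
    return event

async def demo():
    await get_event_cached(1)  # miss, populates the cache
    await get_event_cached(1)  # hit
    return hits, misses

print(asyncio.run(demo()))  # (1, 1)
```

Two calls for the same event touch the DB once; every subsequent read within the TTL is served from the cache.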

📐 LLD / Data Model

Six core entities. The Ticket table is the junction between a Booking and an Event + Seat pair, and it carries the uniqueness constraint that enforces single-occupancy.

LLD Class Diagram (UML)
Entity Relationship Diagram

🧠 Design Decisions

Every non-trivial architectural choice involved a trade-off. The diagram below documents five key decisions with the alternatives considered and the reasoning for the final choice.

Design Decisions: 5 trade-off cards
| Decision            | Chosen                    | Why not the alternative                                                     |
|---------------------|---------------------------|-----------------------------------------------------------------------------|
| Concurrency lock    | DB UniqueConstraint       | Redis SETNX: extra round-trip, TTL risk; pessimistic lock: serialises all writes |
| Cache invalidation  | TTL 300s (time-based)     | Event-driven invalidation: overkill for read-heavy event data               |
| Async notifications | Celery + RabbitMQ         | Inline SMTP: blocks response; FastAPI BackgroundTask: no retry on crash     |
| Auth                | JWT (stateless)           | Session cookies: require a session store, harder to scale horizontally      |
| DB pool             | pool_size=20, overflow=10 | Single connection: serialises all queries; unlimited pool: OOM under burst  |
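
The pool row translates directly into engine configuration. A sketch of how those numbers would appear in the SQLAlchemy setup (the DSN is a placeholder, not the project's real connection string):

```python
from sqlalchemy import create_engine

# 20 steady-state connections, plus up to 10 overflow connections that are
# opened under burst load and closed again when the burst subsides.
engine = create_engine(
    "postgresql+psycopg2://user:pass@db:5432/booking",  # placeholder DSN
    pool_size=20,
    max_overflow=10,
)
```

An unlimited pool would let a traffic burst exhaust Postgres connections; a hard cap of 30 bounds memory and lets excess requests queue instead.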

🛠 Tech Stack

Tech Stack and Service Map
| Technology               | Why it's here                                                              |
|--------------------------|----------------------------------------------------------------------------|
| FastAPI 0.116            | Async Python API framework, OpenAPI docs out of the box                    |
| PostgreSQL 15            | Primary store: ACID transactions, UniqueConstraint as the concurrency lock |
| Redis 7                  | Read-through cache, atomic INCR counters for metrics, TTL 300s             |
| Celery 5.4 + RabbitMQ    | Async notification pipeline, acks_late=True for guaranteed delivery        |
| SQLAlchemy 2             | ORM with pool_size=20, max_overflow=10 to handle burst traffic             |
| Next.js 15 (App Router)  | Frontend: SSR + client components, deployed at booking.404by.me            |
| Docker Compose           | 7-service local stack: api, db, redis, rabbitmq, worker, mailpit, test-db  |
| Alembic                  | Schema migrations: version-controlled DB changes                           |

📈 Scaling Roadmap

The current architecture handles ~37 RPS comfortably on a single API replica. Here is the path to 10×, 100×, and beyond without changing the core booking logic.

Scaling Roadmap
- Now: 1 API replica, Postgres primary, Redis single node; ~37 RPS proven
- Production: 3–5 API replicas, Postgres read replicas, Redis Cluster, CDN for static assets
- Hyperscale: Kubernetes HPA, sharded Postgres, event sourcing, global edge cache