API Rate Limiting: Protecting Self-Hosted APIs from Abuse
Without rate limiting, a single client can overwhelm your self-hosted API. Here's how to implement effective rate limiting.
Why Rate Limit?
Without rate limiting, your API is vulnerable to:
- Denial of service: a single misbehaving or malicious client can exhaust CPU, memory, or database connections.
- Brute-force attacks: unthrottled login and token endpoints invite credential stuffing.
- Scraping: bulk extraction of your data at machine speed.
- Runaway costs: bandwidth and compute spent serving traffic you never intended to allow.
Rate Limiting Strategies
Fixed Window
Allow N requests per fixed time window (e.g., 100 requests per minute). Simple to implement, but permits bursts at window boundaries: a client can send 100 requests at the very end of one window and 100 more at the start of the next, 200 requests in a few seconds.
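A minimal in-memory sketch of a fixed window counter (the class name and interface are ours, and this is single-process only; a real deployment would keep counters in something shared like Redis):

```python
import time


class FixedWindowLimiter:
    """Allow at most `limit` requests per `window_seconds` window."""

    def __init__(self, limit, window_seconds):
        self.limit = limit
        self.window_seconds = window_seconds
        self.counts = {}  # window index -> request count

    def allow(self, now=None):
        now = time.time() if now is None else now
        # All timestamps in the same window share one counter.
        window = int(now // self.window_seconds)
        count = self.counts.get(window, 0)
        if count >= self.limit:
            return False
        self.counts[window] = count + 1
        return True
```

Note that the counter resets abruptly when a new window starts, which is exactly the boundary-burst weakness described above.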
Sliding Window
Counts requests over a rolling time period rather than fixed buckets, smoothing out the fixed window's boundary-burst problem.
Token Bucket
Tokens are added at a fixed rate. Each request consumes a token. If no tokens are available, the request is rejected. Allows short bursts while enforcing long-term limits.
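The refill logic can be computed lazily on each request instead of with a background timer. A sketch (class name and parameters are ours):

```python
import time


class TokenBucket:
    """Refill `rate` tokens per second up to `capacity`; one token per request."""

    def __init__(self, rate, capacity, now=None):
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity  # start full, so bursts are allowed immediately
        self.last = time.time() if now is None else now

    def allow(self, now=None):
        now = time.time() if now is None else now
        # Credit tokens for the time elapsed since the last request, capped.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

Capacity controls the burst size; rate controls the sustained long-term limit.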
Leaky Bucket
Requests are queued and processed at a fixed rate. Smooths traffic but adds latency.
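A simplified sketch of the queueing behavior (names are ours; real implementations drain the queue asynchronously, while this one drains lazily when a request arrives):

```python
import time
from collections import deque


class LeakyBucket:
    """Queue up to `capacity` requests and drain one every 1/rate seconds."""

    def __init__(self, rate, capacity, now=None):
        self.interval = 1.0 / rate  # seconds between processed requests
        self.capacity = capacity
        self.queue = deque()
        self.last_leak = time.time() if now is None else now

    def _drain(self, now):
        # "Process" queued requests at the fixed drain rate.
        while self.queue and now - self.last_leak >= self.interval:
            self.queue.popleft()
            self.last_leak += self.interval
        if not self.queue:
            # Nothing queued: restart the drain clock from now.
            self.last_leak = now

    def submit(self, request, now=None):
        now = time.time() if now is None else now
        self._drain(now)
        if len(self.queue) >= self.capacity:
            return False  # bucket overflow: reject the request
        self.queue.append(request)
        return True
```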
Implementation Levels
Reverse Proxy (Caddy/Nginx)
Rate limit at the proxy level before requests reach your application. Effective for DDoS and brute force protection.
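As a sketch in Nginx, using its built-in `limit_req` module (the zone name, rate, and upstream address here are illustrative):

```nginx
# In the http block: a 10 MB shared zone keyed by client IP,
# allowing a sustained 10 requests/second per address.
limit_req_zone $binary_remote_addr zone=api_limit:10m rate=10r/s;

server {
    location /api/ {
        # Permit short bursts of up to 20 requests, reject the excess
        # immediately (nodelay) instead of queueing it.
        limit_req zone=api_limit burst=20 nodelay;
        limit_req_status 429;
        proxy_pass http://127.0.0.1:8080;
    }
}
```

Nginx's `limit_req` implements the leaky bucket algorithm; `burst` sets the bucket depth.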
Application Middleware
Rate limit within your application code. More granular — different limits per endpoint, per user, per API key.
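A sketch of what that granularity can look like, assuming an in-memory, single-process service (the class, endpoint paths, and per-endpoint limits are all illustrative):

```python
import time
from collections import defaultdict, deque


class PerKeyLimiter:
    """A separate rolling window per (endpoint, API key) pair."""

    def __init__(self, limits):
        # limits: endpoint path -> (max_requests, window_seconds)
        self.limits = limits
        self.history = defaultdict(deque)  # (endpoint, key) -> timestamps

    def allow(self, endpoint, api_key, now=None):
        now = time.time() if now is None else now
        limit, window = self.limits[endpoint]
        log = self.history[(endpoint, api_key)]
        # Evict requests that have aged out of this endpoint's window.
        while log and log[0] <= now - window:
            log.popleft()
        if len(log) >= limit:
            return False
        log.append(now)
        return True
```

In a real service this check would run as middleware before the handler, returning 429 when `allow` is False.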
API Gateway
Dedicated rate limiting service. Best for complex APIs with multiple backends.
Recommended Limits
Public APIs
Start relatively generous, on the order of 60–100 requests per minute per client, and tighten based on observed usage.
Authentication Endpoints
Limit aggressively, e.g. a handful of attempts per minute per IP and per account, since login and token endpoints are prime brute-force targets.
Webhooks
Limit per sender, and prefer queueing over outright rejection where possible, since many webhook providers retry failed deliveries.
Response Headers
Always include rate limit headers so clients can self-regulate. The de facto standard is the X-RateLimit family:
- X-RateLimit-Limit: the maximum number of requests allowed in the current window
- X-RateLimit-Remaining: how many requests the client has left in the window
- X-RateLimit-Reset: when the window resets (typically a Unix timestamp)
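A rejected request might then look like this (all values illustrative):

```http
HTTP/1.1 429 Too Many Requests
Content-Type: application/json
Retry-After: 42
X-RateLimit-Limit: 100
X-RateLimit-Remaining: 0
X-RateLimit-Reset: 1700000042

{"error": "rate limit exceeded"}
```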
HTTP Status Codes
Return 429 Too Many Requests when a client exceeds its limit, ideally with a Retry-After header indicating when it may try again. Never silently drop rate-limited requests; always inform the client.