Why Direct LLM API Calls Break In Production
A category-defining guide to the boring failure modes behind LLM calls: timeouts, rate limits, worker restarts, duplicate retries, and unknown outcomes.
Blog
Practical writing on durable LLM requests, retries, idempotency, wait mode, and the failure modes that show up in production.
Implementation Notes
A look at the small worker loop behind ReqRun: durable queue rows, lock tokens, attempts, retryable failures, terminal failures, and backoff with jitter.
Production Patterns
Retries are dangerous without dedupe. This post explains how to choose idempotency keys for LLM tasks and how ReqRun deduplicates by project.
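One common way to build such a key is to hash the canonicalized task inputs and scope the result to the project. A minimal sketch under that assumption, with hypothetical names (`idempotency_key`, `Deduper`) and an in-memory table standing in for durable storage:

```python
import hashlib
import json

def idempotency_key(project: str, task: dict) -> str:
    """Deterministic key: the same project plus the same canonicalized
    inputs always map to the same key, so duplicate retries collapse."""
    canonical = json.dumps(task, sort_keys=True, separators=(",", ":"))
    digest = hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:32]
    return f"{project}:{digest}"

class Deduper:
    """Per-project dedupe table: the first submit wins, and repeats
    return the original request id instead of creating new work."""
    def __init__(self) -> None:
        self._seen: dict[str, str] = {}

    def submit(self, project: str, task: dict, request_id: str) -> str:
        key = idempotency_key(project, task)
        return self._seen.setdefault(key, request_id)
```

Scoping the key by project matters: two tenants sending identical inputs should get separate requests, not share a cached result.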
Some LLM requests finish quickly. Some do not. wait=true gives developers a synchronous happy path without giving up durable async recovery.
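The key property of wait mode is that the synchronous path blocks on the same durable record the async path uses, so a timeout loses nothing. A toy sketch of that shape, with hypothetical names (`RequestStore`, `wait`) and a dict in place of real storage:

```python
import time
import uuid

class RequestStore:
    """Toy model of wait mode: every request is durably recorded first,
    and wait merely blocks on the same record the async path reads."""
    def __init__(self) -> None:
        self._rows: dict[str, dict] = {}

    def submit(self, prompt: str) -> str:
        rid = uuid.uuid4().hex
        self._rows[rid] = {"prompt": prompt, "status": "pending", "result": None}
        return rid

    def finish(self, rid: str, result: str) -> None:
        self._rows[rid].update(status="done", result=result)

    def wait(self, rid: str, timeout: float, poll: float = 0.01) -> dict:
        """Synchronous happy path: block until done or until timeout.
        On timeout the caller gets the still-pending durable record
        and falls back to polling or webhooks; no work is lost."""
        deadline = time.monotonic() + timeout
        while time.monotonic() < deadline:
            row = self._rows[rid]
            if row["status"] == "done":
                return row
            time.sleep(poll)
        return self._rows[rid]
```

Because `submit` writes the record before anyone waits on it, a client that times out (or disconnects) can recover the outcome later from the same row.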
Webhook senders retry delivery. If a webhook triggers LLM work, your handler needs idempotency and a durable model request record.
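The handler-side pattern can be sketched as keying on the sender's delivery id and recording the model request before doing any work. A minimal illustration, assuming a hypothetical `WebhookHandler` with an in-memory table in place of durable storage:

```python
class WebhookHandler:
    """Webhook senders redeliver on any non-2xx or timeout, so the
    handler keys on the delivery id: a redelivered event maps back to
    the same model request record instead of triggering new LLM work."""
    def __init__(self) -> None:
        self._requests: dict[str, str] = {}

    def handle(self, delivery_id: str, payload: dict) -> str:
        if delivery_id in self._requests:
            # Duplicate delivery: acknowledge with the existing record.
            return self._requests[delivery_id]
        request_id = f"req-{len(self._requests) + 1}"
        # Record durably BEFORE kicking off the LLM call, so a crash
        # between enqueue and response still leaves a traceable row.
        self._requests[delivery_id] = request_id
        return request_id
```

Most webhook senders include a stable delivery id in a header, which makes it a natural idempotency key for the downstream LLM request.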
ReqRun is intentionally narrow. It does not coordinate every step in your system; it makes the LLM request step durable and visible.