Ozgur Guler · A public record

Production AI, from first principles.

I build, write about, and ship production AI systems — agent harnesses, long-running workflows, inference architecture, EvalOps, and the infrastructure underneath.

A curated record: books in progress, essays, code, build notes, talks, and selected startup work. No hype, no metrics theatre — just the trail.

agents inference books writing contact

Now: Drafting AI Inference Engineering and AI Agents in Production.
Building: Durable agent harnesses on Azure AI Foundry · MCP boundaries · EvalOps.
Open to: Talks, sober technical consulting, and selected startup work.

Selected work

A curated index.

Practice

Two tracks, one discipline.

Production AI is the meeting point of model behaviour, infrastructure economics, and operator trust. I work where those three meet.

Agents

Memory, state, and forgetting policies
Durable execution and long-running workflows
Tool boundaries, MCP, and approval gates
Evals, replay, and run-trace observability
Governance and enterprise deployment

Inference

Serving topology and model routing
Latency, throughput, and batching
KV-cache pressure and scheduling
GPU economics and AI-factory architecture
Benchmarking, reliability, observability

Books

Book drafts.

Inference Engineering

Prod Agents

AI & Data

AI Security

Recent

Writing and build notes.

Essays

Build log

Startup work

Public-safe notes.

Selected involvement; deep technical work, public framing.

Enlighty.ai

AI-native consumer intelligence platform turning fragmented data into trusted insight.

Eachlabs.ai

AI workflow and model platform for app builders, with curated image, video, voice, and text models.

Ozgur Guler
AI systems builder.

Est. MMXXIV
Set in Newsreader, Inter, & JetBrains Mono.
Built with Astro · No tracking · No nonsense.

Reach

email github medium substack linkedin x.com