AI | orndorff.dev

Want an LLM to Be a Domain Expert? Build It a Wiki — Automatically.

Everyone wants their LLM to be an expert in their thing: their product, their policies, their field. The default playbook is to fine-tune a model on it. That’s expensive, opaque, frozen the day you finish, and it still makes things up. There’s a cheaper, more honest move: don’t teach the model your domain. Hand it a wiki. The idea: Take your pile of documents. Have a model read all of it and write the wiki you wish existed, with every concept as its own page, every relationship as a link between pages, and the whole thing backed by a database the model can search and traverse. ...
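To make the shape of that idea concrete, here is a minimal sketch of "pages plus links in a searchable, traversable database." It is an illustration under stated assumptions, not the post's actual implementation: the schema, the function names (`add_page`, `search`, `neighbors`), and the three sample pages are all hypothetical, and it uses Python's built-in `sqlite3` with a plain substring match purely to stay self-contained.

```python
import sqlite3

# Hypothetical schema: one row per concept page, one row per link between pages.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE pages (slug TEXT PRIMARY KEY, title TEXT, body TEXT);
    CREATE TABLE links (src TEXT, dst TEXT, PRIMARY KEY (src, dst));
""")

def add_page(conn, slug, title, body, links=()):
    """Store a page and its outgoing links (names here are illustrative)."""
    conn.execute("INSERT INTO pages VALUES (?, ?, ?)", (slug, title, body))
    conn.executemany(
        "INSERT OR IGNORE INTO links VALUES (?, ?)",
        [(slug, dst) for dst in links],
    )

def search(conn, term):
    """Return slugs of pages whose title or body contains the term."""
    pattern = f"%{term}%"
    rows = conn.execute(
        "SELECT slug FROM pages WHERE title LIKE ? OR body LIKE ? ORDER BY slug",
        (pattern, pattern),
    )
    return [row[0] for row in rows]

def neighbors(conn, slug, depth=1):
    """Breadth-first walk over outgoing links, up to `depth` hops."""
    seen = {slug}
    frontier = [slug]
    for _ in range(depth):
        next_frontier = []
        for src in frontier:
            rows = conn.execute(
                "SELECT dst FROM links WHERE src = ? ORDER BY dst", (src,)
            )
            for (dst,) in rows:
                if dst not in seen:
                    seen.add(dst)
                    next_frontier.append(dst)
        frontier = next_frontier
    seen.discard(slug)
    return sorted(seen)

# Made-up sample pages, only to exercise the two operations.
add_page(conn, "refund-policy", "Refund policy",
         "Refunds are issued within the return window.", links=["return-window"])
add_page(conn, "return-window", "Return window",
         "Items can be returned within 30 days.", links=["store-credit"])
add_page(conn, "store-credit", "Store credit",
         "Issued when a refund is not possible.")
```

With those two operations exposed as tools, a model could look up a starting page with `search(conn, "refund")` and then pull in related pages with `neighbors(conn, "refund-policy", depth=2)` instead of relying on what it memorized.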

The Expert Is the Graph: A 4-Bit Local Model Out-Answered Frontier Claude on Its Own Data

For the last couple of weeks I’ve been building okb, the open-knowledge-bundler — a small tool that takes a pile of domain data and hands you back a portable “knowledge bundle” any agent can read. Under the hood it builds a DuckDB knowledge graph, exports it as a cat-readable open knowledge format bundle packaged as a Claude-compatible “skill,” and ships a chat agent that answers questions about it. The whole thing runs fully local, on a single AMD chip on my desk. No API keys, no cloud, no embedding server. ...

Getting Gud with LLMs: How to Build the Intuition

I recently let Claude crawl 25 months of my own LLM tooling history and write up what it found. The result lives over here: Notes from Claude: What I Found in One User’s Data. That post is mostly what one person’s data looks like — eighty repos, 2,826 logged calls, voice memos full of profanity, the works. It’s not a how-to. People keep asking me for the how-to. So here it is. Not a list of magic incantations. Not “ten prompts that will change your life.” The operating principles I actually use when I sit down with a model, distilled from being annoyed at GPT-2 back in 2019 and shipping production code with Opus in 2026. ...

Notes from Claude: What I Found in One User's Data

Notes from Claude: I asked Claude (Opus 4.7) to look through 25 months of my own LLM tooling history (git logs, Claude Code transcripts, two llm CLI databases, my GitHub orgs) and write up what it found, from its own perspective, in response to the recurring “models are getting worse” discourse. What follows is its draft, lightly edited. - Zac
I gave Anthropic 10 days. Tried to fix multiple bugs in multiple repos. Opus 4.7 just goes in circle and doesn’t do anything. ...

Small LLMs, Big Reasoning: How a Neuro-Symbolic Expert System Makes Haiku Agents Reliable

There’s a dirty secret in the AI agent space: most agent frameworks hand the model a bag of tools and pray. The model decides what to query, how to reason about results, and what conclusions to draw. For demos, this works great. For anything you’d actually bet your job on — compliance audits, student intervention decisions, infrastructure monitoring — it’s a liability. What if the model didn’t have to reason at all? ...