INSIDE THE AGENTIC ROOFING OS
What runs at 3 a.m. when the office is empty: a tour of the agent loop, the tool graph, and the human-in-the-loop boundary.
Most marketing copy about "AI for roofing" treats the system as a black box. This article opens it. If you're going to run your operation on agents, you need to understand what the agents actually do, which tools they have access to, and where humans are still in the loop.
The agent loop, in detail
An agent in ROOF_OS is not a single LLM call. It is a loop:
- Pick up an event (new lead, scheduled job approaching, payment overdue, weather alert).
- Pull context (the lead's history, the job's status, comparable past decisions, current pricing data).
- Decide the next action from a bounded action space.
- Either take the action (within authority) or escalate to a human with a structured ask.
- Write the action and the reasoning into the audit log.
Step 2 is where most agentic systems quietly fail. If the context fetch pulls back a partial picture — say, the lead's interaction history but not their insurance claim status — the agent will confidently make the wrong decision. We spend more engineering time on context retrieval than on model selection.
The tool graph
Each agent has a tool set. "Tool" here means a callable function or external API endpoint. The Quoting Agent's tool set looks like this: aerial measurement (single provider primary, one fallback), material price lookup (3 supplier APIs), labor productivity lookup (internal), comp pricing (internal historical data), CRM write, email send.
Multiply this across all agents and you get the integration graph behind ROOF_OS — 2,000+ tools across roughly two dozen categories. The shape of the graph matters more than the size: an agent should reach for the cheapest, fastest tool that answers its question, and fall back gracefully when a tool fails.
Reliability is a property of the graph, not of any single tool. If your aerial provider has a 99.5% uptime — solid by industry standards — and you have no fallback, your Quoting Agent has a 99.5% uptime. With one fallback you're at 99.99%. Tool redundancy is operational hygiene.