Skip to main content

DORA × Engineering Cost: The ROI Story Tools Miss

· 11 min read
Artur Pan
CTO & Co-Founder at PanDev

A VP of Engineering walks into the quarterly review with a clean DORA dashboard: lead time down from 9 days to 4, deployment frequency up from 1.2 per week to 2.8, change failure rate trimmed from 18% to 11%. The CFO listens patiently, then asks the only question that matters: "What did that save us in dollars?" The room goes quiet. The DORA tool does not know. The finance tool does not know either, because it does not see deployment data. The CTO ends up arguing on principle. Two quarters later, the platform team's budget is cut to fund a sales hire.

Most engineering organizations track DORA and cost in two separate systems. Sleuth, Swarmia, LinearB show you DORA. Jellyfish (its standalone finance module) and Faros show you cost. The DORA State of DevOps reports explicitly link the four DORA metrics to organizational performance, but only at the outcomes layer, not the dollar layer. To translate "we cut lead time from 9d to 4d" into a real number the CFO defends, you need both data sources in the same query. This article walks through the four integration points, then a worked Q1 → Q2 example with a 2.73x quarterly ROI.

Logistics Engineering Metrics for Delivery Platform Teams

· 7 min read
Artur Pan
CTO & Co-Founder at PanDev

A delivery platform's engineering team runs a fundamentally different workload from a B2B SaaS team. The courier mobile app pings location every 3-5 seconds. The dispatcher console expects sub-200ms order assignments. Route-optimization jobs crunch combinatorial problems overnight and need to finish before dawn shifts start. A 2024 McKinsey report on last-mile logistics pegged the cost of a single hour of dispatcher downtime at $12,000-$35,000 for a mid-size regional carrier.

This shape of work changes what engineering metrics actually matter. DORA four keys still apply, but the team-health and delivery-performance picture shifts. Here's the metric stack that fits logistics platform teams — and the places where "copy a SaaS DORA dashboard" misleads you.

Marketplace Engineering: Metrics for Two-Sided Products

· 9 min read
Artur Pan
CTO & Co-Founder at PanDev

A marketplace CTO told me the line I keep hearing: "My supply team ships fast, my demand team ships fast, and GMV still stagnates." The DORA dashboards were green on both sides. The matching engine was not. Two-sided products have a metric gap that single-sided SaaS doesn't: engineering output on one side of the marketplace only creates business value if it's matched by output on the other side.

Andreessen Horowitz's marketplace framework ranks liquidity — the probability that a listed item actually transacts within a window — as the single best predictor of marketplace health. That probability is an engineering outcome, not a marketing one. When search latency rises by 200ms, listed-item conversion drops measurably. When seller onboarding takes 14 days instead of 4, supply growth curves flatten within a quarter.

Feature Flag Management Without Chaos: The Playbook

· 8 min read
Artur Pan
CTO & Co-Founder at PanDev

Your team turned on feature flags three years ago because it felt responsible — gradual rollouts, kill switches, A/B tests. Today the flag service has 87 live flags, and nobody on the team can tell you what 34 of them do. Two of them are contradicting each other in production right now. One was meant to be removed in 2024. Airbnb's engineering team publicly described this exact failure mode in 2023 — they hit 6,000+ flags before a full audit forced a cleanup. GitHub reported 3,700 experiments running simultaneously at peak.

The problem is not feature flags. The problem is that teams treat flags as free — cheap to add, invisible to maintain. This playbook is a lifecycle framework that works for teams between 10 and 200 engineers, backed by data from 100+ B2B companies we track via IDE heartbeats. The goal: a flag count that stays roughly flat with team size, not linear with team age.

Manufacturing Software Engineering: Agile Meets Hardware

· 8 min read
Artur Pan
CTO & Co-Founder at PanDev

A mid-sized automotive supplier I consulted for in 2024 had a production bug land at 03:15 on a Tuesday. The fix took 8 minutes to code and 19 days to deploy — because it required a software update to PLCs on 14 production cells, each of which could only be updated during the 4-minute changeover window between shift batches. The engineering team's average lead time on the office-IT side: 31 hours. On the shop-floor side: 14 days. Same team, same repository, two different universes of delivery constraint.

Manufacturing software engineering is Agile meeting hardware. The practices that work at a SaaS startup — deploy-whenever, feature flags, canary releases — collide with regulated plant-floor reality: OEE targets, changeover costs, OT/IT separation, and production lines that cannot pause for a deploy. A 2023 Deloitte Smart Factory study found 73% of manufacturers cite "IT/OT integration" as the top barrier to digitization. The problem isn't technology; it's that metrics and rituals designed for pure software break when the software touches a physical process.

Retroactive Rate Changes: When You Update a Salary Backwards

· 9 min read
Artur Pan
CTO & Co-Founder at PanDev

A VP of Engineering walks out of a Q1 review and announces an 8% raise for 12 backend engineers, effective March 1. It's now May 18. Three months of finance reports already shipped to the board with the old rates baked in. HR has two options: pretend the raise started today, or retroactively update March, April, and May. Most engineering finance tools force option one. PanDev Metrics supports option two, and the Sarbanes-Oxley Act of 2002 is the reason it has to be done carefully.

This is one of the few areas where our product genuinely diverges from LinearB, Jellyfish, and Code Climate Velocity. Those tools were built around forward-only rate models. PanDev's UserRate table is bitemporal: every rate has a startPeriod and endPeriod, and the OverheadCoefficientFullRecalcCronJob will replay activity events through new rate × overhead K when historical rows change. That's powerful. It's also exactly the kind of capability that auditors look at twice.

API Versioning Best Practices: Real Team Examples

· 10 min read
Artur Pan
CTO & Co-Founder at PanDev

Twilio maintains 14 active API versions. Stripe pins every customer to the version active on their signup date and has supported versions going back to 2011. GitHub's REST API runs three major versions in parallel and publishes deprecation headers 12 months before sunset. Your team is probably trying to get away with one — and debating whether the version goes in the URL, a header, or the accept type.

The versioning debate is really three separate decisions stacked into one argument: where the version lives, how breaking changes are scoped, and when old versions die. Getting one right doesn't save you if the other two are wrong. This is a playbook drawn from how the companies that actually run public APIs at scale handle it, plus what we see inside PanDev Metrics customers running internal APIs with 20-200 consumers.

Cost Attribution in Microservices: Who Pays for Auth?

· 10 min read
Artur Pan
CTO & Co-Founder at PanDev

A platform team of 6 engineers costs $156K per quarter. They run auth, observability, the internal API gateway, the shared cache, and the deploy pipeline. Eight product teams use those services every day. Ask the CFO who pays for it and the answer is "central R&D." Ask the platform lead who consumes it and the answer is "everyone equally." Both are wrong, and the gap between them is where engineering finance loses six figures a year in distorted decisions.

Adrian Cockcroft made the original argument when Netflix decoupled into microservices: shared infrastructure has a unit cost, and unit cost should follow the request. The CNCF FinOps Working Group in their 2024 State of FinOps for Engineering report found fewer than 24% of microservices organizations allocate platform-team time back to consumer teams. The other 76% treat platform engineering as overhead, which means the team consuming 41% of platform requests is invoiced the same as the team consuming 1%.

PropTech Development Velocity: Real Estate SaaS Engineering

· 8 min read
Artur Pan
CTO & Co-Founder at PanDev

A PropTech team I worked with last year ships 4.2 deploys per week across their flagship product. Their CEO benchmarks that against a reference SaaS portfolio and concludes velocity is "mediocre." It's not. A fintech of similar headcount ships 7.1; a pure B2B SaaS ships 9.4. PropTech lives at the intersection of regulated data, geospatial complexity, and 1990s MLS integrations — the raw deploy-frequency number hides what engineering is actually fighting.

Stack Overflow's 2024 Developer Survey places real-estate software in the bottom third of all industries for reported build and integration-testing speed. Microsoft Research's 2024 DevEx benchmarks show regulated industries losing an average 23% of engineering throughput to compliance friction alone. PropTech layers geospatial complexity on top of that.

Dependency Management: npm vs pip vs Go Modules Playbook

· 7 min read
Artur Pan
CTO & Co-Founder at PanDev

A mid-size JavaScript service imports 47 direct dependencies and ends up resolving 2,500+ transitive packages. The same service ported to Go imports 12 direct modules and resolves 42 total. The pip equivalent sits near 180. These are not preferences — they are the shape of each ecosystem, and your dependency strategy has to start from that reality.

Your supply-chain exposure, lockfile discipline, and upgrade cadence should be different in each. This is a playbook for doing that well across npm, pip, and Go modules — the three ecosystems that cover about 84% of production backend code according to the 2025 Stack Overflow Developer Survey.