A data product API without execution evidence is just a query with a nicer front door.

The API needs to explain its work

Apache DataFusion gives teams a query engine they can embed inside services. That makes it attractive for data product APIs, because the query planner and execution engine can live close to the application boundary.

The risk is that the service becomes a black box. Consumers see response time and maybe an error code. Operators need more. They need to know scan volume, partition pruning behavior, operator time, memory pressure, and whether policy filters changed the result path.

Execution metrics are product evidence

DataFusion documents runtime metrics on execution operators and exposes plans through EXPLAIN and EXPLAIN ANALYZE. In an API setting, those metrics can become reliability evidence instead of developer-only diagnostics.

A mature data product API should record which table version or snapshot was queried, which partitions were touched, which filters were pushed down, which policy rules constrained the result, and which execution metrics explain the cost of the request.

Core idea: metrics are not only for tuning. They are how a data product proves what work it did.

The ODI pattern links metrics to contracts

Execution metrics become more valuable when they connect to data contracts. A freshness SLA means little if the API cannot show which source snapshot it used. A cost budget means little if the API cannot show why a request scanned too much data.

For adjacent context, read data product SLAs in ODI, policy-aware DataFusion services, and ODI control planes for AI workloads.

What breaks first

  • The API returns correct data but cannot explain a latency spike.
  • A policy filter changes scan behavior, but the service logs only the final response.
  • Partition pruning stops working and the product owner notices through cost, not evidence.
  • Agent consumers treat every response as equal because quality and execution signals are missing.

Questions to ask

Ask which execution metrics are captured per request, which ones appear in incident review, and which ones define product-level budgets. Ask whether metrics can be traced back to the data product contract, not only to a service instance.

Sources to start with

These primary sources anchor the technical claims in this guide.

A query service becomes a data product when it can show its work.