OTIF (On-Time In-Full): Formula, Benchmarks, and the Supplier Performance Gap

In a closed-loop procurement platform — where every step of the buying workflow runs as one connected record, from demand signal through purchase order, supplier reply, and receiving — OTIF (On-Time In-Full) is the supplier performance metric that checks both dimensions at once. Fill rate alone can look acceptable while delivery timing drifts. Lead-time accuracy can look clean while order completeness erodes. OTIF is the AND: an order only passes if it arrived within the agreed delivery window AND was received at or above the agreed quantity threshold.

OTIF stands for On-Time In-Full. It measures the percentage of purchase orders for which the supplier satisfied both conditions simultaneously.

OTIF (%) = (Orders on-time AND in-full / Total orders) × 100

A supplier that delivered 93 of 100 orders on time, but only 85 of those 93 in full, has an OTIF of 85% — not 93%. The strict AND is what makes OTIF a useful single-number scorecard: it catches the supplier whose timing is excellent but quantity is unreliable, and the supplier whose completeness is consistent but scheduling is erratic. Both are operational planning problems. OTIF surfaces both in one number.

Quick answers

What does OTIF mean? On-Time In-Full. A per-order pass/fail metric: did the order arrive within the agreed delivery window AND at or above the agreed quantity threshold? If either condition fails, the order fails OTIF.

What is the OTIF formula? OTIF (%) = (Orders on-time AND in-full / Total orders) × 100.

What is a good OTIF score? For most supply chain relationships, 95–99% is strong. Retail compliance programs (Walmart, Target, Costco) typically require 95%+ with financial penalties for non-compliance. SMB buyers measuring their own suppliers should target ≥ 95% for primary suppliers on A-items.

How is OTIF different from fill rate? Fill rate measures the proportion of ordered units actually delivered — a quantity-only metric. OTIF checks both timing and quantity on a per-order basis using strict AND logic. An order with 100% of units delivered but arriving two days late is a fill rate win and an OTIF failure.

How is OTIF different from lead-time accuracy? Lead time accuracy measures whether delivery timing matched the quoted timeline — a timing-only metric. OTIF also requires that the order was complete. A supplier with 98% lead-time accuracy but 88% unit-fill accuracy can have a significantly lower OTIF score.

Does OTIF only matter for large retailers? No. Retail compliance programs formalized OTIF with fines, but the metric applies to any buyer-supplier relationship where timing and completeness both affect operations, replenishment planning, or downstream production. SMB buyers should track OTIF per supplier for A-item categories.

Can I calculate OTIF without special software? Yes — with a PO log and receiving log that record confirmed delivery date and received-versus-ordered quantities per order. In practice, consistent data entry at receiving is the bottleneck. A closed-loop procurement system captures these events automatically as structured records, making OTIF a query rather than a reconstruction project.

The formula in detail

OTIF (%) = N_on-time-and-in-full / N_total × 100

Where:

N_on-time-and-in-full = count of purchase orders that met both the timing threshold and the quantity threshold in the measurement period
N_total = total purchase orders placed with the supplier in the measurement period

Both thresholds must be defined explicitly before the metric is useful.

Defining "on time"

On-time is measured from the agreed delivery date — not the ship date, not the original order date — and requires a tolerance definition:

Setting	On-time window
Retail compliance (Walmart, Target)	Arrives at the DC on or before the Required Delivery Date (RDD), often with a narrow ±1-day window
SMB buyer (direct supplier)	Within ±1 day of the supplier's confirmed delivery date
Replenishment-focused SMB	Arrives before inventory for affected SKUs falls below safety stock

For SMB buyers, the most operationally relevant definition is the last one: on-time means the order arrived before the replenishment gap opened. This anchors OTIF to actual stockout risk rather than a calendar tolerance that may or may not matter for a given order.

Defining "in full"

In-full is usually defined as a threshold, not necessarily 100% of every unit:

Setting	In-full threshold
Retail compliance	100% of ordered units (strict; no partial orders)
SMB buyer with occasional short fills	≥ 95% of ordered units
High-variability categories	≥ 90% of ordered units, with pre-agreed tolerances documented in the supplier record

Below the defined threshold on any order = fails in-full, which means the order fails OTIF regardless of timing.

Why OTIF is stricter than fill rate or lead-time accuracy alone

A supplier with 97% fill rate and 94% on-time delivery does not necessarily have a 91% OTIF. OTIF requires both conditions to be true of the same order. If the 3% of short-fill orders overlap with the 6% of late orders — meaning some orders fail both dimensions — the overlap is not double-counted. The OTIF is calculated from actual order-level pass/fail records, not from the product of two aggregate percentages.

The practical implication: suppliers who look acceptable on each individual metric can look meaningfully worse on OTIF, because OTIF exposes the orders where both dimensions fail simultaneously. Those orders are the most operationally disruptive — a short fill that also arrives late cannot be patched with an emergency reorder because there is no time.

The compounding effect on replenishment math

Safety stock formulas use lead time and demand variability as inputs. When OTIF is structurally low, two failure modes compound:

Late delivery: the reorder point fires correctly, but the order arrives after the safety stock buffer is consumed. Inventory runs below the safety floor before replenishment arrives.

Short fill: the order arrives on time, but not in full. Inventory rises, but not to the expected level. The next reorder point fires sooner than the replenishment model predicted.

Both outcomes produce unplanned purchases — often at emergency pricing — or stockouts that affect revenue. Tracking fill rate and lead-time accuracy independently underestimates this compounding effect. OTIF is the metric that captures the conjunction of both failures.

Two operational contexts for OTIF

1. Buyer-side OTIF: measuring your suppliers

The primary use case for SMB buyers is tracking OTIF per supplier over a rolling 90-day window:

Supplier OTIF (%) = Orders from Supplier X on-time AND in-full / Total orders from Supplier X × 100

Tracked alongside fill rate by supplier, lead-time accuracy, and purchase price variance, OTIF gives a complete picture of supplier reliability across the dimensions that affect replenishment planning.

Tiering suppliers by OTIF:

OTIF range	Interpretation	Action
≥ 97%	Reliable across both dimensions	Maintain as primary; consider for blanket PO or auto-send cadence
90–96%	Acceptable, monitor for drift	Review quarterly; investigate specific failure orders
80–89%	Structural reliability problem	Upsize safety stock; qualify a secondary source
< 80%	Unreliable for A-item replenishment	Shift A-item volume; escalate with data

2. Seller-side OTIF: when your business is the supplier

SMBs that sell into large retail or distribution channels — CPG brands supplying grocery chains, supplement brands supplying distributors, specialty goods selling to national retailers — face OTIF requirements from the opposite direction. The retailer measures your OTIF as a supplier and charges financial penalties when it falls below their threshold.

Walmart's supplier OTIF compliance program, as a well-documented example, has required ≥ 95% OTIF with a penalty of approximately 3% of the purchase order cost for each non-compliant shipment. Target and Costco operate comparable programs. For a $500,000/year supplier relationship running at 80% OTIF, the fine exposure can exceed $30,000 annually — before accounting for delisting or volume reduction risk.

The procurement implication: your OTIF performance to retail customers is downstream of your suppliers' OTIF to you. A vendor who consistently delivers short fills or late orders makes it structurally difficult to maintain compliance with retail customers. Buyer-side OTIF tracking is therefore a prerequisite for seller-side OTIF defense.

Benchmarks

Relationship type	Target OTIF	Consequence of non-compliance
Walmart (supplier to Walmart DC)	≥ 95%	~3% of PO cost per non-compliant order
Target, Costco (supplier to retailer DC)	≥ 95%	Fines per non-compliant order; delisting risk
Mid-tier grocery / regional distributor	≥ 90–93%	Volume reduction, priority downgrade
SMB buyer measuring own suppliers (A-items)	≥ 95%	Replenishment risk, safety stock gap
SMB buyer measuring own suppliers (B-items)	≥ 88%	Acceptable with appropriate safety stock

Industry benchmarks by supplier type:

Supplier category	Typical OTIF range
Automotive (just-in-time dependency)	98–99%
Consumer packaged goods (selling into retail chains)	90–95%
Independent SMB suppliers (B2B)	80–90%
Specialty and seasonal goods	75–85% (higher variance expected; price into safety stock)
Fresh and perishable produce	82–92% (seasonality, weather, and harvest variance affect timing)

Small businesses buying from smaller suppliers should expect OTIF to fall in the lower bands of these ranges and size safety stock accordingly. The cost of expecting 97% OTIF from a supplier delivering 85% is unplanned emergency buys — which typically cost more per unit and create order irregularity that further disrupts replenishment math.

Worked example

A specialty grocer places orders with four primary suppliers. Over the trailing 90 days:

Supplier	Total orders	On-time	In-full	On-time AND in-full	OTIF
Supplier A — produce	36	34	33	32	88.9%
Supplier B — dry goods	24	24	22	22	91.7%
Supplier C — dairy	36	35	35	34	94.4%
Supplier D — specialty imports	12	10	11	9	75.0%

Supplier A has the highest order frequency and the second-lowest OTIF. Decomposing the 4 failures:

2 orders late, of which 1 was also short
3 orders short, of which 1 was also late

Orders that failed at least one condition: 2 (late-only) + 2 (short-only) + 1 (both) = 5. But the overlap means only 4 unique failures, not 5 — the order that was both late and short counts once. On-time AND in-full: 36 − 4 = 32. OTIF = 88.9%.

For fresh produce on an A-item profile, 88.9% OTIF means approximately one short or late order every 9 days at twice-weekly ordering frequency. The operational consequence: emergency procurement from a secondary source roughly 6 times per quarter, or planned safety stock large enough to absorb every expected failure — which, for perishables with a decay rate, means waste.

Supplier D at 75% OTIF on specialty imports is not alarming given the category: long transit, customs variability, and seasonal harvest availability. The right operational response is not to treat Supplier D as a critical-path supplier for A-items, but to plan 4–6 weeks of safety stock on imported SKUs and accept that OTIF for this category will be structurally lower than for domestic suppliers.

OTIF and the ABC intersection

ABC inventory analysis and OTIF interact directly. The practical policy:

Item tier	Supplier OTIF threshold	Response to non-compliance
A-items	≥ 95%	Immediate dual-source qualification; elevated safety stock
B-items	≥ 88%	Safety stock upsize by fill-rate gap; quarterly review
C-items	≥ 80%	Reduce order frequency; lower-risk to stockout temporarily

The highest-risk combination is an A-item sourced from a supplier with below-85% OTIF. That combination — high revenue exposure, structurally unreliable source — requires the largest safety stock and is the clearest case for qualifying a backup supplier. C-items from a low-OTIF supplier are a lower-priority problem: the revenue exposure is smaller and the safety stock requirements are more forgiving.

How closed-loop procurement makes OTIF trackable

The measurement problem with OTIF is that it requires three structured data points per order: the agreed delivery date (supplier-confirmed), the actual delivery date (receiving timestamp), and the received quantity (receiving variance against the living PO). In open-loop procurement — purchase orders sent by email, supplier replies tracked in an inbox, receiving noted on paper or entered manually into accounting — those three data points rarely live in the same record.

In a closed-loop procurement system:

Agreed delivery date is captured when the supplier confirms the order, whether through email parsing, WhatsApp reply, EDI acknowledgment, or supplier portal update. The confirmed ETA becomes part of the living PO record.
Actual delivery date is captured at structured receiving — timestamped when goods are logged against the PO, not when an invoice arrives later.
Received quantity is captured at receiving with explicit variance flagging against the supplier-confirmed expected quantity.

When all three events generate structured records attached to the same purchase order, OTIF becomes a computed metric — available per supplier, per category, per time period — rather than a project requiring inbox archaeology. The supplier scorecard builds itself from the operating records the procurement loop is already producing.

The contrast with open-loop procurement is direct: building a supplier OTIF scorecard manually from email threads and spreadsheets typically takes hours of reconstruction per month. In a closed-loop system, it is a standing report derived from the receiving and confirmation history already in the system.

Start a 90-day free trial at linenow.co.

Fill Rate vs Service Level: The Metric Most Operators Confuse — fill rate is the quantity-only component of OTIF: the proportion of ordered units actually delivered
Lead Time: Definition, Formula, and How to Measure It Accurately — lead-time accuracy is the timing component of OTIF, measured empirically from receiving records
Supplier Scorecard for SMBs: Four Metrics That Actually Capture Supplier Reliability — how fill rate, lead-time accuracy, PPV, and substitution rate combine with OTIF into an operational view of supplier reliability
Purchase Price Variance (PPV): Formula, Causes, and Why Procurement Decides It — the price dimension of supplier performance that OTIF does not capture
Safety Stock: How to Size It Statistically — how low supplier OTIF increases the safety stock required to maintain a given service level
ABC Inventory Analysis: Classify SKUs, Set Policy by Tier — how ABC tier determines how much OTIF risk is acceptable for a given item