Skip to main content
Supply Chain & FMCG

What Every FMCG Supply Chain Needs Before AI

FMCG companies spend millions on AI-powered demand planning tools and still run 8% stockout rates. The AI is not the problem. The data underneath it is.

Amit Kumar Singh - Technology Consulting Partner at MyData Insights

Technology Consulting Partner · MyData Insights

14+ years in industrial data · Former Accenture & EY · GCC, India, SEA

1 Apr 2026 · 11 min read

The bottom line

FMCG supply chain teams that rush to AI without sorting their data foundation first will spend more and stockout just as often. The prerequisite is a unified, real-time data layer connecting ERP, WMS, distributor sell-through, and POS. Without it, even the best demand model is working from a corrupted input. Get the data right, then layer the AI on top - not the other way around.

The AI Demand Planning Trap

An FMCG company with 400 SKUs and distribution across 12 markets invests $800,000 in an AI demand planning platform. Eighteen months later, their stockout rate has improved from 9% to 8.5%. The platform vendor declares this a success. The supply chain director disagrees.

This story is common. The platform is not the problem - the data it was trained on is. AI demand planning models are only as good as the signals they receive. When those signals come from siloed, inconsistently formatted, partially reconciled systems, the model amplifies the noise rather than reducing it.

The industry has developed a dangerous assumption: that AI is a remedy for poor data quality. It is not. It is an amplifier. Give it clean, connected, consistent signals and it will materially improve forecast accuracy. Give it fragmented, delayed, and contradictory signals and it will confidently produce wrong forecasts - faster than any spreadsheet could.

AI is a signal amplifier, not a data quality remedy. Clean connected data with a simple model will outperform dirty fragmented data with an expensive AI platform - every time.

Why the Data Underneath Is the Real Problem

A typical mid-size FMCG company runs 7–12 operational systems relevant to supply chain: an ERP for procurement and financials, a DMS (Distributor Management System) for secondary sales, a WMS for warehouse management, a TMS for logistics, a trade promotion management system, and various Excel files bridging the gaps between them.

Each of these systems has its own SKU master, its own customer hierarchy, and its own definition of "stock on hand." When the AI demand planning platform ingests these systems, it sees multiple versions of reality that have never been reconciled. The model has to guess which version is correct - and it cannot.

The result is demand signals that are systematically biased by whichever data source dominates the training set. If the ERP data dominates, the model ignores distributor sell-out signals. If DMS data dominates, the model misses the promotional uplifts that only appear in the trade promotion system. The model is doing its job. The data architecture is failing it.

The Disconnected Systems Problem

The core issue is master data fragmentation. In most FMCG organisations, the same SKU exists under different codes in the ERP, DMS, and WMS. The same customer is recorded differently across sales, logistics, and finance. Without a unified master data layer, any cross-system analysis requires manual reconciliation - which is slow, error-prone, and impossible to automate.

Beyond master data, the timing problem compounds the quality problem. ERP data updates overnight. DMS data may be a day or two behind distributor reconciliation cycles. WMS data is often near real-time but disconnected from the financial system. When you train a demand model on data that arrives at different frequencies and different levels of completeness, the model learns the reconciliation lag, not the demand signal.

One client in the GCC was running replenishment off demand signals that had a 72-hour lag built into them - not because the systems were slow, but because the reconciliation process between DMS and ERP required human intervention three times per week. The AI model had learned to predict the reconciliation lag, not customer demand.

When replenishment decisions carry a 72-hour data lag embedded in the reconciliation process, no demand model - AI or otherwise - can compensate. The architecture must be fixed before the model is deployed.

What Data Foundation Actually Means

A data foundation for FMCG supply chain has three layers. The first is a unified master data management (MDM) layer that establishes a single, governed definition of SKU, customer, location, and supplier - and enforces it across all consuming systems. Without this, every cross-system analysis is a reconciliation exercise in disguise.

The second layer is a unified ingestion layer that pulls from ERP, DMS, WMS, TMS, and promotional systems into a single governed data platform - typically a lakehouse architecture - where data arrives with consistent schemas, timestamps, and quality markers. This is where change data capture (CDC) replaces batch exports, and data quality rules are enforced at ingestion rather than at reporting.

The third layer is a demand signal layer that aggregates sell-out data (from DMS or POS), inventory positions (from WMS), promotional calendars, and external signals (weather, seasonality, competitor activity) into a single analytical model. This is the layer where AI adds genuine value - but only when layers one and two are reliable.

The Right Order of Operations

The right sequence is: master data first, integration layer second, analytics and AI third. Most organisations try to reverse this - buying the AI platform first because it is visible, exciting, and easy to demonstrate to a board - and then discovering that the data foundation work was the hard part all along.

Master data rationalisation takes three to six months for a mid-size FMCG business. Integration layer build-out, depending on complexity and systems involved, takes four to eight months. AI model development, once the foundation is in place, is often the fastest phase - three to four months to a production-ready model with a meaningful reduction in forecast error.

The total timeline is 10–18 months to a genuinely functioning AI demand planning capability. The organisations that try to shortcut to the AI without the foundation spend the same time - or more - in failed implementations, re-implementations, and manual reconciliation loops that never quite close.

What the Foundation Runs On

In a Microsoft estate the three layers have a concrete shape. The unified ingestion layer is data integration through Azure Data Factory, pulling ERP, DMS, WMS, TMS, and the trade promotion system into OneLake on a Microsoft Fabric lakehouse — change data capture where the source supports it, scheduled extracts where it does not, with quality rules applied at landing rather than at the report. Years of sell-out and inventory history sit in one governed Delta Lake table instead of a dozen exports.

Master data management sits above that: a governed SKU, customer, location, and supplier definition mapped once and enforced across every consuming system, so a cross-system query is no longer a reconciliation exercise. The demand signal layer then joins sell-out, inventory positions, promotional calendars, and external factors into a single Power BI Direct Lake semantic model — one definition of fill rate, days of inventory, and forecast accuracy that planning, sales, and finance all read from.

Only on top of that does demand forecasting earn its place. The same OneLake foundation also feeds inventory analytics and a supply chain control tower without a second integration project — you connect the systems once and reuse. That reuse is what turns a foundation build from a cost into the platform every later analytics request runs on.

Where This Still Breaks

The hardest part is not technical — it is master data ownership. Sales, logistics, and finance each believe their SKU and customer hierarchy is the correct one, and none will cede the definition without a mandate. Until someone with authority — often a Fractional CDO — forces a single governed master, the integration layer just digitises three versions of the truth faster. The MDM phase fails on organisational politics far more often than on tooling.

The second limit is source-system latency you do not control. If distributor reconciliation genuinely happens three times a week, no pipeline makes that sell-out data real-time — the honest move is to model the known lag explicitly, not pretend it away. And external signals (weather, competitor activity) add forecast value only where you have clean internal demand history to anchor them; bolted onto fragmented data they add noise, not accuracy.

The caveat worth stating plainly: this is a 10–18 month build, and the AI is the last and fastest phase. A board that wants the AI demo live next quarter is the single biggest risk to the programme — because it pressures teams to skip the foundation that makes the AI work at all.

The master data layer fails on ownership politics, not technology. Until one team owns the governed SKU and customer master, integration just replicates the disagreement at speed.

What This Means for the Supply Chain Leader

The shift is from buying a model to fixing the inputs the model you may already own is starving on. A 1–2 point stockout reduction on a 400-SKU range is real margin and real service-level recovery — and it usually comes from connected, reconciled data feeding the existing planning tool, not from a second platform purchase. Before approving more demand-planning spend, the sharper question is whether the current tool has ever been given clean signals to work with.

It also starts smaller than the 10–18 month headline implies. A six-week Discover phase maps the 7–12 systems, profiles the master-data overlap, and quantifies the embedded reconciliation lag — first value is a costed foundation plan and one connected demand signal, not a full rollout. You prove the data problem is the real problem on one category before committing the full programme.

Unify the data, govern the masters, then predict — in that order. The FMCG businesses still running 8–9% stockouts with good AI tools are not short of a better algorithm. They are short of a single version of demand the algorithm can trust.

I've stopped being surprised when FMCG companies with genuinely good AI tools run 9% stockout rates. The AI is doing exactly what it was trained to do - on data that doesn't reflect reality. Fix the data foundation and the AI tool you already have will often deliver the results you were promised. Most of the time you don't need a better model. You need better data going into it.

Free Assessment

Where does your operation sit on the data maturity curve?

8 questions. 3 minutes. You get a scored breakdown across data infrastructure, analytics readiness, and automation potential — with a specific next step for your industry.

FMCGSupply ChainAIData Foundation

Your Data · Our Technology · Our Automation

Get practical insights every fortnight

Amit writes about Microsoft Fabric, Power BI, AI in operations, and digital transformation for manufacturing and supply chain leaders. Practitioner perspective - no fluff, no vendor spin.

No spam. Unsubscribe any time. Also on Substack.

Is this the challenge you're facing?

Book a 30-minute call. We'll look at your specific operation and tell you what's achievable - plainly and without slides.