Skip to main content
- Manufacturing

Predictive Maintenance: What Vendors Won't Tell You

Vendor PdM demonstrations make it look straightforward. The labelled training data problem, the CMMS integration gap, and the asset selection mistake are what actually determine success or failure.

Amit Kumar Singh - Technology Consulting Partner at MyData Insights

Technology Consulting Partner · MyData Insights

28 Jan 2026 · 7 min read

The bottom line

Predictive maintenance vendors will show you a demo with clean data, pre-labelled failure events, and a well-integrated CMMS. Your plant has none of that. The three problems that determine whether a PdM deployment succeeds - labelled training data scarcity, CMMS integration gaps, and poor asset selection - are almost never discussed in vendor conversations. Solving them is more valuable than any algorithm upgrade. Before you buy a PdM platform, fix the data layer underneath the one you already have.

The Labelled Training Data Problem

Supervised machine learning models - the type used in most predictive maintenance applications - require historical examples of the event they are trying to predict. For bearing failure prediction, the model needs sensor readings (vibration, temperature, current) from hundreds of historical bearing failures, labelled with the failure type and timing. Without this labelled dataset, the model cannot learn to distinguish pre-failure signatures from normal operational variation.

Most manufacturers do not have this data in structured form. They have maintenance work orders in a CMMS, written in free text by technicians who did not know they were creating training data. Converting those work orders into structured failure labels - with asset ID, failure mode, failure date, and the sensor readings that preceded it - is a data engineering and data quality task that typically takes 3–6 months before model training can begin.

Vendors demonstrate PdM on assets with well-documented failure histories because the demo is designed to show the model working, not to show the implementation effort. The gap between a successful demonstration and a production deployment is the labelled data problem - and it is the most common reason PdM pilots fail to scale.

Vendors demonstrate PdM on the assets where the training data already exists. The first question to ask any PdM vendor is: "Where did your training data come from, and how long did it take to prepare?"

The CMMS Integration Gap

Predicting a failure is only useful if a work order gets created, the right parts are pre-positioned, and the maintenance technician receives the alert in a system they already use. Without CMMS integration, a PdM alert is a number on a screen that requires a human to notice it, interpret it, decide it is credible, and manually create a work order. In a busy plant, that human step is where most PdM value is lost.

A PdM prediction that triggers an automatic work order in the CMMS - with the right failure code, asset history, and recommended parts - closes the loop from insight to action. That integration is not straightforward. Most CMMS systems have different data models, different APIs, and different criticality classification schemes. Building the integration is an engineering task that vendors routinely underscope in their implementation proposals.

The rule of thumb is: if the PdM implementation does not include CMMS integration in scope, the operational value is at risk. The prediction may be accurate. The response may still be manual, slow, and inconsistent.

Asset Selection: The Decision That Determines ROI

The most common asset selection mistake in PdM programmes is selecting assets based on sensor availability or equipment complexity rather than maintenance cost history and failure consequence. Vendors tend to pitch on the assets that are most instrumented - because those are the easiest to demonstrate. Operations teams are drawn to the most sophisticated equipment - because those seem most worthy of AI investment.

The correct starting point is a maintenance cost analysis. Which assets have generated the most unplanned downtime over the past 24 months? Which failures have caused the longest production stops? Which assets have the highest maintenance cost per unit of output? Those are your first targets - regardless of how many sensors they currently have.

Starting with high-cost, high-frequency failures means the ROI is measurable within 12 months. Starting with complex, low-frequency failures means you may spend 18 months building a model that has never seen enough failure events to be reliable. Asset selection is not a technical decision - it is a business prioritisation decision, and it should be made from maintenance cost data, not engineering instinct.

The vendor demonstration works because the vendor controlled the data. Your site isn't the vendor's demonstration environment. The labelled failure history, the clean sensor feeds, the integrated CMMS - none of it comes with the licence. Plan for the 12–18 months it takes to build them, and the programme you end up with will actually deliver.

Predictive MaintenanceAICMMSManufacturing

Your Data · Our Technology · Our Automation

Get practical insights every fortnight

Amit writes about Microsoft Fabric, Power BI, AI in operations, and digital transformation for manufacturing and supply chain leaders. Practitioner perspective - no fluff, no vendor spin.

No spam. Unsubscribe any time. Also on Substack.

Is this the challenge you're facing?

Book a 30-minute call. We'll look at your specific operation and tell you what's achievable - plainly and without slides.