The bottom line
In food import and distribution, being first to a major trend — by 18 to 24 months — is often the difference between owning a category and trading a commodity. The problem is structural: internal sales data is retrospective, so by the time a trend shows in your order book it has already shown in a competitor’s. The intelligence that drives forward portfolio decisions lives outside the ERP — in DGFT customs records, global food exhibitions, hotel expansion plans, and restaurant menus on Zomato and Swiggy. None of it is in your system. All of it is knowable. The companies pulling ahead are building a system to read it continuously: a unified lakehouse on Microsoft Fabric, an Azure OpenAI recommendation engine grounded in current data, and a monthly Market Intelligence Brief that ranks SKUs against market opportunity and flags the next Basa before it is on anyone’s radar.
In This Article
The Fish That Built a Category
There is a fish most Indians had never heard of in 2008. It had a nondescript name, a mild flavour, and a texture that worked across almost every cuisine. It was cheap to import, easy to portion, and held up well in cold chain. Within ten years it was on the menu of nearly every five-star hotel in the country, served in thousands of restaurants, and had become the default white fish for institutional catering across premium hospitality.
The importers who got there first built a category. The ones who followed competed on price.
That is the entire competitive dynamic of food import and distribution, compressed into a single case study. Being first to a major trend — by 18 months, 24 months, sometimes more — is not just an advantage. In a business where differentiation is difficult and margins are thin, it is often the difference between category leadership and commodity trading.
The question every food importer should be sitting with right now is not “how do we grow this year?” It is: what is the next version of that opportunity — and do we have a system to find it before our competitors do?
Most companies don’t have a system. They have instinct, relationships, and the one exhibition trip a year that someone on the commercial team manages to attend. That was sufficient when the world moved more slowly. It is not sufficient now.
Being first to a major trend — by 18 to 24 months — is not just an advantage. Where differentiation is hard and margins are thin, it is the difference between category leadership and commodity trading.
A Forward Business Run on Backward Data
Every food import and distribution business runs on data. Sales figures, inventory logs, order histories, margin by SKU, performance by city — the operational data exists, and in most mature businesses it is reasonably well organised.
The problem is not the data. The problem is what it shows.
Internal sales data is inherently retrospective. It tells you what sold, to whom, at what margin, in which geography, during which quarter. It is excellent for managing what you already have. It is structurally incapable of telling you what you should be importing two years from now.
And this is precisely where the competitive game is played. The window to establish a new SKU, build the supply chain, educate your hospitality customers, and secure the pricing advantage of being the only supplier — that window is measured in months. By the time a trend appears in your order data, it has already appeared in someone else’s vision.
The intelligence that actually drives forward-looking portfolio decisions lives entirely outside your four walls:
→ What are executive chefs in Dubai, Singapore and London putting on their menus right now — and what does that signal about Indian fine dining 24 months from now?
→ Which categories saw meaningful growth at the last Gulfood, SIAL or Annapoorna exhibition?
→ What food imports are your competitors quietly increasing, traceable by HS code in publicly available customs data?
→ Which ingredients are gaining momentum on Zomato and Swiggy restaurant menus across the Indian metros?
→ What new hotel properties and restaurant groups are under construction — and what proteins and specialty ingredients will they require when they open?
None of this is in your ERP. All of it is knowable. The companies building an advantage right now are the ones building a system to read it — continuously, not once a year.
By the time a trend appears in your order data, it has already appeared in someone else’s vision. The order book is a record of decisions already made — yours and theirs.
The Five Signal Categories
Before discussing architecture, it is worth being specific about where the intelligence actually comes from. In our experience building these platforms for food import and distribution businesses, the signal landscape resolves into five distinct categories — each with different lead times and different reliability profiles.
1. India Customs and DGFT import data. The Directorate General of Foreign Trade publishes import data at the HS-code level. It is neither well known nor well used in the industry, but it is one of the most powerful competitive intelligence sources available. You can see — with a lag of a few months — exactly what your competitors are importing, at what volumes, from which origin countries. A sudden increase in imports of a specific seafood category, a new dairy ingredient or a plant-based protein is a signal worth acting on immediately. This is not inference or trend-spotting. It is direct evidence of what is entering the market.
2. Global food exhibition and publication intelligence. Food exhibitions in the UAE, Europe and Southeast Asia tend to lead the Indian premium market by 18 to 36 months. The ingredients and formats attracting attention at Gulfood today will reach Indian fine dining within two to three years. Scanning exhibition catalogues, chef publication databases and specialty food media systematically — rather than relying on one team member’s notes from one annual trip — is how you compress that lead time into a structured signal.
3. India hospitality expansion data. Hotels and premium restaurant groups publish their expansion plans. New properties are announced, construction permits are filed, catering contracts are awarded. Each is a forward-looking demand signal. A five-star hotel opening in Pune in 18 months will need proteins, specialty cheeses, specialty seafood and premium ingredients — and it will want an established supply relationship before it opens, not after. Tracking this systematically tells you where demand is being built before it becomes visible as orders.
4. Digital restaurant and menu intelligence. Zomato and Swiggy carry the menu descriptions of tens of thousands of restaurants across India. Analysing these at scale — tracking which ingredients are appearing more frequently, in which cuisines, in which cities — provides a ground-level view of where chef and consumer taste is actually moving. This is not social-media trend-watching. It is structured intelligence derived from real purchasing behaviour.
5. Social and food media signals. Instagram food communities, food media and influencer coverage are lagging but high-volume signals. By the time a product is generating significant social content in India, it has usually already crossed the adoption threshold. These signals are useful for confirming trend velocity rather than identifying early opportunities — but they matter for separating early signals that are becoming durable trends from those that are fads.
DGFT import data is not inference or trend-spotting. It is direct evidence of what is entering the market — at the HS-code level, by volume, by origin country, with a lag of months, not years.
The Three-Layer Architecture
Building a market intelligence platform for food import and distribution means integrating these external signals with the internal performance data you already have, then applying AI to synthesise the two into ranked recommendations. We build this across three layers.
Layer 1 — the data foundation. The foundation is a unified data lakehouse on Microsoft Fabric, with a Bronze / Silver / Gold medallion architecture that ingests both internal and external sources into a single platform. Internal data flows from the ERP through Azure Data Factory pipelines — sales history, customer master, SKU catalogue, order data, margin by product. External data arrives through a combination of structured pipelines (for customs data and market databases with established APIs), Python scrapers (for exhibition content, restaurant menus and social signals), and Fivetran connectors where pre-built integrations exist.
In the Bronze layer, data arrives raw and unmodified. The Silver layer applies cleaning, standardisation and taxonomy normalisation — critically, building a unified SKU master that maps your internal product codes to standard HS codes, product categories and ingredient classifications. The Gold layer produces intelligence-ready datasets: SKU performance matrices, competitor import trend series, restaurant menu frequency indices, and hospitality expansion timelines. The transformation logic is built in dbt, with PySpark handling large-scale aggregations across the historical data. Nothing in this stack requires proprietary infrastructure beyond what Azure already provides.
Layer 2 — the AI intelligence engine. The AI layer sits on top of the Gold data. It is built on Azure OpenAI (GPT-4o) with a Retrieval-Augmented Generation architecture — meaning the model does not operate from training data alone, but retrieves specific, current information from the data platform before generating any output. Azure AI Search serves as the vector store for unstructured content — food articles, chef publications, exhibition catalogues, social content. When the system produces a recommendation, it pulls the supporting evidence directly from current data rather than relying on static model knowledge.
Three analytical components do the work. The SKU Scoring Model classifies every SKU across two dimensions: internal sales trajectory and external market opportunity. A SKU growing at 8% a year looks healthy in isolation; against a category growing at 25%, the same SKU is losing relative share — and that distinction drives a completely different portfolio decision. The India Trend Lag Model tracks adoption curves for food trends in lead markets (UAE, UK, Southeast Asia) and applies a historically calibrated lag to estimate when they will reach the Indian premium market — the core of the first-mover intelligence. The Competitor HS-Code Tracker continuously monitors import data to flag when competitors begin importing new categories, increase volumes, or shift origin countries in ways that signal supply-chain moves.
Layer 3 — intelligence output. The platform generates several output types, all automated and delivered to leadership without anyone digging for the data. The Monthly Market Intelligence Brief is the primary output: a structured document covering SKU performance versus market opportunity, ranked by category; competitor import activity over the prior 30 days; the top five to seven new SKU recommendations with evidence, sourcing-feasibility assessment, India demand signal and a confidence rating; and a watch list of categories showing early signals worth monitoring over the next two quarters.
Alongside the brief, a Power BI leadership dashboard gives real-time visibility into portfolio performance, competitor activity and trend signals — structured for commercial leadership, not an analytics team. Real-time alerts via Microsoft Teams flag significant competitor import movements as they appear. And a Copilot Studio chat interface lets any member of the commercial team ask natural-language questions — “which of our current seafood SKUs are most at risk from competitor substitution?”, “what proteins are trending in UAE hotel menus that we don’t currently carry?” — and receive answers grounded in the current data platform, not a model’s training knowledge.
The model does not run on training data alone. With Retrieval-Augmented Generation, every recommendation is grounded in current evidence pulled from the data platform — DGFT records, menus, exhibition catalogues — not in what the model happened to learn.
The “Next Basa” Logic
The practical question is how the system actually surfaces a recommendation like the original Basa opportunity — a product not yet on anyone’s radar, entering a market window that is open but closing. The process is systematic.
First, the trend lag model scans global lead markets for proteins, specialty ingredients and food formats gaining adoption in cuisines and restaurant formats that historically lead the Indian market. The scan is continuous, not periodic.
Second, it cross-references against India import data. Is anyone already bringing this in? At what volumes? Is the import window still open, or has it already closed? The timing of entry is as important as the product itself.
Third, it matches the opportunity against the existing customer base. Do the hotel groups and restaurant chains already in the distribution network have the menu format and customer profile to absorb this product? A recommendation is only actionable if the current commercial infrastructure can reach the right buyers.
Fourth, it assesses sourcing feasibility — whether an established supply chain exists in an accessible origin country, at volumes that make the business case work.
Finally, it generates a ranked recommendation with the supporting evidence, the confidence level and the key risk factors — in language a commercial team can act on, not a data-science report that needs interpreting.
A recommendation is only actionable if your current commercial infrastructure can reach the right buyers. The system scores the product, the timing, the customer fit and the sourcing — not just the trend.
What Changes When You Build This
The most important change is not operational. It is strategic.
Right now, most food import and distribution businesses make their forward portfolio decisions in a quarterly planning session, based on the commercial director’s read of the market, filtered through whatever the sales data shows and whoever attended the last industry event. It is a reasonable process given what is available. It is also a process that is structurally likely to be behind the curve.
With a systematic intelligence platform, the portfolio conversation changes. Instead of asking “what should we try next year?”, you are asking “the system is flagging three opportunities in the high-confidence range — which two do we have the capacity to pursue, and what is the sourcing lead time on the third?” That is a fundamentally different quality of decision.
The experienced commercial professional’s judgment does not become less relevant. It becomes better-resourced. The instinct that comes from two decades in the industry is still the most important input — but it is now working with current, synthesised, external market data instead of internal history and a single annual exhibition.
The experienced buyer’s judgment doesn’t become less relevant. It becomes better-resourced — working with current external market data instead of internal history and one trip a year.
How We Build It
We do not build this as a big-bang implementation. The risk in any data platform project is delivering infrastructure before demonstrating value — and the food import and distribution category has seen enough failed IT projects to be rightly sceptical of large upfront commitments. Our operating model is Discover, Prototype, Deploy, Expand.
Discover (weeks 1–2). We run a structured diagnostic of the current SKU portfolio, the data landscape and the external intelligence gaps. This includes mapping which external sources are accessible, understanding the ERP data structure, and establishing the first version of the SKU performance matrix from existing data.
Prototype (weeks 3–5). We build a working version of the platform — not a proof-of-concept slide deck, but an actual functioning system that ingests the first data sources and generates the first Market Intelligence Brief. At the end of this phase you have real output: a ranked view of your portfolio against market opportunity, competitor import signals from the prior 90 days, and the first new SKU recommendations with evidence.
Deploy (weeks 6–14). Full platform build — all data sources integrated, AI recommendation engine live, Power BI dashboards deployed, Copilot Studio interface configured.
Expand. The platform extends to additional geographies, additional data sources and additional product categories as the commercial use cases are validated.
The diagnostic and prototype phase is delivered at a fixed fee. The output at the end of five to six weeks is a working intelligence system and the first brief — not a proposal for what we might build. If it delivers value, Phase 2 follows. If it does not, you have spent a defined budget understanding your portfolio and your competitive landscape in more depth than you had before. There is no bad outcome from that.
You get a working intelligence system and the first brief at the end of five to six weeks — not a proposal for what we might build. Fixed fee. You are not funding a learning exercise.
The Window Is Open — For Now
The food import and distribution businesses that will define the next decade of premium distribution in India are building their intelligence infrastructure now — while most competitors still rely on last quarter’s sales data and last year’s exhibition notes.
This is not a technology story. It is a competitive strategy story. The technology happens to make something possible that was not practically achievable five years ago: a continuous, systematic, AI-powered market intelligence function, built on cloud infrastructure, running at a cost accessible to a mid-market food business.
The data to see what is coming already exists — in DGFT records, exhibition catalogues, restaurant menus, hotel expansion announcements. It has always been there. The only question is whether you have a system to read it before the company that does walks into your largest accounts with the next Basa. That is a 30-minute conversation, not an eighteen-month programme — no slides, no pitch deck, no obligation to proceed.
Free Assessment
Where does your operation sit on the data maturity curve?
8 questions. 3 minutes. You get a scored breakdown across data infrastructure, analytics readiness, and automation potential — with a specific next step for your industry.