AI is messy: here's how to clean up your data before it derails your strategy
Date:
Thu, 25 Jun 2026 06:53:05 +0000
Description:
Most Enterprise AI programs don't fail because of the model. They fail
because underlying data is fragmented, inconsistent, and poorly governed.
FULL STORY ======================================================================Copy link Facebook X Whatsapp Reddit Pinterest Flipboard Threads Email Share this article 0 Join the conversation Follow us Add us as a preferred source on Google Newsletter Subscribe to our newsletter Getting AI -ready while
building your data infrastructure is like learning to drive a manual transmission on the wrong side of the road.
Its complicated and requires potentially dangerous multitasking.
Organizations with immature data-handling processes that are adopting AI are trying to solve multiple technology problems at once, and risk stalling out. Latest Videos From Watch full video here: Matt Finlayson Social Links Navigation
CTO, ActivTrak. Unsurprisingly, 48% of enterprises cited data-related issues as their top challenge to AI adoption in NVIDIA 's 2026 State of AI report.
Most enterprise AI programs don't fail because of the model or solution selected. They fail because underlying data is fragmented, inconsistent and poorly governed. You may like Enterprises dont have an AI problem, they have
a data problem Why messy data will make your companys AI bill much higher
than expected AI isnt failing; your enterprise systems are Get Your Data Foundation in Order Enterprise data is messy in layers. Its scattered across many systems, making it hard to pull together into a coherent picture. Even when you can consolidate it, you often will run into granularity or
identifier mismatches. One application may store account numbers as plain digits, while another adds ACCT as a prefix. That small inconsistency creates an extra reconciliation step every time you join those data sets.
Data governance compounds the problem. Without a system intentionally
designed to control who accesses data , where it moves and what protections are in place, gaps emerge fast. PII exposure is the most obvious risk: an email address that ends up in the wrong hands can trigger a serious breach. Raw, unstructured data also yields mediocre AI outputs and is more expensive to process. Are you a pro? Subscribe to our newsletter Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed! Contact me with news and offers from other Future brands Receive email from us on behalf of our trusted partners
or sponsors By submitting your information you agree to the Terms &
Conditions and Privacy Policy and are aged 16 or over.
Clean, structured data yields better results at lower cost. A third gap, explainability, is quickly becoming a legal requirement. Many countries and several U.S. states now require organizations to demonstrate how AI-driven decisions were reached. Cut corners on the data foundation and you may not be able to show that chain of reasoning.
At that point, youre either in compliance violation territory or your model
is producing outputs you cant defend. Three Steps to Get Your Data AI-Ready Define governance before you deploy. Classify your data: what is it, where
did it come from and who can touch it. Separate the roles of technical decision-making and compliance oversight. Keeping those responsibilities with different people prevents a compromising situation where the same person sets the rules and monitors compliance. What to read next Why building AI applications still means building infrastructure-first Five signs your infrastructure is stalling your AI strategy Why enterprises need to rethink data in the AI era
Run cross-functional AI governance as a standing function. Assign a representative from every department and meet monthly to discuss what teams are working on, what concerns have surfaced and what support they need from one another.
Approach larger AI-readiness initiatives like any other business project: assign a project manager , designate an executive owner, set a weekly
cadence, build a task list and work through it.
Collect behavioral data even before you need it. The outcomes you get from AI vary enormously depending on how skilled the operator is, ranging from using it as an expensive search engine to developing autonomous workflows. Without visibility, you might be pouring money into AI licenses and getting Google -level output in return.
You dont know who needs training, whether they have the right tool in front
of them or what outcomes theyre achieving. The risk is that you make the
wrong strategic call as a resultabandoning a rollout, for example, when the real fix was better training or a different tool. Further considerations
Heres another layer to consider. When an experienced worker completes a task, with AI assistance, they leave more skilled than when they started. The
output and the learning happen together. That's what behavioral data should demonstrate over time not just task completion, but upward skill trajectories.
When someone at the beginning of the learning curve accepts whatever AI produces without critically engaging with it, you get the output but not the growth. Behavioral data is how you catch that gap early, before it becomes a long-term cost you can't unwind.
Stay curious and look for the easy wins. Focus your data readiness efforts on the workflows where work actually happens, and prioritize tools that let you get at that data.
A recent example illustrates the payoff. A product manager ran an AI-powered analysis of quarterly bug patterns using data from the departments most commonly used tools. The results were unexpected. One team carried a disproportionate share of incoming tickets, most of them requests for manual workarounds to a missing product feature.
While other teams split their time roughly 75% on new work and 25% on
incoming bugs, that team was closer to 50-50. By not building a single feature, the organization was effectively operating 1.5 people below
capacity.
The entire analysis took about 45 minutes. None of it would have been
possible without data that was organized, tagged by team, connected to individual contributors, accessible via existing AI connectors and protected by role-based access controls.
The organizations that get the most from AI are the ones that empower their people to ask "I wonder if there's something here" and have data to diagnose in an afternoon. That only happens when the foundation is already in place. Make your data safe with the best backup software . This article was produced as part of TechRadar Pro Perspectives , our channel to feature the best and brightest minds in the technology industry today.
The views expressed here are those of the author and are not necessarily those of TechRadarPro or Future plc. If you are interested in contributing find out more here:
https://www.techradar.com/pro/perspectives-how-to-submit
======================================================================
Link to news story:
https://www.techradar.com/pro/ai-is-messy-heres-how-to-clean-up-your-data-befo re-it-derails-your-strategy
--- Mystic BBS v1.12 A49 (Linux/64)
* Origin: tqwNet Technology News (1337:1/100)