Question 1

BN: Why do organizations spend time trying to eliminate data silos and why does it never work?

Accepted Answer

SS: Organizations spend time trying to eliminate data silos because they see them as barriers to efficiency, collaboration, and data-driven decision-making. The common belief is that silos create redundancy, inefficiencies, and inconsistency in data, making it harder to get a unified view of business operations. However, the attempt to unify 100 percent of all data silos can often turn out to be too ambitious with diminishing returns. Yet companies try to do that, and the underlying reason is in basic human psychology. As humans, we like to keep things neatly organized, like books in a library. It gives a sense of control and simplifies future discovery and use. We naturally try the same with data, until we realize that the scale, complexity, and fragmentation of data make 'one silo' an impossible goal. Ultimately the reason it never works is that new data silos are getting created faster than we can centralize them. By the time we get the bulk of the data in a single pattern the underlying technology changes, case in point Hadoop, Snowflake, Databricks, and now Iceberg.

Question 2

BN: How can you determine which silos are essential?

Accepted Answer

SS: Determining which data silos are essential involves evaluating their purpose, usability, and necessity within an organization. But one key point, not to be missed is to obey Data Gravity. Data Gravity means that naturally data will cluster by its type, function, or domain. It would be wise to understand that versus pushing against the grain. A good analogy is teams in a company. You might ideally want all employees to be in your headquarters in New York, for the sake of better communication and efficiency. But now you start a new division and realize that most good hiring for that division is happening in a particular location, say North Carolina. It might now make sense for you to then consider creating a satellite office (silo) in that location versus requiring everyone to move to your HQ in New York. The key points for identifying essential silos include:

Question 3

BN: Which types of data should be prioritized for decentralization?

Accepted Answer

SS: The types of data that should be prioritized for decentralization are those that benefit from being accessible, agile, and adaptable to different teams and use cases. This is data that is relevant to a large number of diverse users and applications in your enterprise. The key categories include:

Question 4

BN: What’s the best way to ensure seamless access to essential data?

Accepted Answer

SS: Instead of force-fitting all data into a central system, leverage Data Products to simplify discoverability and accessibility. Forrester Research reports that organizations implementing virtual data products achieve 47 percent faster time-to-insight and 35 percent reduction in data integration costs compared to traditional centralization approaches. Virtual Data Products support two approaches:

Question 5

BN: Where does AI fit into this approach?

Accepted Answer

SS: We are in an era of AI where powerful, general-purpose models are now available to everyone, everywhere. Gone are the days when every AI idea could take months or years from model design to training, to production. This means the true challenge for enabling AI is now connecting it to the right data. Breaking down data silos and making data accessible to AI is key. Silos can make that challenging, but before we go down breaking all silos it is very important to remember that data has to be tightly governed before it feeds into AI. Now breaking silos doesn’t mean putting all data from across silos into one silo. Instead, what it means is making data behind a silo seamlessly accessible. Ultimately the data user or application shouldn’t see silos as a source of friction. Image credit: Khakimullin/depositphotos.com

Data silos -- why they’re flawed and what to do about it [Q&A]

Recent Headlines

Maingear now lets buyers bring their own RAM to avoid DDR5 price spikes

Lemon Slice 2 turns any single image into a real time, talking AI avatar

Wondershare brings new AI Mate editing assistant to Filmora V15

AI video tools and how they’re changing business communication [Q&A]

AI risks, greater regulation and remote consultations -- healthtech predictions for 2026

Nissan confirms customer data was involved in Red Hat security breach

US slaps a ban on foreign-made drones and components

Most Commented Stories

MiniTool adds a duplicate cleaner and refreshed interface to Partition Wizard 13.5

The switch from Google Assistant to Gemini will be slower than expected

Jumping on the bandwagon, ‘Your Year with ChatGPT’ is now available

Foxit PDF editor gains new collaboration safeguards and AI features

Wondershare adds Topaz Labs' AI video tools to UniConverter 17

US slaps a ban on foreign-made drones and components

Microsoft releases emergency patch for Windows 10 to fix Message Queuing problems

Maingear now lets buyers bring their own RAM to avoid DDR5 price spikes

Why Trust Us

NEWS

UNITED KINGDOM

CANADA