With today's powerful data discovery tools, companies have more options than ever before in regards to the complexity, amount, and origin of the data they analyze. Indeed, it seems that many companies just can't get enough.
There are two options companies have when searching for more data to analyze:
The data that already exists in your company but is not currently being tapped for greater insight and analysis is known as dark data.Many are excited by the idea of big data. It’s exciting! It’s sexy! But here’s why you should instead first focus on bolstering your current BI strategy with the dark data that exists within your company.
With sandbox analytics, a small group of BI users experiment with potentially useful data. If that data does, in fact, prove valuable, only then is it distributed for greater use throughout the organization. The ability to play with big data sets and analyze them on top of what’s already in the data warehouse encourages employees to think strategically without the need to pull in IT. This is bimodal BI.
Let’s go back to the example of that manufacturing company that doesn’t have insight into their quality issue. There are a number of possible reasons for a sudden dip in quality product reaching the customer. Perhaps it’s a worker satisfaction issue that is causing an increase in mistakes. Could it be a shift time issue? Would this have to do with the hours of the shift itself or the manager of that shift? Can it be narrowed down by employee? What about the supply chain?
With these hunches in mind, it’s time to dig into existing data to see what, if anything, supports these hypotheses. Let’s go with the inkling that a drop in quality might have to do with an issue regarding shifts. The current dashboard that displays employee shift data only includes the hours per week that each employee has worked, and doesn’t tell you which shift those hours correspond with. The company’s HR system tracks this, but that type of data isn’t currently set up in an existing data model for analyses. Not to worry.
A comprehensive data discovery tool will allow users to pull data that isn't already available for analyses and lets users mash it up with the data already being used in current reports and analyses. By mashing up the time each employee on this particular production line clocks in and out each day along with the data already used to analyze shift data across the company shows that these employees are alternating between day and evening shifts much more frequently than employees on other teams.
This type of shift switching seems to not only affect productivity, but likely also causes general fatigue and overall employee dissatisfaction, likely the root cause to this dip in overall effectiveness. There is an evident negative correlation between shift times, employee satisfaction, and production quality. This previously dark data can be moved into an existing data model now or at a later time if so desired.
Having identified the problem, the company can now fix it and, of course, monitor the progress with BI. From here, a scorecard is created to monitor hours, shifts, and product quality across all teams. Now that a root cause is in focus, new best practices can be applied across all production teams to improve even already high performing teams.
Uncovering a little bit of dark data in a relatively short period of time – days, as opposed to weeks or months that it would realistically take for a business analyst to create a potentially valuable analysis with new, external data – adds tremendous value to the BI project as a whole.
Shining light on the right dark data can elucidate more than you might originally think. So next time you need to go digging for more data, first consider flipping on your flashlight.