Successful businesses today are driven by their use of data. How they organize, access and leverage the vast amounts of data within their databases is what gives them their competitive advantage.
And these businesses spend significant amounts of money to manage, protect, store and access their structured data sets, particularly following the recent surge of complex data regulations, such as the GDPR and the California Consumer Privacy Act of 2018.
However, structured data is only one part of the story. That’s because structured data is just a small part of what a typical large-scale organization actually holds. The problem is, they have no visibility over the other chunk of data - which is called ‘unstructured data’. This unstructured data consists of business files such as Word, Excel and .pdf.
The data explosion has left this unstructured data scattered all over the place - nobody knows what it is, who uses it, where it is stored and who else has access to it. What’s worse is that the amount of unstructured data within an organization is typically around 80% of their total data assets.
The impossible challenge
Unstructured data previously presented an unparalleled challenge given its vast abundance and lack of categorization. For an organization with 100 million files to categorize (which isn’t a lot for large businesses), it would take them 400 years to organize them if they took just 5 minutes looking at each document manually.
It’s not really a surprise, therefore, that many are simply ignoring this process, or have no idea where to start addressing it. What was needed is a way to automate the sifting through the mass amounts of unstructured data.
Why? So organizations can find the hidden value of each record they own. Think of it as finding diamonds in the dirt. Without automating this process you have zero visibility of unstructured data, which additionally presents a compliance and security risk.
Finding the value of that file
Previously, enterprise organizations could not search their data sets as they knew not what they had. These organizations had no idea if their unstructured data was full of valuable golden nuggets, or piles of garbage that pose a risk.
The basic tools available would be either so slow the task would never complete or so inaccurate companies could not tell their valuable data from the trash.
That’s why we spent years developing a focussed Artificial Intelligence engine, designed to quickly and accurately identify the majority of files in the unstructured data sets of the world’s largest companies, and categorize the data by its hidden value on a per record basis.
Our aim is to provide those companies with visibility for the first time, and they can now automatically identify and categorize each file by its value – quickly and accurately.
They can now focus their efforts based on value and risk, bringing organization to the unstructured chaos.
Using AI to do data management differently
Imagine if you had the power to itemize and manage your data by its monetary value. If you could, would you do things differently? Would you focus your efforts on your crown jewels or your worthless trash? Would you plug your data risks before they are exposed? Would you limit access to sensitive files with a click of a button? Would you quickly clean up the abundance on non-compliant CVs you have residing within your network?
This year we've been working with the Ponemon Institute, researching global enterprise organizations. We’ve asked their business content owners, stewards and creators, along with DBA’s, Heads of IT, CSIO’s, CDO’s, CIO’s, business owners, etc what value they place on the data and information held by their organizations. We think you’ll be surprised at the differing values placed on sets of data within global organizations by your peers and those responsible for the information.
By joining lead researcher Larry Ponemon of The Ponemon Institute, Doug Laney, author of the book 'Infonomics', and DocAuthority in November you'll be able to access this unique research paper, analysis and start to gain an insight into the true value of the unstructured data your organization is sitting on.
by Alan Weintraub January 2019
by Mike Quinn December 2018