Data governance and (hence) Metadata Management

Data is a strategic asset for businesses today and it is growing exponentially. Regulations like GDPR, BCBS 239, Basel 3 are making it extremely important for organisations to have full control over their data. Data governance is about availability, usability, integrity and security of data.

Is this something new we are learning here? Actually, No! Data governance has been in place since years. Every system has some or other form of data governance in place. Documents and excel sheets are traditional form of data governance. Could be access control mechanism implemented at system level. Operational fields in a databases like created by/ updated by, are also part of this governance. Audit logs provides useful information for governance purposes. 

But is this sufficient? We are implementing new systems, new functionalities every day. Is the organisation working in silos where individual departments might have different standards? Is managing metadata centrally in excel sheets for huge enterprises a possibility? Do we have an enterprise wide agreed framework? Is the process repeatable? Will we be able to demonstrate our organisation’s data compliance when needed? 
These questions makes us think about data governance more seriously.

The problem with DG programmes is they sound too academic. What’s the ROI they bring in?! Well, the ROI is compliance. And process efficiencies. And it is going help businesses in short term and even more in long term. 
Involvement of all people, all departments is also critical for the success of a DG programme. Hence, educating people, involving them from the start is important. Breaking the silos is the key for a DG programme. 
Starting small with a specific objective is a good way to start. Though we start small, we need to keep an eye on the long term goal as well. 

There are multiple business and technology focus areas for DG. Like, process standardisation from business perspective.
Here we will talk about Metadata Management and Data Quality. I see these two as pillars of DG implementation. 
In wide enterprises, it is going to be extremely difficult to know what data you have and how to access it. There are many metadata management tools available in the market. Some are even open sourced. These tools not only capture the metadata automatically, but they also provide lineage, relationships, description, usage and ownership. Some tools even have built in machine learning which helps to categorise the data automatically. The tools are intuitive, user friendly, with Google like search functionality, which allows users to search by technical metadata or business term or even tag name of the field. Imagine, when we are designing new APIs, we won’t have to struggle in understanding what data we have and where to find it!
Built in business glossaries integrate the business and IT and allows them to work together.

Data quality and maintaining the customer consents globally for marketing and other purposes  is also a part of DG. A good DQ design will help in data completeness, accuracy, integrity, identifying and merging duplicates and standardising the data.


Implementing (and executing!) data governance practices will help businesses in becoming data driven. Organisations can become more compliant and be able to demonstrate the process, data security and access control.

Comments

Popular posts from this blog

Understanding Wide-Column Stores

Two pizza team!

Relational and Non-relational databases