The Way forward for Knowledge Catalogs Lies in Lively Metadata – Atlan
7 mins read

The Way forward for Knowledge Catalogs Lies in Lively Metadata – Atlan


Final Friday, Knowledge Twitter was buzzing with Josh Wills’ tweet about metadata and enterprise intelligence.

At Atlan, we began as an information group, and we failed 3 times at implementing an information catalog. We discovered that the most important motive knowledge catalogs fail is the consumer expertise. This isn’t nearly a good looking consumer interface although. It’s about actually understanding how individuals work and giving them the very best expertise.

Individuals like Josh need context the place they’re, once they want it.

For instance, whenever you’re in a BI device like Looker, you inevitably suppose, “Do I belief this dashboard?” or “What does this metric imply?” And the very last thing anybody desires to do is open up one other device (aka the standard knowledge catalog), seek for the dashboard, and flick thru metadata to reply that query.

Think about a world the place knowledge catalogs don’t reside in their very own “third web site”. As an alternative, a consumer can get all of the context the place they want it — both within the BI device of their alternative or no matter device they’re already in, whether or not that’s Slack, Jira, the question editor, or the information warehouse.

Active metadata - Atlan + Looker
Lively metadata in Looker

I consider that is the way forward for knowledge catalogs — activating metadata and bringing metadata again into the day by day workflows of information groups.

In Josh’s phrases, ‘It’s like reverse ETL however for metadata’.

Why don’t knowledge catalogs work like this in the present day?

Historically, knowledge catalogs had been constructed to be passive. They introduced metadata from a bunch of various instruments into one other device referred to as the “knowledge catalog” or the “knowledge governance device”.

The issue with this strategy — it tries to unravel a “too many silos” drawback by including another siloed device. That doesn’t remedy the issue that customers like Josh face every single day. Ultimately, consumer adoption suffers!

A senior knowledge chief at a big firm referred to as these knowledge catalogs “costly shelfware”, or software program that sits on the shelf and by no means will get used.

Active metadata vs passive metadata (the old way of data cataloging)

How can we save knowledge catalogs from turning into shelfware?

Take into consideration the fashionable instruments we use and love in the present day — GitHub, Figma, Slack, Notion, Superhuman, and so forth.

One widespread factor throughout all these instruments is the idea of movement. Within the phrases of Rahul Vora (Founding father of Superhuman):

Move is a magical feeling.

Time melts away. Your fingers dance throughout the keyboard. You’re pushed by boundless vitality and a wellspring of creativity — you’re utterly absorbed by your job.

Move turns work into play.

Rahul Vora, Superhuman

At Atlan, as we’ve labored with almost 100 knowledge groups, we’ve realized that movement is the key to magical knowledge experiences.

These nice consumer experiences aren’t in regards to the macro-flows. They’re about micro-flows, like not having to change to a separate knowledge catalog to get context for the dashboards in your BI device. There are dozens of micro-flows like this that may energy magical experiences and utterly change the best way that knowledge customers really feel about their work. And therein lies the promise of an lively metadata platform.

Reaching movement via lively metadata

The one actuality in knowledge groups is range — a range of individuals, instruments, and know-how. Variety that results in chaos and sub-optimal experiences for everybody concerned.

The important thing to wrangling this range and attaining movement lies in metadata. It’s the widespread thread throughout all of our instruments that offers the context we’re desperately missing each time we bounce between instruments to determine what’s occurring with an information mission.

As an alternative of simply accumulating metadata from the remainder of the stack and bringing it again right into a passive knowledge catalog, lively metadata platforms make a two-way motion of metadata doable, sending enriched metadata again into each device within the knowledge stack.

Image this:

  • Whenever you’re looking via the lineage of an information asset and discover a problem, you’ll be able to create a Jira ticket proper then and there.
  • Whenever you ask a query a couple of knowledge asset in Slack, a bot brings context about that asset on to you into Slack.
  • If you find yourself pushing to manufacturing in GitHub, a bot runs via the lineage and dependencies and offers you a “inexperienced” standing that you just’re not going to interrupt something — proper in GitHub.
Activating experiences with active metadata

Going past the information catalog

The “knowledge catalog” is only a single use case of metadata — serving to customers perceive their knowledge property. However that hardly scratches the floor of what metadata can do.

Activating metadata holds the important thing to dozens of use circumstances like observability, price administration, remediation, high quality, safety, programmatic governance, auto-tuned pipelines, and extra.

A real lively metadata platform could make the clever knowledge platform dream a actuality.

Right here’s an instance of the way it might work:

  1. An lively metadata platform can use previous utilization metadata from BI instruments to grasp which dashboards are used probably the most and when individuals use them.
  2. Finish-to-end lineage connects these dashboards to the tables that energy them within the knowledge warehouse.
  3. Operational metadata exhibits linked compute workloads, related knowledge pipelines, and run occasions.

Couldn’t we use all of this data to auto-tune our pipelines and compute, optimizing for an excellent consumer expertise (up to date knowledge within the dashboard when individuals want it, and greatest efficiency on the time of max utilization) whereas minimizing prices?

Active metadata platform

Past that, the use circumstances of lively metadata are limitless. It has the potential to convey intelligence and movement to each a part of the information stack and actually act because the gateway to the information stack of our desires — a very clever knowledge system.

  • Mechanically deduce the house owners and consultants for knowledge tables or dashboards based mostly on SQL question logs
  • Mechanically cease downstream pipelines when an information high quality problem is detected, and use previous information to foretell what went incorrect and repair it with out human intervention
  • Mechanically purge low-quality or outdated knowledge merchandise
  • and way more

It could sound loopy. However in a world with self-driving vehicles, good homes, and rovers that navigate themselves throughout Mars, why can’t we think about a better knowledge expertise powered by our wealth of metadata?


Wish to study extra about third-generation knowledge catalogs and the rise of lively metadata? Try our e-book!



Leave a Reply

Your email address will not be published. Required fields are marked *