Friday, March 25, 2022
HomeBig DataAtlan + Airflow: Higher Pipeline Monitoring and Information Lineage with Our Latest...

Atlan + Airflow: Higher Pipeline Monitoring and Information Lineage with Our Latest Integration – Atlan

One morning at 8 am, I woke as much as the Cupboard Minister of India calling me. He mentioned, “Prukalpa, the quantity on this dashboard doesn’t appear proper.”

Frantic, I opened up my laptop computer and loaded the dashboard to understand the quantity was clearly off. And but, at that second, there was nothing I might do to clarify it. I might really feel myself shedding the credibility and hard-earned belief that had taken months to construct.

I known as my Undertaking Supervisor, who was improbable at stakeholder administration however couldn’t perceive the nitty-gritties of knowledge. She known as our Information Analyst, who appeared on the dashboard and mentioned, “Looks as if one thing broke down within the pipeline”. Our Analyst then known as our solely Information Engineer, who pulled out logs from Apache Airflow. However he couldn’t troubleshoot it as a result of he didn’t know what the variables meant and didn’t have the information context.

It took us 8 hours and 4 folks to determine what went mistaken. We misplaced time that day.

However extra importantly, we misplaced belief. Belief with our buyer. Belief in our group.

Belief is usually not about issues breaking. In years of working with knowledge, I’ve discovered that knowledge will at all times be chaos. However when issues break and you discover out too late, or you’ll be able to’t clarify why one thing broke, that’s what breaks belief.

Think about if, at that second when the cupboard minister known as me, I might rapidly open a dashboard and say, “Sure, looks as if the pipeline didn’t run on time as we speak. We’ve obtained an alert and it has already been escalated to knowledge engineering.” And even higher, think about if the dashboard had an alert on it, signaling to the minister that one thing was mistaken and he shouldn’t use it.

As we speak we’re excited to announce that Atlan natively integrates with Apache Airflow. For knowledge groups all over the place, this implies extra transparency and belief, and fewer time spent debugging pipelines after a damaged dashboard or mismatched metrics.

Atlan + Airflow: Constructing an ecosystem of belief and transparency

With this integration, knowledge groups can construct higher knowledge engineering experiences centered round constructing information and belief of their knowledge.

First, Atlan’s integration with Airflow brings much-needed pipeline context to knowledge property.

Now you’ll be able to share any sort of metadata from Airflow pipelines to Atlan knowledge asset profiles, the place knowledge analysts, scientists, and enterprise customers have entry to it. This opens up pipeline context and makes it absolutely clear in order that knowledge groups and shoppers can at all times know the standing of the information pipeline related to every knowledge asset.

Listed here are some nice context fields that we’ve seen folks convey from Airflow to Atlan:

  • Freshness: When was my desk final up to date?
  • Run schedule: Did the pipeline run as anticipated?
  • Pipeline standing: Was the final pipeline run profitable?
Customized Airflow metadata on an Atlan asset profile

Atlan already connects to knowledge warehouses (e.g. Snowflake, Redshift) and BI instruments (e.g. Tableau and Looker). Bringing Airflow into this ecosystem additionally signifies that knowledge groups can now map relationships throughout all of their knowledge. Whether or not you’re loading in new knowledge, revising a pipeline, or establishing a dashboard, now you can assemble and visualize knowledge lineage from finish to finish.

Atlan: Tableau assets linked with source Snowflake tables
Tableau property linked with supply Snowflake tables

Much less time debugging, extra time constructing

Getting an pressing name about damaged knowledge is without doubt one of the worst experiences for a knowledge group. As an alternative of calling everybody who has ever touched the information, now you can diagnose the issue in seconds.

All it takes is opening a knowledge asset profile and checking the pipeline standing and metrics. No extra hours of scrambling or damaged belief, Atlan and Airflow’s integration enables you to see all your knowledge and its context in a single place.

Able to get began with this integration? Take a look at a demo of Atlan.

Listed here are two sources that will help you get began with bringing Airflow and Atlan collectively:



Please enter your comment!
Please enter your name here

Most Popular

Recent Comments