Published Friday September 6, 2024

Data Catalog 2.0 — Refining AI

Brandon Strittmatter

@burcs

We're wrapping up this mini hackathon with one more hit: our all-new Data Catalog! We rebuilt it based on your feedback (yes, you, our beloved users!). We knew it needed to be more powerful, more customizable, and, most importantly, a significant improvement over the previous version.

What's different?

Well, for starters, we made it accessible to everyone! We realized that gating data catalogs behind a paywall went against our mantra of making data accessible to everyone. So now you too can dive in and add metadata, create definitions, and make sense of your data. But that's not all - we've also added several exciting new features!

What new features?

  1. Column Toggling: You can now toggle your columns on and off. Columns you toggle off are no longer used as part of the knowledge base for EZQL (our AI agent) when it queries your data. This helps hide unnecessary information and exclude redundant data, reducing confusion for both your team and the AI.

  2. Metadata Descriptions: We've introduced the ability to easily add metadata descriptions to all of your columns. This gives both the AI and your team context about what they're looking at. Let's face it, not all of us have pristine data (and if it's not pristine, it's definitely another engineer's fault, not ours!). Now you can quickly add notes describing the purpose of each column, bringing clarity to your data structure.

  3. Sample Data Addition: You can now add sample data to your columns, giving the AI insight into the structure of the data it's querying. For those who don't know, we never store your actual data or query it directly - we just use your schema. This meant our AI previously couldn't infer the expected format of the data. So if you asked for "all signups from Texas users," and your column actually used 2-letter abbreviations like TX, you'd never get accurate results. That's no longer the case!
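
To make the three features above a little more concrete, here is a minimal sketch of how toggles, descriptions, and sample values could shape the schema context an AI agent sees. This is purely illustrative: `CatalogColumn`, `build_schema_context`, and the `signups` table are hypothetical names invented for this example, and none of it reflects how the Data Catalog or EZQL is actually implemented.

```python
from dataclasses import dataclass, field


@dataclass
class CatalogColumn:
    """Hypothetical catalog entry for one column (not the product's real schema)."""
    name: str
    sql_type: str
    enabled: bool = True      # column toggle
    description: str = ""     # metadata description
    samples: list[str] = field(default_factory=list)  # sample values


def build_schema_context(table: str, columns: list[CatalogColumn]) -> str:
    """Render the schema text an AI agent might receive for one table."""
    lines = [f"Table {table}:"]
    for col in columns:
        if not col.enabled:
            continue  # toggled-off columns are left out entirely
        line = f"- {col.name} ({col.sql_type})"
        if col.description:
            line += f": {col.description}"
        if col.samples:
            line += f" [e.g. {', '.join(col.samples)}]"
        lines.append(line)
    return "\n".join(lines)


# Assumed example data: a signups table with one toggled-off column.
columns = [
    CatalogColumn("id", "integer"),
    CatalogColumn("state", "text",
                  description="US state of the user",
                  samples=["TX", "CA"]),
    CatalogColumn("legacy_flag", "boolean", enabled=False),
]

context = build_schema_context("signups", columns)
print(context)
```

In this sketch the toggled-off `legacy_flag` column never reaches the context, while the `TX` and `CA` samples hint that a question about "Texas users" should filter on a 2-letter abbreviation.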

What's next?

This is a fresh start on an established idea - a new canvas, if you will. We're excited to continue pushing the boundaries of AI and data, and the Data Catalog is the perfect place to continue those efforts. In the coming months, we plan on releasing numerous new features, including:

  • Data masking

  • dbt integrations

  • Auto-generation of metadata

  • And a whole lot more!

Ready to dive in? Sign up and explore the new Data Catalog (remember, it's free now!). Start by adding metadata to your most important columns, then experiment with toggling columns on and off to see how it affects your queries.

Stay tuned for all the exciting updates we'll be shipping in the near future. Your data journey is about to get a whole lot smoother!
