Welcome to dbt-score

dbt-score is a linter for dbt metadata.

dbt allows data practitioners to organize their data into models and sources. Those models and sources have metadata associated with them: documentation, tests, types, etc.

dbt-score lints and scores this metadata in order to enforce (or encourage) good practices.

Example

> dbt-score lint
🥇 M: customers (score: 10.0)
    OK   dbt_score.rules.generic.has_description
    OK   dbt_score.rules.generic.has_owner
    OK   dbt_score.rules.generic.sql_has_reasonable_number_of_lines
Score: 10.0 🥇

In this example, the model customers scores the maximum value of 10.0 because it passes all the rules. It is also awarded a gold medal for its perfect score.

Philosophy

dbt models/sources are often used as metadata containers: whether in YAML files or through {{ config() }} blocks, they carry a lot of information. In large data teams dealing with many models/sources, enforcing good practices by hand becomes tedious.

To that end, dbt-score has two main features:

  • It runs rules on dbt models and sources, and displays any rule violations. It can be run in interactive environments or in CI.
  • Using those run results, it scores items, to ascribe them a measure of their maturity. This score can help gamify metadata improvements/coverage, and be reflected in data catalogs.
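
As a rough illustration of these two features, the sketch below runs a few rules against a model and turns the results into a score out of 10. It is not dbt-score's implementation: the Model class, the rule functions, and the pass-fraction scoring formula are all hypothetical simplifications chosen for this example, and it does not import dbt-score at all.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Model:
    """Hypothetical stand-in for the metadata of a dbt model."""

    name: str
    description: str = ""
    meta: Dict[str, str] = field(default_factory=dict)


# A rule returns None when the model passes, or a violation message otherwise.
Rule = Callable[[Model], Optional[str]]


def has_description(model: Model) -> Optional[str]:
    """Illustrative rule: the model must have a non-empty description."""
    if not model.description:
        return "Model lacks a description."
    return None


def has_owner(model: Model) -> Optional[str]:
    """Illustrative rule: the model must declare an owner in its meta."""
    if "owner" not in model.meta:
        return "Model lacks an owner."
    return None


def lint(model: Model, rules: List[Rule]) -> Dict[str, Optional[str]]:
    """Run every rule and map each rule name to its violation (or None)."""
    return {rule.__name__: rule(model) for rule in rules}


def score(results: Dict[str, Optional[str]]) -> float:
    """Assumed formula: fraction of passed rules, scaled to 0-10."""
    if not results:
        return 10.0
    passed = sum(1 for violation in results.values() if violation is None)
    return round(10.0 * passed / len(results), 1)


RULES: List[Rule] = [has_description, has_owner]
```

With these definitions, a model that has both a description and an owner scores 10.0, a model with only a description scores 5.0, and a model with neither scores 0.0. An actual scoring scheme could weight rules differently; the equal weighting here is only the simplest choice.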

dbt-score aims to:

  • Provide a predefined set of good practices (the core rules).
  • Allow teams to easily add their own rules.
  • Allow rule sets to be packaged and distributed.
  • Be configurable to adapt to different data stacks and practices.

About

dbt-score is free software, released under the MIT license. It originated at Picnic Technologies in Amsterdam, Netherlands. Source code is available on GitHub.

All contributions, in the form of bug reports, pull requests, feedback, or discussion, are welcome. See the contribution guide for more information.