Render Config#
Cosmos aims to give you control over how your dbt project is rendered as an Airflow DAG or Task Group.
It does this by exposing a cosmos.config.RenderConfig
class that you can use to configure how your DAGs are rendered.
The RenderConfig
class takes the following arguments:
emit_datasets
: whether or not to emit Airflow datasets to be used for data-aware scheduling. Defaults to True. Depends on additional dependencies. If a model in the project has a name containing multibyte characters, the dataset name will be URL-encoded.test_behavior
: how to run tests. Defaults to running a model’s tests immediately after the model is run. For more information, see the Testing Behavior section.load_method
: how to load your dbt project. See Parsing Methods for more information.select
andexclude
: which models to include or exclude from your DAGs. See Selecting & Excluding for more information.selector
: (new in v1.3) name of a dbt YAML selector to use for DAG parsing. Only supported when usingload_method=LoadMode.DBT_LS
. See Selecting & Excluding for more information.dbt_deps
: A Boolean to run dbt deps when using dbt ls for dag parsing. Default Truenode_converters
: a dictionary mapping aDbtResourceType
into a callable. Users can control how to render dbt nodes in Airflow. Only supported when usingload_method=LoadMode.DBT_MANIFEST
orLoadMode.DBT_LS
. Find more information below.dbt_executable_path
: The path to the dbt executable for dag generation. Defaults to dbt if available on the path.env_vars
: (available in v1.2.5, use``ProjectConfig.env_vars`` for v1.3.0 onwards) A dictionary of environment variables for rendering. Only supported when usingload_method=LoadMode.DBT_LS
.dbt_project_path
: Configures the DBT project location accessible on their airflow controller for DAG rendering - Required when usingload_method=LoadMode.DBT_LS
orload_method=LoadMode.CUSTOM
airflow_vars_to_purge_cache
: (new in v1.5) Specify Airflow variables that will affect theLoadMode.DBT_LS
cache. See Caching for more information.
Customizing how nodes are rendered (experimental)#
There are circumstances when choosing specific Airflow operators to represent a dbt node is helpful. An example could be to use an S3 sensor to represent dbt sources or to create custom operators to handle exposures. Your pipeline may even have specific node types not part of the standard dbt definitions.
The following example illustrates how it is possible to tell Cosmos how to convert two different types of nodes (source
and exposure
) into Airflow:
# Cosmos will use this function to generate an empty task when it finds a source node, in the manifest.
# A more realistic use case could be to use an Airflow sensor to represent a source.
def convert_source(dag: DAG, task_group: TaskGroup, node: DbtNode, **kwargs):
"""
Return an instance of a desired operator to represent a dbt "source" node.
"""
return EmptyOperator(dag=dag, task_group=task_group, task_id=f"{node.name}_source")
# Cosmos will use this function to generate an empty task when it finds a exposure node, in the manifest.
def convert_exposure(dag: DAG, task_group: TaskGroup, node: DbtNode, **kwargs):
"""
Return an instance of a desired operator to represent a dbt "exposure" node.
"""
return EmptyOperator(dag=dag, task_group=task_group, task_id=f"{node.name}_exposure")
# Use `RenderConfig` to tell Cosmos, given a node type, how to convert a dbt node into an Airflow task or task group.
# In this example, we are telling Cosmos how to convert dbt source and exposure nodes.
# When building the Airflow DAG, if the user defined the conversion function, Cosmos will use it.
# Otherwise, it will use its standard conversion function.
render_config = RenderConfig(
node_converters={
DbtResourceType("source"): convert_source, # known dbt node type to Cosmos (part of DbtResourceType)
DbtResourceType("exposure"): convert_exposure, # dbt node type new to Cosmos (will be added to DbtResourceType)
}
)
# `ProjectConfig` can pass dbt variables and environment variables to dbt commands. Below is an example of
# passing a required env var for the profiles.yml file and a dbt variable that is used for rendering and
# executing dbt models.
project_config = ProjectConfig(
DBT_ROOT_PATH / "simple",
env_vars={"DBT_SQLITE_PATH": DBT_SQLITE_PATH},
dbt_vars={"animation_alias": "top_5_animated_movies"},
)
example_cosmos_sources = DbtDag(
# dbt/cosmos-specific parameters
project_config=project_config,
profile_config=profile_config,
render_config=render_config,
# normal dag parameters
schedule_interval="@daily",
start_date=datetime(2023, 1, 1),
catchup=False,
dag_id="example_cosmos_sources",
)
When defining the mapping for a new type that is not part of Cosmos’ DbtResourceType
enumeration, users should use the syntax DbtResourceType("new-node-type")
as opposed to DbtResourceType.EXISTING_TYPE
. It will dynamically add the new type to the enumeration DbtResourceType
so that Cosmos can parse these dbt nodes and convert them into the Airflow DAG.