Large number of Transactions for the logs when using WASB #34973
Replies: 3 comments 5 replies
-
Remote shares, especially cloud-based ones, can make life painful; see: What resources might limit Scheduler’s performance. In addition, this issue was reported on SO in the past; it seems that some stat calls can spawn additional Azure File Share transactions (whatever they were).
-
Hi @ahmadfarhan97! Did you manage to find a solution for this?
-
We were also facing this problem using the helm chart. Turns out that the We solved it by disabling logging persistence at the helm chart level, and enabling remote logging at the airflow configuration level. We also changed all the remaining storage classes to The relevant parts of the helm template look like this:

config:
  logging:
    remote_logging: "True"
    remote_log_conn_id: "wasb"  # in our case we created the connection using the AzureKeyVaultBackend integration
    remote_base_log_folder: "wasb://logs@${STORAGE_ACCOUNT_NAME}.blob.core.windows.net"
  azure_remote_logging:
    remote_wasb_log_container: "logs"
logs:
  persistence:
    enabled: false
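
For anyone who sets Airflow options through environment variables instead of the chart's `config:` block, the same logging settings can be expressed using Airflow's `AIRFLOW__{SECTION}__{KEY}` naming convention. The sketch below only derives the variable names. The helper `airflow_env_name` is a hypothetical name used for illustration, and the values are copied from the snippet above rather than tested against a live deployment.

```python
# Sketch: derive the environment-variable names that correspond to the
# [logging] options shown above, using Airflow's documented
# AIRFLOW__{SECTION}__{KEY} convention.
# `airflow_env_name` is a hypothetical helper, not part of Airflow.


def airflow_env_name(section: str, key: str) -> str:
    """Return the env var name Airflow reads for a given section/key."""
    return f"AIRFLOW__{section.upper()}__{key.upper()}"


# Values copied from the Helm snippet above; the storage account name is
# left as the same placeholder the original comment used.
logging_settings = {
    "remote_logging": "True",
    "remote_log_conn_id": "wasb",
    "remote_base_log_folder": "wasb://logs@${STORAGE_ACCOUNT_NAME}.blob.core.windows.net",
}

env = {airflow_env_name("logging", k): v for k, v in logging_settings.items()}

for name, value in env.items():
    print(f"{name}={value}")
```

Whether to use the `config:` block or environment variables is a deployment preference; both end up setting the same Airflow options.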
-
We are using WASB (Azure storage account, file share) for logging, and the number of transactions between Airflow and the file share (used solely for logging) is very large: more than 1 million transactions per hour.
I am wondering what in the app continuously requires read/write transactions and how to reduce that number.
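
To put that figure in perspective, here is a rough conversion of the reported hourly count into per-minute and per-second rates. It is plain arithmetic on the 1 million lower bound quoted above, not a measurement, and the variable names are illustrative.

```python
# Sketch: convert the reported hourly transaction count into smaller units.
# 1_000_000 is the lower bound quoted in the post ("more than 1 million").
TRANSACTIONS_PER_HOUR = 1_000_000
MINUTES_PER_HOUR = 60
SECONDS_PER_HOUR = 3600

per_minute = TRANSACTIONS_PER_HOUR / MINUTES_PER_HOUR
per_second = TRANSACTIONS_PER_HOUR / SECONDS_PER_HOUR

print(f"at least {per_minute:,.0f} transactions per minute")
print(f"at least {per_second:,.0f} transactions per second")
```

A sustained rate of a few hundred transactions per second is far more than writing task logs for a handful of DAGs would explain, which suggests something is repeatedly reading or scanning the share.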
Currently we have 4 DAGs running, 2 of which run only on an hourly schedule. Airflow is deployed to a Kubernetes cluster using the Helm chart (v1.10.0). We are using a persistent volume to mount the storage account.
The PV and PVC look like this:
This is what we have in the values.yaml: