Skip to content

Add design descriptionf or project 340#343

Closed
tdewanNvidia wants to merge 235 commits intomainfrom
tdewan/Proj-340-design
Closed

Add design descriptionf or project 340#343
tdewanNvidia wants to merge 235 commits intomainfrom
tdewan/Proj-340-design

Conversation

@tdewanNvidia
Copy link
Contributor

Description

Issue - 340

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

RyaliNvidia and others added 30 commits October 1, 2025 14:56
elookpotts-nvidia and others added 27 commits January 9, 2026 13:11
* backend operator redesign doc
… Auth (#177)

* Data/Dataset Auth Check CLIs

* Remove auth check from data service

* Use auth check CLIs in ctrl

* Add exit code to docs

* Fix build issues

* Fix lint

* Ctrl to use user config when validating data auth

* Use the correct CLI argument type

* Fix lint

* Use profile when looking up data credential from config
* Add workflow to label external issues and pull requests

* pin to allowed action version

* add reopened event
* allow flexible squid proxy replicas

* fix
…ration (#167)

* Improving Performance for Uploading Workflow Artifacts in Worker Jobs

* Cleanup

* Add progress writing after upload

* Add dependency in Bazel BUILD

* Add type to mypy requirements

* Update mypy requirements

* Add to mypy_cli BUILD

* Fix lint

* Comment

* Use constant to define semaphor and storage client executor count
* Use last login url if url is not specified

* print message
…e synced (#252)

* Fix sync-feature-branches with no merge conflicts

* Allow a single branch to be specified for sync-feature-branches

* Perform operations as OSMO CI Bot

* Add external label when the PR is created

* extract issue number
* Initial commit for database pooling

* Update set_session

* Fix lint

* Update PostgresConnector to have semaphor to control connections

* Lint fix

* Fix number of maxconn for test

* Address comments
* script to create azure resources and deploy

* Remove auto-generated values files from tracking

- Added .gitignore to ignore values/, *.env files
- Removed values/*.yaml files from git (auto-generated during deployment)

* add aws script

* add aws script

* add copyright

* update copyright
* feat(src): add Azure service account and extra pod labels configuration

- implement service account creation with customizable name and annotations
- enhance service templates to support extra pod labels for various services
- update Azure backend to utilize DefaultAzureCredential for authentication
- add tests for Azure credential extraction and client creation

* feat(src): extract account key from connection string for Azure Blob Storage

- add function to extract AccountKey from connection string
- update AzureBlobStorageClient to handle different credential types

* feat(test): add tests for account key extraction from Azure connection strings

* chore: clean up linting issues for tests

* refactor(src): update data credential types in PostgresConnector and TaskGroup

- change StaticDataCredential to DataCredential in get_all_data_creds method
- update fetch_creds function signature to use DataCredential

* feat(src): update Azure client creation to include storage account and account URL

- remove deprecated storage account extraction function
- modify create_client to accept storage_account and account_url parameters
- update AzureBlobStorageClientFactory to use new parameters
- adjust tests to reflect changes in client creation

🔒 - Generated by Copilot

* refactor(src): mark storage_account parameter as unused in create_client function

🔧 - Generated by Copilot

* refactor(src): remove unused storage_account parameter from client creation

🔧 - Generated by Copilot
…ng (#211)

* Add new project proposal to describe nvlink + topology aware scheduling

* Split design into two docs

* Finish docs and add some updates from feedback

* Add some open items
* add redis utlis, update postgres utils

* add deps
* Initial design spike for dynamic subpool

* Add more context to design

* Address feedback
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.