Add helper scripts for import automation#1927

Open
vish-cs wants to merge 1 commit into datacommonsorg:master from vish-cs:rollback

Conversation

vish-cs (Contributor) commented Mar 23, 2026

Add helper scripts for import automation

  • Retry failed ingestion job
    Added a script that identifies failed imports within a specific Dataflow job,
    reverts the failed imports in the Spanner database to their last known
    good version, resets any 'PENDING' imports back to 'STAGING', and
    optionally retriggers the Spanner ingestion workflow.

  • Generate provisional nodes
    Added a script that scans MCF files for missing references and generates
    provisional nodes for them. Optionally, it checks the missing references
    against a Spanner Graph instance before producing provisional nodes.

  • Convert DC manifest files to MCF
    Added a script that converts DC manifest text proto files (Import,
    Datasource, etc.) to MCF format for consumption by the ingestion pipeline.
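
For readers unfamiliar with the "provisional nodes" step, here is a minimal Python sketch of the idea, not the code in this PR. It assumes references are written with a `dcid:`, `dcs:`, or `schema:` prefix and that a provisional node is emitted with `typeOf: dcs:ProvisionalNode`. The function names and the `exists_in_graph` callback (a stand-in for the optional Spanner Graph lookup) are hypothetical.

```python
import re

# Namespaced reference such as dcid:geoId/06 or dcs:State (assumed syntax).
REF_PATTERN = re.compile(r'\b(?:dcid|dcs|schema):([\w/.\-]+)')


def parse_mcf(mcf_text):
    """Yield (property, value) pairs for each 'prop: value' line."""
    for line in mcf_text.splitlines():
        line = line.strip()
        if not line or line.startswith('#') or ':' not in line:
            continue
        prop, value = line.split(':', 1)
        yield prop.strip(), value.strip()


def find_missing_references(mcf_text):
    """Return sorted DCIDs that are referenced but never defined in the text."""
    defined, referenced = set(), set()
    for prop, value in parse_mcf(mcf_text):
        if prop == 'Node':
            defined.update(REF_PATTERN.findall(value))
        elif prop == 'dcid':
            defined.add(value.strip('"'))
        else:
            # Drop quoted strings so text values are not treated as references.
            unquoted = re.sub(r'"[^"]*"', '', value)
            referenced.update(REF_PATTERN.findall(unquoted))
    return sorted(referenced - defined)


def generate_provisional_nodes(mcf_text, exists_in_graph=None):
    """Build MCF for missing references.

    exists_in_graph is a hypothetical callback standing in for the optional
    Spanner Graph check: references it reports as present are skipped.
    """
    missing = find_missing_references(mcf_text)
    if exists_in_graph is not None:
        missing = [dcid for dcid in missing if not exists_in_graph(dcid)]
    # 'ProvisionalNode' is an assumed type name used for illustration only.
    return '\n\n'.join(
        f'Node: dcid:{dcid}\ntypeOf: dcs:ProvisionalNode' for dcid in missing)


SAMPLE = '''
Node: dcid:geoId/06
typeOf: dcs:State
name: "California dcid:notARef"
containedInPlace: dcid:country/USA
'''

print(generate_provisional_nodes(SAMPLE, exists_in_graph={'State'}.__contains__))
```

Passing the lookup as a callback keeps the scan independent of any database client, so the same function runs with or without the graph check.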

gemini-code-assist[bot]

This comment was marked as outdated.

vish-cs requested a review from ajaits March 23, 2026 17:18
vish-cs changed the title from "Add script to retry failed ingestion job" to "Add helper scripts for import automation" Apr 3, 2026
vish-cs marked this pull request as draft April 7, 2026 15:12
vish-cs marked this pull request as ready for review April 27, 2026 08:48
datacommonsorg deleted a comment from gemini-code-assist[bot] Apr 27, 2026