Split Translate and Speech into separate stages with TTS word timestamps by elasticsounds · Pull Request #268 · unicef/adt-studio

elasticsounds · 2026-04-08T03:04:29Z

Summary

Split Translate/Speech stages: The monolithic Translate stage is now two independent pipeline stages (Translate and Speech) with separate DAG nodes, settings panels, and run controls
Word-level timestamps: Whisper-based word timestamp generation with inline playback highlighting, editable multi-column timecode tables with custom TimecodeInput controls, and background task queue with progress tracking
Whisper accuracy: Source text passed as prompt parameter to Whisper API for better transcription alignment
Language picker: Country lists updated to show only linguistically relevant countries when browsing; all countries still searchable by typing
UI polish: Visual separator between run card controls and config sections; task progress messages displayed in sidebar

Test plan

Verify Translate and Speech stages appear independently in the sidebar and can be run separately
Run Speech generation, then calculate timestamps — confirm background task shows progress (e.g. 42/632)
Play audio and verify word-by-word highlighting syncs with playback
Expand timestamp viewer, edit timecodes, save — confirm edits persist
Re-run translations and verify pipeline completes without errors
Open language picker and confirm only relevant countries appear per language

…estamps and word highlighting Separate the monolithic Translate stage into distinct Translate and Speech stages with independent settings, DAG dependencies, and UI views. Add Whisper-based word timestamp generation with inline playback highlighting, editable multi-column timecode tables, and background task queue for batch transcription. Improve language picker to show only linguistically relevant countries, add visual separation to stage run cards, and pass source text as Whisper prompt for better accuracy.

…tions

Write timestamps incrementally during batch transcription instead of accumulating in a stale snapshot, preventing concurrent user edits from being silently overwritten. Show human-readable language name (e.g. "English") instead of locale code in the generate timestamps confirmation.

Fixes CI lint failure where `inputClass` (a Tailwind class string variable) was flagged as an unlocalized string.

gbergengruen · 2026-04-08T12:11:44Z

@elasticsounds, I saw that the language and speech steps are independent now, but you need to use the language step to run the speech step, as it seems to do some required step to generate the tts. The timestamps are amazing. I loved it.

tts-timestamps node data was not included in clearNodesByType calls alongside tts, leaving stale word-timestamp data after speech deletion or upstream page/caption edits.

elasticsounds added 4 commits April 7, 2026 23:04

Add confirmation dialogs for delete speech and generate timestamps ac…

afd1a48

…tions

Add Class-suffix pattern to ESLint ignoreNames for CSS class variables

1114f9a

Fixes CI lint failure where `inputClass` (a Tailwind class string variable) was flagged as an unlocalized string.

Fix tts-timestamps data not cleared when deleting TTS or editing pages

3a8daef

tts-timestamps node data was not included in clearNodesByType calls alongside tts, leaving stale word-timestamp data after speech deletion or upstream page/caption edits.

nicpottier merged commit f9055bd into main Apr 8, 2026
3 checks passed

nicpottier deleted the elasticsounds/split-translate-speech branch April 8, 2026 16:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split Translate and Speech into separate stages with TTS word timestamps#268

Split Translate and Speech into separate stages with TTS word timestamps#268
nicpottier merged 5 commits intomainfrom
elasticsounds/split-translate-speech

elasticsounds commented Apr 8, 2026

Uh oh!

gbergengruen commented Apr 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

elasticsounds commented Apr 8, 2026

Summary

Test plan

Uh oh!

gbergengruen commented Apr 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants