What's the recommended way to capture full call transcripts + per-turn latencies from a pipecat pipeline?

### pipecat version

_No response_

### Python version

_No response_

### Operating System

_No response_

### Question

hey everyone,

i am building an analytics layer on top of pipecat voice bot and trying to figure out the cleanest approach to capture:

1. full transcript — user utterances and agent responses, each with a start_ms / end_ms relative to the call start
2. tool calls — which function was called, with what args, what it returned, and when
3. per-turn latencies — things like stt latency, llm-ttfb, tts synthesis time, and e2e turn latency (user stops speaking → bot starts speaking)




### What I've tried

i first tried adding a custom `Observer` that oversees the whole pipeline, but I didn’t make much progress and it became very complicated. then implemented a custom `ProcessorFrame` that intercepts other frames to capture the generated llm transcription and so on, but I ran into many issues and it got complicated very quickly — especially with race conditions between frames and similar problems.

### Context

i am looking for any recommendations for examples, patterns, or existing integrations 
that would be really appreciated..

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's the recommended way to capture full call transcripts + per-turn latencies from a pipecat pipeline? #3977

pipecat version

Python version

Operating System

Question

What I've tried

Context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

What's the recommended way to capture full call transcripts + per-turn latencies from a pipecat pipeline? #3977

Description

pipecat version

Python version

Operating System

Question

What I've tried

Context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions