Feature Request: partition_by support for window aggregations

### Description

Following in the steps of [551](https://github.com/Nixtla/mlforecast/pull/551), it would be great to add support for `partition_by` in window aggregations.

<html>
<body>
<h2 data-start="302" data-end="316">Description</h2>
<p data-start="318" data-end="442">It would be useful to support <strong data-start="348" data-end="410">SQL-like window aggregations with <code data-start="384" data-end="398">PARTITION BY</code> semantics</strong> in lag/rolling transformations.</p>
<p data-start="444" data-end="634">Currently, <code data-start="455" data-end="467">mlforecast</code> supports <code data-start="477" data-end="486">groupby</code> in rolling transformations (e.g. <code data-start="520" data-end="537">RollingQuantile</code>), which aggregates values <strong data-start="564" data-end="595">across multiple time series</strong> that share the same value in a column.</p>
<p data-start="636" data-end="644">Example:</p>
<pre class="overflow-visible! px-0!" data-start="646" data-end="716"><div class="relative w-full my-4"><div class=""><div class="relative"><div class="h-full min-h-0 min-w-0"><div class="h-full min-h-0 min-w-0"><div class="border border-token-border-light border-radius-3xl corner-superellipse/1.1 rounded-3xl"><div class="h-full w-full border-radius-3xl bg-token-bg-elevated-secondary corner-superellipse/1.1 overflow-clip rounded-3xl lxnfua_clipPathFallback"><div class="pointer-events-none absolute inset-x-4 top-12 bottom-4"><div class="pointer-events-none sticky z-40 shrink-0 z-1!"><div class="sticky bg-token-border-light"></div></div></div><div class=""><div class="relative z-0 flex max-w-full"><div id="code-block-viewer" dir="ltr" class="q9tKkq_viewer cm-editor z-10 light:cm-light dark:cm-light flex h-full w-full flex-col items-stretch ͼd ͼr"><div class="cm-scroller"><div class="cm-content q9tKkq_readonly"><span class="ͼm">RollingQuantile</span><span>(</span><span class="ͼm">p</span><span class="ͼg">=</span><span class="ͼj">0.5</span><span>, </span><span class="ͼm">window_size</span><span class="ͼg">=</span><span class="ͼj">3</span><span>, </span><span class="ͼm">groupby</span><span class="ͼg">=</span><span>[</span><span class="ͼk">"brand"</span><span>])</span></div></div></div></div></div></div></div></div></div><div class=""><div class=""></div></div></div></div></div></pre>
<p data-start="718" data-end="822">This aggregates values from <strong data-start="746" data-end="793">all time series belonging to the same brand</strong> within the specified window.</p>
<p data-start="824" data-end="1034">However, there are many forecasting use cases where the desired behavior is different:<br data-start="910" data-end="913">
we want to aggregate <strong data-start="934" data-end="965">within the same time series</strong>, but <strong data-start="971" data-end="1033">only over rows that share the same value in another column</strong>.</p>
<p data-start="1036" data-end="1097">This corresponds to SQL window functions with <code data-start="1082" data-end="1096">PARTITION BY</code>.</p>
<p data-start="1099" data-end="1111">Example SQL:</p>
<pre class="overflow-visible! px-0!" data-start="1113" data-end="1346"><div class="relative w-full my-4"><div class=""><div class="relative"><div class="h-full min-h-0 min-w-0"><div class="h-full min-h-0 min-w-0"><div class="border border-token-border-light border-radius-3xl corner-superellipse/1.1 rounded-3xl"><div class="h-full w-full border-radius-3xl bg-token-bg-elevated-secondary corner-superellipse/1.1 overflow-clip rounded-3xl lxnfua_clipPathFallback"><div class="pointer-events-none absolute inset-x-4 top-12 bottom-4"><div class="pointer-events-none sticky z-40 shrink-0 z-1!"><div class="sticky bg-token-border-light"></div></div></div><div class=""><div class="relative z-0 flex max-w-full"><div id="code-block-viewer" dir="ltr" class="q9tKkq_viewer cm-editor z-10 light:cm-light dark:cm-light flex h-full w-full flex-col items-stretch ͼd ͼr"><div class="cm-scroller"><div class="cm-content q9tKkq_readonly"><span class="ͼg">SELECT</span><br><span>  avg(qty_sold) OVER (</span><br><span>    PARTITION </span><span class="ͼg">BY</span><span> article, store, is_promo</span><br><span>    </span><span class="ͼg">ORDER</span><span> </span><span class="ͼg">BY</span><span> </span><span class="ͼg">CAST</span><span>(</span><span class="ͼm">date</span><span> </span><span class="ͼg">AS</span><span> </span><span class="ͼm">timestamp</span><span>)</span><br><span>    RANGE </span><span class="ͼg">BETWEEN</span><span> </span><span class="ͼm">INTERVAL</span><span> </span><span class="ͼj">7</span><span> DAYS PRECEDING </span><span class="ͼg">AND</span><span> </span><span class="ͼm">INTERVAL</span><span> </span><span class="ͼj">1</span><span> </span><span class="ͼg">DAY</span><span> PRECEDING</span><br><span>  ) </span><span class="ͼg">AS</span><span> rolling_avg_by_promo_7</span><br><span class="ͼg">FROM</span><span> sales</span></div></div></div></div></div></div></div></div></div><div class=""><div class=""></div></div></div></div></div></pre>
<p data-start="1348" data-end="1498">Here the aggregation happens <strong data-start="1377" data-end="1423">within each <code data-start="1391" data-end="1409">(article, store)</code> time series</strong>, but <strong data-start="1429" data-end="1497">separately for <code data-start="1446" data-end="1461">is_promo=True</code> and <code data-start="1466" data-end="1482">is_promo=False</code> observations</strong>.</p>
<hr data-start="1500" data-end="1503">
<h1 data-start="1505" data-end="1523">Proposed Feature</h1>
<p data-start="1525" data-end="1589">Introduce a <code data-start="1537" data-end="1551">partition_by</code> argument for rolling transformations.</p>
<p data-start="1591" data-end="1613">The behavior would be:</p>
<ul data-start="1615" data-end="1832">
<li data-start="1615" data-end="1671">
<p data-start="1617" data-end="1671">aggregation happens <strong data-start="1637" data-end="1671">within each <code data-start="1651" data-end="1662">unique_id</code> series</strong></p>
</li>
<li data-start="1672" data-end="1761">
<p data-start="1674" data-end="1761">but only over rows where the values of <code data-start="1713" data-end="1727">partition_by</code> columns <strong data-start="1736" data-end="1761">match the current row</strong></p>
</li>
<li data-start="1762" data-end="1832">
<p data-start="1764" data-end="1832">the rolling window still follows the <strong data-start="1801" data-end="1832">time ordering of the series</strong></p>
</li>
</ul>
<p data-start="1834" data-end="1847">Conceptually:</p>
<pre class="overflow-visible! px-0!" data-start="1849" data-end="1952"><div class="relative w-full my-4"><div class=""><div class="relative"><div class="h-full min-h-0 min-w-0"><div class="h-full min-h-0 min-w-0"><div class="border border-token-border-light border-radius-3xl corner-superellipse/1.1 rounded-3xl"><div class="h-full w-full border-radius-3xl bg-token-bg-elevated-secondary corner-superellipse/1.1 overflow-clip rounded-3xl lxnfua_clipPathFallback"><div class="pointer-events-none absolute inset-x-4 top-12 bottom-4"><div class="pointer-events-none sticky z-40 shrink-0 z-1!"><div class="sticky bg-token-border-light"></div></div></div><div class=""><div class="relative z-0 flex max-w-full"><div id="code-block-viewer" dir="ltr" class="q9tKkq_viewer cm-editor z-10 light:cm-light dark:cm-light flex h-full w-full flex-col items-stretch ͼd ͼr"><div class="cm-scroller"><div class="cm-content q9tKkq_readonly"><span>partition_by = conditional filtering inside the series</span><br><span>groupby      = aggregation across series</span></div></div></div></div></div></div></div></div></div><div class=""><div class=""></div></div></div></div></div></pre>
<hr data-start="1954" data-end="1957">
<h1 data-start="1959" data-end="1968">Example</h1>
<h3 data-start="1970" data-end="1979">Input</h3>
<pre class="overflow-visible! px-0!" data-start="1981" data-end="2243"><div class="relative w-full my-4"><div class=""><div class="relative"><div class="h-full min-h-0 min-w-0"><div class="h-full min-h-0 min-w-0"><div class="border border-token-border-light border-radius-3xl corner-superellipse/1.1 rounded-3xl"><div class="h-full w-full border-radius-3xl bg-token-bg-elevated-secondary corner-superellipse/1.1 overflow-clip rounded-3xl lxnfua_clipPathFallback"><div class="pointer-events-none absolute inset-x-4 top-12 bottom-4"><div class="pointer-events-none sticky z-40 shrink-0 z-1!"><div class="sticky bg-token-border-light"></div></div></div><div class=""><div class="relative z-0 flex max-w-full"><div id="code-block-viewer" dir="ltr" class="q9tKkq_viewer cm-editor z-10 light:cm-light dark:cm-light flex h-full w-full flex-col items-stretch ͼd ͼr"><div class="cm-scroller"><div class="cm-content q9tKkq_readonly"><span class="ͼm">df</span><span> </span><span class="ͼg">=</span><span> </span><span class="ͼm">pd</span><span class="ͼg">.</span><span>DataFrame(</span><br><span>    {</span><br><span>        </span><span class="ͼk">"unique_id"</span><span>: [</span><span class="ͼk">"a"</span><span>, </span><span class="ͼk">"a"</span><span>, </span><span class="ͼk">"a"</span><span>, </span><span class="ͼk">"a"</span><span>, </span><span class="ͼk">"b"</span><span>, </span><span class="ͼk">"b"</span><span>, </span><span class="ͼk">"b"</span><span>, </span><span class="ͼk">"b"</span><span>],</span><br><span>        </span><span class="ͼk">"ds"</span><span>: [</span><span class="ͼj">1</span><span>, </span><span class="ͼj">2</span><span>, </span><span class="ͼj">3</span><span>, </span><span class="ͼj">4</span><span>, </span><span class="ͼj">1</span><span>, </span><span class="ͼj">2</span><span>, </span><span class="ͼj">3</span><span>, </span><span class="ͼj">4</span><span>],</span><br><span>        </span><span class="ͼk">"y"</span><span>: [</span><span class="ͼj">1</span><span>, </span><span class="ͼj">2</span><span>, </span><span class="ͼj">3</span><span>, </span><span class="ͼj">4</span><span>, </span><span class="ͼj">10</span><span>, </span><span class="ͼj">20</span><span>, </span><span class="ͼj">30</span><span>, </span><span class="ͼj">40</span><span>],</span><br><span>        </span><span class="ͼk">"promo"</span><span>: [</span><span class="ͼj">True</span><span>, </span><span class="ͼj">True</span><span>, </span><span class="ͼj">False</span><span>, </span><span class="ͼj">True</span><span>, </span><span class="ͼj">False</span><span>, </span><span class="ͼj">True</span><span>, </span><span class="ͼj">False</span><span>, </span><span class="ͼj">True</span><span>],</span><br><span>    }</span><br><span>)</span></div></div></div></div></div></div></div></div></div><div class=""><div class=""></div></div></div></div></div></pre>
<h3 data-start="2245" data-end="2263">Transformation</h3>
<pre class="overflow-visible! px-0!" data-start="2265" data-end="2531"><div class="relative w-full my-4"><div class=""><div class="relative"><div class="h-full min-h-0 min-w-0"><div class="h-full min-h-0 min-w-0"><div class="border border-token-border-light border-radius-3xl corner-superellipse/1.1 rounded-3xl"><div class="h-full w-full border-radius-3xl bg-token-bg-elevated-secondary corner-superellipse/1.1 overflow-clip rounded-3xl lxnfua_clipPathFallback"><div class="pointer-events-none absolute inset-x-4 top-12 bottom-4"><div class="pointer-events-none sticky z-40 shrink-0 z-1!"><div class="sticky bg-token-border-light"></div></div></div><div class=""><div class="relative z-0 flex max-w-full"><div id="code-block-viewer" dir="ltr" class="q9tKkq_viewer cm-editor z-10 light:cm-light dark:cm-light flex h-full w-full flex-col items-stretch ͼd ͼr"><div class="cm-scroller"><div class="cm-content q9tKkq_readonly"><span class="ͼm">tfm</span><span> </span><span class="ͼg">=</span><span> </span><span class="ͼm">RollingMean</span><span>(</span><span class="ͼj">3</span><span>, </span><span class="ͼm">min_samples</span><span class="ͼg">=</span><span class="ͼj">1</span><span>, </span><span class="ͼm">partition_by</span><span class="ͼg">=</span><span>[</span><span class="ͼk">"promo"</span><span>])</span><br><br><span class="ͼm">ts</span><span> </span><span class="ͼg">=</span><span> </span><span class="ͼm">TimeSeries</span><span>(</span><span class="ͼm">freq</span><span class="ͼg">=</span><span class="ͼj">1</span><span>, </span><span class="ͼm">lag_transforms</span><span class="ͼg">=</span><span>{</span><span class="ͼj">1</span><span>: [</span><span class="ͼm">tfm</span><span>]})</span><br><br><span class="ͼm">prep</span><span> </span><span class="ͼg">=</span><span> </span><span class="ͼm">ts</span><span class="ͼg">.</span><span>fit_transform(</span><br><span>    </span><span class="ͼm">df</span><span>,</span><br><span>    </span><span class="ͼm">id_col</span><span class="ͼg">=</span><span class="ͼk">"unique_id"</span><span>,</span><br><span>    </span><span class="ͼm">time_col</span><span class="ͼg">=</span><span class="ͼk">"ds"</span><span>,</span><br><span>    </span><span class="ͼm">target_col</span><span class="ͼg">=</span><span class="ͼk">"y"</span><span>,</span><br><span>    </span><span class="ͼm">dropna</span><span class="ͼg">=</span><span class="ͼj">False</span><span>,</span><br><span>    </span><span class="ͼm">static_features</span><span class="ͼg">=</span><span>[],</span><br><span>)</span></div></div></div></div></div></div></div></div></div><div class=""><div class=""></div></div></div></div></div></pre>
<h3 data-start="2533" data-end="2552">Expected Output</h3>
<pre class="overflow-visible! px-0!" data-start="2554" data-end="2755"><div class="relative w-full my-4"><div class=""><div class="relative"><div class="h-full min-h-0 min-w-0"><div class="h-full min-h-0 min-w-0"><div class="border border-token-border-light border-radius-3xl corner-superellipse/1.1 rounded-3xl"><div class="h-full w-full border-radius-3xl bg-token-bg-elevated-secondary corner-superellipse/1.1 overflow-clip rounded-3xl lxnfua_clipPathFallback"><div class="pointer-events-none absolute inset-x-4 top-12 bottom-4"><div class="pointer-events-none sticky z-40 shrink-0 z-1!"><div class="sticky bg-token-border-light"></div></div></div><div class=""><div class="relative z-0 flex max-w-full"><div id="code-block-viewer" dir="ltr" class="q9tKkq_viewer cm-editor z-10 light:cm-light dark:cm-light flex h-full w-full flex-col items-stretch ͼd ͼr"><div class="cm-scroller"><div class="cm-content q9tKkq_readonly"><span class="ͼm">expected_by_key</span><span> </span><span class="ͼg">=</span><span> {</span><br><span>    (</span><span class="ͼk">"a"</span><span>, </span><span class="ͼj">1</span><span>): </span><span class="ͼm">np</span><span class="ͼg">.</span><span>nan,</span><br><span>    (</span><span class="ͼk">"a"</span><span>, </span><span class="ͼj">2</span><span>): </span><span class="ͼj">1.0</span><span>,</span><br><span>    (</span><span class="ͼk">"a"</span><span>, </span><span class="ͼj">3</span><span>): </span><span class="ͼm">np</span><span class="ͼg">.</span><span>nan,</span><br><span>    (</span><span class="ͼk">"a"</span><span>, </span><span class="ͼj">4</span><span>): </span><span class="ͼj">1.5</span><span>,</span><br><span>    (</span><span class="ͼk">"b"</span><span>, </span><span class="ͼj">1</span><span>): </span><span class="ͼm">np</span><span class="ͼg">.</span><span>nan,</span><br><span>    (</span><span class="ͼk">"b"</span><span>, </span><span class="ͼj">2</span><span>): </span><span class="ͼm">np</span><span class="ͼg">.</span><span>nan,</span><br><span>    (</span><span class="ͼk">"b"</span><span>, </span><span class="ͼj">3</span><span>): </span><span class="ͼj">10.0</span><span>,</span><br><span>    (</span><span class="ͼk">"b"</span><span>, </span><span class="ͼj">4</span><span>): </span><span class="ͼj">20.0</span><span>,</span><br><span>}</span></div></div></div></div></div></div></div></div></div><div class=""><div class=""></div></div></div></div></div></pre>

<hr data-start="3068" data-end="3071">
<h1 data-start="3073" data-end="3085">Motivation</h1>
<p data-start="3087" data-end="3152">This functionality is very useful for many forecasting scenarios:</p>
<ul data-start="3154" data-end="3332">
<li data-start="3154" data-end="3184">
<p data-start="3156" data-end="3184"><strong data-start="3156" data-end="3184">promotion-aware features</strong></p>
</li>
<li data-start="3185" data-end="3216">
<p data-start="3187" data-end="3216"><strong data-start="3187" data-end="3216">price regime segmentation</strong></p>
</li>
<li data-start="3217" data-end="3254">
<p data-start="3219" data-end="3254"><strong data-start="3219" data-end="3254">holiday vs non-holiday patterns</strong></p>
</li>
<li data-start="3255" data-end="3291">
<p data-start="3257" data-end="3291"><strong data-start="3257" data-end="3291">weather condition segmentation</strong></p>
</li>
<li data-start="3292" data-end="3332">
<p data-start="3294" data-end="3332"><strong data-start="3294" data-end="3332">state-dependent rolling statistics</strong></p>
</li>
</ul>
<p data-start="3334" data-end="3418">Currently these features require <strong data-start="3367" data-end="3410">manual preprocessing with pandas or SQL</strong>, which:</p>
<ul data-start="3420" data-end="3562">
<li data-start="3420" data-end="3467">
<p data-start="3422" data-end="3467">duplicates logic outside the feature pipeline</p>
</li>
<li data-start="3468" data-end="3516">
<p data-start="3470" data-end="3516">prevents reuse of <code data-start="3488" data-end="3500">mlforecast</code> transformations</p>
</li>
<li data-start="3517" data-end="3562">
<p data-start="3519" data-end="3562">complicates reproducibility and backtesting</p>
</li>
</ul>
<p data-start="3564" data-end="3711">Supporting <code data-start="3575" data-end="3589">partition_by</code> natively would allow users to define these features <strong data-start="3642" data-end="3677">directly in lag transformations</strong>, consistent with the current API.</p>
<hr data-start="3713" data-end="3716">
<h1 data-start="3718" data-end="3733">Suggested API</h1>
<p data-start="3735" data-end="3743">Example:</p>
<pre class="overflow-visible! px-0!" data-start="3745" data-end="3838"><div class="relative w-full my-4"><div class=""><div class="relative"><div class="h-full min-h-0 min-w-0"><div class="h-full min-h-0 min-w-0"><div class="border border-token-border-light border-radius-3xl corner-superellipse/1.1 rounded-3xl"><div class="h-full w-full border-radius-3xl bg-token-bg-elevated-secondary corner-superellipse/1.1 overflow-clip rounded-3xl lxnfua_clipPathFallback"><div class="pointer-events-none absolute inset-x-4 top-12 bottom-4"><div class="pointer-events-none sticky z-40 shrink-0 z-1!"><div class="sticky bg-token-border-light"></div></div></div><div class=""><div class="relative z-0 flex max-w-full"><div id="code-block-viewer" dir="ltr" class="q9tKkq_viewer cm-editor z-10 light:cm-light dark:cm-light flex h-full w-full flex-col items-stretch ͼd ͼr"><div class="cm-scroller"><div class="cm-content q9tKkq_readonly"><span class="ͼm">RollingMean</span><span>(</span><br><span>    </span><span class="ͼm">window_size</span><span class="ͼg">=</span><span class="ͼj">7</span><span>,</span><br><span>    </span><span class="ͼm">min_samples</span><span class="ͼg">=</span><span class="ͼj">1</span><span>,</span><br><span>    </span><span class="ͼm">partition_by</span><span class="ͼg">=</span><span>[</span><span class="ͼk">"promo"</span><span>]</span><br><span>)</span></div></div></div></div></div></div></div></div></div><div class=""><div class=""></div></div></div></div></div></pre>
<p data-start="3840" data-end="3867">Possible interaction rules:</p>
<ul data-start="3869" data-end="3997">
<li data-start="3869" data-end="3917">
<p data-start="3871" data-end="3917"><code data-start="3871" data-end="3885">partition_by</code> operates <strong data-start="3895" data-end="3917">within <code data-start="3904" data-end="3915">unique_id</code></strong></p>
</li>
<li data-start="3918" data-end="3964">
<p data-start="3920" data-end="3964"><code data-start="3920" data-end="3929">groupby</code> aggregates <strong data-start="3941" data-end="3964">across <code data-start="3950" data-end="3961">unique_id</code>s</strong></p>
</li>
<li data-start="3965" data-end="3997">
<p data-start="3967" data-end="3997">both could potentially coexist</p>
</li>
</ul>
<p data-start="3999" data-end="4007">Example:</p>
<pre class="overflow-visible! px-0!" data-start="4009" data-end="4080"><div class="relative w-full my-4"><div class=""><div class="relative"><div class="h-full min-h-0 min-w-0"><div class="h-full min-h-0 min-w-0"><div class="border border-token-border-light border-radius-3xl corner-superellipse/1.1 rounded-3xl"><div class="h-full w-full border-radius-3xl bg-token-bg-elevated-secondary corner-superellipse/1.1 overflow-clip rounded-3xl lxnfua_clipPathFallback"><div class="pointer-events-none absolute inset-x-4 top-12 bottom-4"><div class="pointer-events-none sticky z-40 shrink-0 z-1!"><div class="sticky bg-token-border-light"></div></div></div><div class=""><div class="relative z-0 flex max-w-full"><div id="code-block-viewer" dir="ltr" class="q9tKkq_viewer cm-editor z-10 light:cm-light dark:cm-light flex h-full w-full flex-col items-stretch ͼd ͼr"><div class="cm-scroller"><div class="cm-content q9tKkq_readonly"><span class="ͼm">RollingMean</span><span>(</span><span class="ͼj">7</span><span>, </span><span class="ͼm">partition_by</span><span class="ͼg">=</span><span>[</span><span class="ͼk">"promo"</span><span>], </span><span class="ͼm">groupby</span><span class="ͼg">=</span><span>[</span><span class="ͼk">"brand"</span><span>])</span></div></div></div></div></div></div></div></div></div><div class=""><div class=""></div></div></div></div></div></pre>
<p data-start="4082" data-end="4146">(though this interaction may require further design discussion).</p>
<hr data-start="4148" data-end="4151">
<h1 data-start="4153" data-end="4162">Summary</h1>
<p data-start="4164" data-end="4332">Adding <code data-start="4171" data-end="4185">partition_by</code> would enable <strong data-start="4199" data-end="4257">SQL-style window semantics inside a single time series</strong>, which is a common requirement in retail and demand forecasting use cases.</p>
<p data-start="4334" data-end="4496">This would significantly expand the feature engineering capabilities of <code data-start="4406" data-end="4418">mlforecast</code> while keeping the API consistent with the existing lag transformation design.</p>
</body>
</html>

### Use case

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: partition_by support for window aggregations #587

Description

Description

Proposed Feature

Example

Input

Transformation

Expected Output

Motivation

Suggested API

Summary

Use case

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature Request: partition_by support for window aggregations #587

Description

Description

Description

Proposed Feature

Example

Input

Transformation

Expected Output

Motivation

Suggested API

Summary

Use case

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions