
Autoresearch/mar27 #8

Draft

sandropapais wants to merge 143 commits into main from autoresearch/mar27

Conversation

@sandropapais
Contributor

No description provided.

plan_loss_up: L2=0.5902 (Δ-0.0009 vs baseline), obj_box_col=0.084% (Δ+0.004%)
Marginal L2 gain; plan loss weight increase not a strong lever.
exp002: motion loss 0.2→0.5 badly hurt both L2+col (discard)
exp004 staged: num_decoder 6→8 for richer instance features
queue_length=6: L2=0.5757 (best, Δ-0.0154 vs baseline)
obj_box_col=0.100% (slightly worse than baseline)
Stage exp005: queue6 + plan_loss_up combination
… revised

exp004 decoder=8: obj_box_col=0.074% (best), L2=0.5955 (slightly worse)
exp005 revised: queue=6 + decoder=8 combo (drop queue6+plan_loss_up)
exp005 queue6+decoder8: L2=0.5934, col=0.126% (negative synergy)
Needs >10 epochs to benefit from combined capacity.

Best results:
- L2: 0.5757 (exp003 queue_length=6, -2.6% vs baseline)
- col: 0.074% (exp004 num_decoder=8, -7.5% vs baseline)
Move research_log.md, results.tsv, docs/research_review.md → autoresearch/
…57), num_decoder=8 best col (0.074%)

- autoresearch/ folder: research_log.md, results.tsv, research_review.md
- scripts/dgx_run.sh: source ~/.bashrc fix for WANDB_API_KEY
- 5 new experiment configs in projects/configs/auto_mar25_*
queue_length=6 as proper named config (from autoresearch mar25 exp003,
best L2=0.5757)
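A minimal sketch of what promoting the ad-hoc override to a named config might look like, assuming an mmdet-style config with `_base_` inheritance; the base path and dict nesting are assumptions, not the repo's actual layout:

```python
# Hypothetical named config for the queue_length=6 variant. The base-config
# path and the model dict structure are assumptions about the repo's
# mmdet-style config layout, not taken from its actual files.
_base_ = ["./sparsedrive_r50_stage2_4gpu_bs24.py"]

queue_length = 6  # mar25 exp003: best L2 = 0.5757 (base config uses 4)

model = dict(
    head=dict(
        instance_queue=dict(queue_length=queue_length),
    ),
)
```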
…mar25 constraints

- baseline run (exp000) always submitted first, doesn't count against max-experiments
- sbatch now uses --export=ALL,WANDB_API_KEY=... to avoid WandB auth failure
- hard constraints: no motion loss increase, no rotation augmentation
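The baseline-first budgeting rule can be sketched as follows; `build_queue` and its arguments are illustrative names, not the actual submission script:

```python
# Sketch of the submission rule above: exp000 (the baseline) is always
# queued first and is excluded from the --max-experiments budget.
# The real script presumably wraps sbatch calls; names are illustrative.
def build_queue(experiments, max_experiments):
    baseline = [e for e in experiments if e == "exp000"]
    others = [e for e in experiments if e != "exp000"]
    return baseline + others[:max_experiments]
```

So with `max_experiments=2`, exp000 plus the first two other experiments are submitted.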
Run base config (nomap_queue6) as reproducible reference for mar26 session.
exp001: extended training 10→15 epochs (B2 bottleneck)
exp002: plan_loss_reg 1→2, plan_loss_cls 0.5→1 on queue=6 base
exp003: num_det 50→100 (more agents to planner)
exp004: confidence_decay 0.6→0.8 (slower forgetting)
exp005: combo of epochs15 + plan_loss_up
Baseline (nomap_queue6): L2=0.5927, obj_box_col=0.104%, NDS=0.5236
epochs 10→15: L2=0.5738 (-0.019), obj_box_col=0.099% (-0.005pp) — new best both metrics
plan_loss_up: obj_box_col=0.094% (new best), L2=0.5904 (behind exp001's 0.5738)
num_det 50→100: L2=0.5676 (new best), obj_box_col=0.091% (new best)
exp005 updated: epochs15 + plan_loss_up + num_det=100 (all confirmed wins)
confidence_decay 0.6→0.8: FAF +18, obj_box_col 0.120% (worse than baseline 0.104%)
More false alarms from stale instances confuse the planner.
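The mechanism implied by the confidence_decay knob can be sketched as a per-frame multiplicative update; function and variable names here are assumptions, not the repo's code:

```python
# Illustrative per-frame confidence update: cached instance scores are
# multiplied by the decay factor each frame, so raising it from 0.6 to 0.8
# keeps stale instances alive longer. Names are assumptions.
def decay_confidences(confidences, confidence_decay):
    return [c * confidence_decay for c in confidences]
```

After one frame, a 1.0-confidence cached instance drops to 0.6 at the old setting but only to 0.8 at the new one, consistent with the observed rise in false alarms.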
exp005 (epochs15+plan_up+det100): L2=0.6109, col=0.107% — negative synergy discard

Session best: exp003 num_det=100 — L2=0.5676, col=0.091% (both new bests)

research_review: add mar26 findings section, update confirmed wins,
negative evidence, best recipe, next experiments, open questions
Base: sparsedrive_r50_stage2_4gpu_bs24.py (WITH map, queue=4)
exp001: num_det 50→100
exp002: queue_length 4→6
exp003: nomap + plan_loss_up combo
exp004: epochs 10→15
exp005: epochs15 + num_det=100
Baseline (sparsedrive_r50_stage2_4gpu_bs24, with map): L2=0.6274, obj_box_col=0.107%, NDS=0.5233, mAP_normal=0.5508
num_det=100: L2=0.6159 (-1.8%), col=0.091% (-15%), IDS 990→577 (-42%)
queue_length=6 hurts with map head active: col worsens 0.091%→0.147%, IDS stays at 995.
Hard constraint: do NOT increase queue_length when map head is active.
Skip cross_gnn attention layers when map_output is None (with_map=False).
The bs24 base config includes cross_gnn in operation_order; overriding
with_map=False disables map output but the operation_order still references
map_instance_feature_selected, causing UnboundLocalError at training start.
… config
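A minimal sketch of the guard, assuming a decoder loop that dispatches on operation_order; the loop body and names are illustrative, not the actual SparseDrive implementation. Without the early `continue`, the cross_gnn branch would reference `map_instance_feature_selected` before it was ever assigned, reproducing the UnboundLocalError:

```python
# Illustrative decoder loop with the cross_gnn guard described above.
def run_ops(operation_order, instance_feature, map_output=None):
    for op in operation_order:
        if op == "cross_gnn":
            if map_output is None:  # with_map=False: nothing to attend to
                continue
            map_instance_feature_selected = map_output["features"]
            instance_feature = instance_feature + map_instance_feature_selected
        # ... other ops elided ...
    return instance_feature
```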

Skipping cross_gnn when map_output is None leaves map head params without
gradients in DDP, causing reduction error. find_unused_parameters=True fixes.
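In mmdet-style training this is usually a one-line config change that the runner forwards to `torch.nn.parallel.DistributedDataParallel`; the top-level placement is an assumption about this repo's setup:

```python
# Forwarded by the training runner to DistributedDataParallel. Needed
# because the skipped cross_gnn op leaves the map head's parameters
# without gradients, which the DDP reducer otherwise treats as an error.
find_unused_parameters = True
```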
nomap via with_map=False override on bs24 base is DDP-incompatible.
Map head module stays registered with params; neither find_unused_parameters
nor cross_gnn guard resolves PyTorch 1.13 mark-ready-twice error.
Config updated to plan_loss_up alone (no nomap) for resubmission.
plan_loss_up alone: L2=0.6223 (beats baseline), col=0.110% (slightly worse than baseline).
num_det=100 remains the strongest single lever. Moving to epochs=15.
epochs=15 hurts on bs24 with-map: L2=0.635, col=0.143% (worse than baseline).
Map head gradient competition amplifies with more training.
Hard constraint added. exp005 changed to num_det=100 + plan_loss_up combo.
exp005 (num_det=100 + plan_loss_up): L2=0.650, col=0.125% — negative synergy,
worst in session. Best result remains exp001 (num_det=100): L2=0.616, col=0.091%.

Key findings:
- bs24 with-map is brittle: only num_det=100 reliably improves it
- queue=6, epochs=15, plan_loss_up, and combinations all degrade performance
- with_map=False override is DDP-incompatible (PyTorch 1.13)
- num_det=100 confirmed universal win across 2 configs

Updated research_review: new 3.13 section, updated confirmed wins/negative
evidence tables, updated best recipe and next experiments.