Commit 704f309
João Felipe Santos
Remove redundant manual loop unrolling from activations and element-wise ops
ARM assembly analysis (-O2 -DNDEBUG) confirmed:
- GCC auto-unrolls simple activation loops; manual 4-wide gives no benefit
- expf() serializes sigmoid/SiLU; unrolling can't help
- Eigen element-wise ops (.leftCols + .leftCols) produce identical codegen
to raw float* loops when assertions are disabled
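To illustrate the first point, here is a minimal sketch of the two loop shapes being compared. The function names `relu_simple` and `relu_unrolled4` are hypothetical and not taken from the repository. The sketch only shows that the two forms are numerically equivalent; whether a given compiler emits the same machine code for both depends on the compiler version and flags, so the assembly should be checked as the commit describes.

```cpp
#include <cstddef>

// Hypothetical helper, named after the "relu" helper mentioned in this commit.
inline float relu(float x) { return x > 0.0f ? x : 0.0f; }

// Plain loop: the optimizer is free to unroll or vectorize this on its own.
void relu_simple(float* data, std::size_t n)
{
  for (std::size_t i = 0; i < n; ++i)
    data[i] = relu(data[i]);
}

// Manual 4-wide unrolling with a scalar tail for the remaining elements.
// This is the style of code the commit removes for simple activations.
void relu_unrolled4(float* data, std::size_t n)
{
  std::size_t i = 0;
  for (; i + 3 < n; i += 4)
  {
    data[i] = relu(data[i]);
    data[i + 1] = relu(data[i + 1]);
    data[i + 2] = relu(data[i + 2]);
    data[i + 3] = relu(data[i + 3]);
  }
  for (; i < n; ++i)
    data[i] = relu(data[i]);
}
```

Both functions leave the buffer in the same state, so the shorter one can replace the longer one wherever the generated code is confirmed to be no slower.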
Simplify 5 activation classes to use inline helpers (relu, sigmoid, etc.)
and revert 3 wavenet element-wise operations back to Eigen expressions.
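A simplified activation class built on inline helpers might look like the sketch below. The struct names and the `apply(float*, std::size_t)` signature are assumptions made for illustration and may not match the repository's actual classes. Each element needs its own exponential call, which is the cost the commit message says unrolling cannot hide.

```cpp
#include <cmath>
#include <cstddef>

// Hypothetical inline helpers in the spirit of "relu, sigmoid, etc.".
inline float sigmoid(float x) { return 1.0f / (1.0f + std::exp(-x)); }
inline float silu(float x) { return x * sigmoid(x); }

// Assumed class shape: one plain loop per activation, no manual unrolling.
struct ActivationSigmoid
{
  void apply(float* data, std::size_t size) const
  {
    for (std::size_t i = 0; i < size; ++i)
      data[i] = sigmoid(data[i]);
  }
};

struct ActivationSiLU
{
  void apply(float* data, std::size_t size) const
  {
    for (std::size_t i = 0; i < size; ++i)
      data[i] = silu(data[i]);
  }
};
```

Keeping the math in the helpers means each class body is a single loop, which is where most of the removed lines in a change like this would come from.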
Inline GEMM (Conv1x1/Conv1D), depthwise unrolling, FiLM unrolling,
bias broadcast, and memcpy optimizations are retained — those show
measurable wins on both desktop and Cortex-M7.
Also restored comments that were accidentally removed from wavenet.h.
1 parent 5d9ed6c
4 files changed: +15 -177 lines changed