Architecture-Aware Synthesis of Fused Linear Algebra Kernels
- Authors
-
-
Shlok Shah
Author
-
- Keywords:
- Compiler autotuning, Loop fusion, Linear algebra kernels, Performance portability, Empirical optimization
- Abstract
-
High-performance code generation from high-level array-oriented prototypes remains challenging due to the tight coupling between memory hierarchy behavior, loop organization, and platform compilers in modern architectures. This paper presents a compiler autotuner pipeline that lowers numerical kernels from a high-level prototype to C, applies loop restructuring for parallelism and locality, and then performs empirical search over transformations such as fusion, unroll-and-jam, vectorization, and alignment to select architecture-optimal variants. Central to the approach is composing multiple dense operations into a single fused kernel to minimize data movement and improve cache residency compared to sequential library calls. An annotation-driven tuning workflow systematically explores implementation choices and search heuristics to realize portable performance across target systems without manual rewrite cycles. Evaluations on representative kernels (e.g., VADD, ATAX, GEMVER, GESUMMV, BiCG) show consistent speedups over baseline C and vendor-tuned libraries, while also highlighting cases where library baselines remain competitive, motivating hybrid generation strategies. The results highlight a systems-centric co-design of compiler analysis and empirical tuning. The abstract should clearly state the research problem, objectives, theoretical grounding, methodology, key findings, and principal managerial implications. Emphasis should be placed on the relevance of the study to adaptive management practices and business intelligence–driven decision-making. Avoid citations, undefined acronyms, and excessive technical detail.
- References
- Downloads
- Published
- 2026-06-01
- Issue
- Vol. 1 No. 2 (2026)
- Section
- Articles
- License
-
Copyright (c) 2026 International Journal of Intelligent Systems and Data Science

This work is licensed under a Creative Commons Attribution 4.0 International License.
