Optimizing Three-Dimensional Stencil-Operations on Heterogeneous Computing Environments
PragFormer: Data-driven Parallel Source Code Classification with Transformers
PAF-FHE: Low-Cost Accurate Non-Polynomial Operator Polynomial Approximation in Fully Homomorphic Encryption Based ML Inference
A Practical Approach for Employing Tensor Train Decomposition in Edge Devices
ControlPULP: A RISC-V On-Chip Parallel Power Controller for Many-Core HPC Processors with FPGA-Based Hardware-In-The-Loop Power and Thermal Emulation
On Design, Cost and Reliability Analysis of a Fault-Tolerant Multistage Interconnection Network Layout with Six Disjoint Paths
Interruptible Nodes: Reducing Queueing Costs in Irregular Streaming Dataflow Applications on Wide-SIMD Architectures
Generic Exact Combinatorial Search at HPC Scale
Declarative Data Flow in a Graph-Based Distributed Memory Runtime System
Learn more about International Journal of Parallel Programming