Publications
Found 28 results
Filters: Author is Felix Wolf [Clear All Filters]
“Accelerating Brain Simulations with the Fast Multipole Method”, in Proceedings of the 28th International Conference on Parallel and Distributed Computing (Euro-Par), 2022, pp. 22-26.
, “Editorial of Special Issue: Keeping up with technology: Teaching parallel, distributed, and highperformance computing”, Journal of Parallel and Distributed Computing, vol. 160. Elsevier, pp. 36-38, 2022.
, “ElastiSim: A Batch-System Simulator for Malleable Workloads”, Proceedings of the 51st International Conference on Parallel Processing (ICPP). ACM (to appear, Link), 2022.
, “Multi-objective Hybrid Autoscaling of Microservices in Kubernetes Clusters”, in Proceedings of the 28th International Conference on Parallel and Distributed Computing (Euro-Par), 2022, pp. 233-250.
, “Benchmarking, Measuring, and Optimizing”, Proceedings of the 3rd BenchCouncil International Symposium (Bench). Springer, pp. 1-268, 2021.
, “Design-Time Performance Modeling of Compositional Parallel Programs”, Parallel Computing, vol. 108, pp. 1-12, 2021.
, “Extracting Clean Performance Models from Tainted Programs”, in Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2021, pp. 403-417.
, “Influence of Noisy Environments on Behavior of HPC Applications”, Lobachevskii Journal of Mathematics, vol. 42, pp. 1560-1570, 2021.
, “Learning to Make Compiler Optimizations More Effective”, in Proceedings of the 5th ACM SIGPLAN International Symposium on Machine Programming (MAPS), 2021, pp. 9-20.
, “Noise-Resilient Empirical Performance Modeling with Deep Neural Networks”, in Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2021, pp. 23-34.
, “Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization”, in Proceedings of the 50th International Conference on Parallel Processing (ICPP), 2021, pp. 1-10.
, “Accelerating winograd convolutions using symbolic computation and meta-programming”, in Proceedings of the 15th European Conference on Computer Systems (EuroSys), 2020, pp. 1-14.
, “Dynamic Multi-objective Scheduling of Microservices in the Cloud”, in Proceedings of the 13th International Conference on Utility and Cloud Computing (UCC), 2020, pp. 386-393.
, “Efficient Ephemeris Models for Spacecraft Trajectory Simulations on GPUs”, in Proceedings of the 26th European Conference on Parallel and Distributed Computing (Euro-Par), 2020, pp. 561-577.
, “Empirical Modeling of Spatially Diverging Performance”, in Proceedings the 2020 IEEE/ACM International Workshop on HPC User Support Tools (HUST) and Workshop on Programming and Performance Visualization Tools (ProTools), 2020, pp. 71-80.
, “ExtraPeak: Advanced Automatic Performance Modeling for HPC Applications”, Software for Exascale Computing - SPPEXA 2016-2019, LNCSE 136. Springer, pp. 453-482, 2020.
, “Learning Cost-Effective Sampling Strategies for Empirical Performance Modeling”, in Proceedings of the 34th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020, pp. 884-895.
, “Safer Parallelization”, in Proceedings of the 9th International Symposium on Leveraging Applications of Formal Methods (ISoLA), 2020, pp. 117-137.
, “Skipping Non-essential Instructions Makes Data-Dependence Profiling Faster”, in Proceedings of the 26th International Conference on Parallel and Distributed Computing (Euro-Par), 2020, pp. 3-17.
, “Static Neural Compiler Optimization via Deep Reinforcement Learning”, in Proceedings of the 6th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC), 2020, pp. 1-11.
, “Accelerating Data-Dependence Profiling with Static Hints”, in Proceedings of the 25th International Conference on Parallel and Distributed Computing (Euro-Par), 2019, pp. 17-28.
, “Automatic Construct Selection and Variable Classification in OpenMP”, in Proceedings of the 33rd International Conference on Supercomputing (ICS), 2019, pp. 330-341.
, “Automatic Instrumentation Refinement for Empirical Performance Modeling”, in Proceedings of the 2019 IEEE/ACM International Workshop on Programming and Performance Visualization Tools (ProTools), 2019, pp. 40-47.
, “Designing Efficient Parallel Software via Compositional Performance Modeling”, in Proceedings of the Workshop on Programming and Performance Visualization Tools (ProTools), 2019, pp. 17-24.
, “Enhancing the Programmability and Performance Portability of GPU Tensor Operations”, in Proceedings of the 25th European Conference on Parallel and Distributed Computing (Euro-Par), 2019, pp. 213-226.
, “The Art of Getting Deep Neural Networks in Shape”, ACM Transactions on Architecture and Code Optimization, vol. 15, no. 4, pp. 62:1-62:21, 2019.
, “Estimating the Impact of External Interference on Application Performance”, in Proceedings of the 24th European Conference on Parallel and Distributed Computing (Euro-Par), 2018, pp. 46-58.
, “Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics”, in Proceedings of the 47th International Conference on Parallel Processing (ICPP), 2018, pp. 1-10.
,