Publications
Found 28 results
Filters: Autor is Felix Wolf [Clear All Filters]
„Accelerating Brain Simulations with the Fast Multipole Method“, in Proceedings of the 28th International Conference on Parallel and Distributed Computing (Euro-Par), 2022, S. 22-26.
, „Editorial of Special Issue: Keeping up with technology: Teaching parallel, distributed, and highperformance computing“, Journal of Parallel and Distributed Computing, Bd. 160. Elsevier, S. 36-38, 2022.
, „ElastiSim: A Batch-System Simulator for Malleable Workloads“, Proceedings of the 51st International Conference on Parallel Processing (ICPP). ACM (to appear, Link), 2022.
, „Multi-objective Hybrid Autoscaling of Microservices in Kubernetes Clusters“, in Proceedings of the 28th International Conference on Parallel and Distributed Computing (Euro-Par), 2022, S. 233-250.
, „Benchmarking, Measuring, and Optimizing“, Proceedings of the 3rd BenchCouncil International Symposium (Bench). Springer, S. 1-268, 2021.
, „Design-Time Performance Modeling of Compositional Parallel Programs“, Parallel Computing, Bd. 108, S. 1-12, 2021.
, „Extracting Clean Performance Models from Tainted Programs“, in Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2021, S. 403-417.
, „Influence of Noisy Environments on Behavior of HPC Applications“, Lobachevskii Journal of Mathematics, Bd. 42, S. 1560-1570, 2021.
, „Learning to Make Compiler Optimizations More Effective“, in Proceedings of the 5th ACM SIGPLAN International Symposium on Machine Programming (MAPS), 2021, S. 9-20.
, „Noise-Resilient Empirical Performance Modeling with Deep Neural Networks“, in Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2021, S. 23-34.
, „Tool-Supported Mini-App Extraction to Facilitate Program Analysis and Parallelization“, in Proceedings of the 50th International Conference on Parallel Processing (ICPP), 2021, S. 1-10.
, „Accelerating winograd convolutions using symbolic computation and meta-programming“, in Proceedings of the 15th European Conference on Computer Systems (EuroSys), 2020, S. 1-14.
, „Dynamic Multi-objective Scheduling of Microservices in the Cloud“, in Proceedings of the 13th International Conference on Utility and Cloud Computing (UCC), 2020, S. 386-393.
, „Efficient Ephemeris Models for Spacecraft Trajectory Simulations on GPUs“, in Proceedings of the 26th European Conference on Parallel and Distributed Computing (Euro-Par), 2020, S. 561-577.
, „Empirical Modeling of Spatially Diverging Performance“, in Proceedings the 2020 IEEE/ACM International Workshop on HPC User Support Tools (HUST) and Workshop on Programming and Performance Visualization Tools (ProTools), 2020, S. 71-80.
, „ExtraPeak: Advanced Automatic Performance Modeling for HPC Applications“, Software for Exascale Computing - SPPEXA 2016-2019, LNCSE 136. Springer, S. 453-482, 2020.
, „Learning Cost-Effective Sampling Strategies for Empirical Performance Modeling“, in Proceedings of the 34th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020, S. 884-895.
, „Safer Parallelization“, in Proceedings of the 9th International Symposium on Leveraging Applications of Formal Methods (ISoLA), 2020, S. 117-137.
, „Skipping Non-essential Instructions Makes Data-Dependence Profiling Faster“, in Proceedings of the 26th International Conference on Parallel and Distributed Computing (Euro-Par), 2020, S. 3-17.
, „Static Neural Compiler Optimization via Deep Reinforcement Learning“, in Proceedings of the 6th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC), 2020, S. 1-11.
, „Accelerating Data-Dependence Profiling with Static Hints“, in Proceedings of the 25th International Conference on Parallel and Distributed Computing (Euro-Par), 2019, S. 17-28.
, „Automatic Construct Selection and Variable Classification in OpenMP“, in Proceedings of the 33rd International Conference on Supercomputing (ICS), 2019, S. 330-341.
, „Automatic Instrumentation Refinement for Empirical Performance Modeling“, in Proceedings of the 2019 IEEE/ACM International Workshop on Programming and Performance Visualization Tools (ProTools), 2019, S. 40-47.
, „Designing Efficient Parallel Software via Compositional Performance Modeling“, in Proceedings of the Workshop on Programming and Performance Visualization Tools (ProTools), 2019, S. 17-24.
, „Enhancing the Programmability and Performance Portability of GPU Tensor Operations“, in Proceedings of the 25th European Conference on Parallel and Distributed Computing (Euro-Par), 2019, S. 213-226.
, „The Art of Getting Deep Neural Networks in Shape“, ACM Transactions on Architecture and Code Optimization, Bd. 15, Nr. 4, S. 62:1-62:21, 2019.
, „Estimating the Impact of External Interference on Application Performance“, in Proceedings of the 24th European Conference on Parallel and Distributed Computing (Euro-Par), 2018, S. 46-58.
, „Unveiling Thread Communication Bottlenecks Using Hardware-Independent Metrics“, in Proceedings of the 47th International Conference on Parallel Processing (ICPP), 2018, S. 1-10.
,