Publications
2023
- [IEEE HPEC] FAST-CON: a Multi-source Approach for Efficient ST Connectivity on Sparse Graphs. IEEE High Performance Extreme Computing, 2023
2020
- [OSA] Efficient Implementation of the Shack-Hartmann Centroid Extraction for Edge Computing. Journal of the Optical Society of America (OSA), 2020
2019
- [IEEE HPCS] Configuring Graph Traversal Applications for GPUs: Analysis and Correlation of Implementation Strategies with Graph Characteristics. IEEE High Performance Computing Systems Conference, 2019
- [IEEE TCOMP] Mangrove: an Inference-based Dynamic Invariant Mining for GPU Architectures. IEEE Transactions on Computers, 2019
2018
- [IEEE HPEC] Hornet: An Efficient Data Structure for Dynamic Sparse Graphs and Matrices on GPUs. IEEE High Performance Extreme Computing Conference, 2018
- [ACM EUROPAR] Efficient Load Balancing Techniques for Graph Traversal Applications on GPUs. ACM International European Conference on Parallel and Distributed Computing, 2018
- BMC
2017
- [IEEE HPEC] Quickly Finding a Truss in a Haystack. IEEE High Performance Extreme Computing Conference, IEEE/Amazon/DARPA Graph Challenge, 2017
- [BITS] cuRnet: an R Package for the Single-source Shortest Paths Analysis on GPUs. Bioinformatics Italian Society (BITS), 2017
- [IEEE SIES] A Performance, Power, and Energy Efficiency Analysis of Load Balancing Techniques for GPUs. IEEE International Symposium on Industrial Embedded Systems, 2017
- [ACM/EDAC/IEEE DAC] Power-aware Performance Tuning of GPU Applications Through Microbenchmarking. ACM/EDAC/IEEE Design Automation Conference, 2017
- [ACM EUROPAR] Parametric Multi-Step Scheme for GPU-Accelerated Graph Decomposition into Strongly Connected Components. ACM International European Conference on Parallel and Distributed Computing, 2017
- [IEEE TPDS] A Dynamic Approach for Workload Partitioning on GPU Architectures. IEEE Transactions on Parallel and Distributed Systems, 2017
2016
- [IEEE SIES] MIPP: A Microbenchmark Suite for Performance, Power, and Energy Consumption Characterization of GPU Architectures. IEEE International Symposium on Industrial Embedded Systems, 2016
- [ACM/IEEE DATE] A Fine-grained Performance Model for GPU Architectures. ACM/IEEE International Conference on Design, Automation and Test in Europe, 2016
- [IEEE TPDS] An Efficient Implementation of the Bellman-Ford Algorithm for Kepler GPU Architectures. IEEE Transactions on Parallel and Distributed Systems, 2016
- Elsevier
2015
- [IEEE ICCD] Exploiting GPU Architectures for Dynamic Invariant Mining. IEEE International Conference on Computer Design, 2015
- [IEEE MCSoC] On the Load Balancing Techniques for GPU Applications Based on Prefix-scan. IEEE International Symposium on Embedded Multicore/Manycore System-on-Chip, 2015
- [IEEE MCSoC] An Enhanced Profiling Framework for the Analysis and Development of Parallel Primitives for GPUs. IEEE International Symposium on Embedded Multicore/Manycore System-on-Chip, 2015
- [IEEE TPDS] APPAGATO: an APproximate PArallel and stochastic GrAph querying TOol for biological networks. BMC Bioinformatics, 2015
- [IEEE TETC] Pro++: A Profiling Framework for Primitive-based GPU Programming. IEEE Transactions on Emerging Topics in Computing, 2015
- [IEEE TPDS] BFS-4K: an Efficient Implementation of BFS for Kepler GPU Architectures. IEEE Transactions on Parallel and Distributed Systems, 2015