ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures
Abstract
Keywords
MSC codes
Get full access to this article
View all available purchase options and get full access to this article.
References
Information & Authors
Information
Published In

Copyright
History
Keywords
MSC codes
Authors
Funding Information
Metrics & Citations
Metrics
Citations
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
Cited By
- A two-level GPU-accelerated incomplete LU preconditioner for general sparse linear systemsThe International Journal of High Performance Computing Applications, Vol. 39, No. 3 | 23 February 2025
- gpu-ISTL - Extending OPM Flow with GPU Linear SolversJournal of Open Source Software, Vol. 10, No. 109 | 1 May 2025
- PETSc/TAO developments for GPU-based early exascale systemsThe International Journal of High Performance Computing Applications, Vol. 39, No. 2 | 18 January 2025
- Theoretical framework for the difference of two negative binomial distributions and its application in comparative analysis of sequencing dataGenome Research, Vol. 34, No. 10 | 15 October 2024
- Optimization of Sparse Matrix Computation for Algebraic Multigrid on GPUsACM Transactions on Architecture and Code Optimization, Vol. 21, No. 3 | 14 September 2024
- Snowdrift‐Permitting Simulations of Seasonal Snowpack Processes Over Large Mountain ExtentsWater Resources Research, Vol. 60, No. 8 | 17 August 2024
- Ultra-efficient and parameter-free computation of submicron thermal transport with phonon Boltzmann transport equationFundamental Research, Vol. 4, No. 4 | 1 Jul 2024
- oclCUB: an OpenCL parallel computing library for deep learning operatorsCCF Transactions on High Performance Computing, Vol. 6, No. 3 | 16 February 2024
- Towards Dynamic Autotuning of SpMV in CUSP Library2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) | 27 May 2024
- Revisiting thread configuration of SpMV kernels on GPU: A machine learning based approachJournal of Parallel and Distributed Computing, Vol. 185 | 1 Mar 2024
- An efficient framework for matrix-free SpMV computation on GPU for elastoplastic problemsMathematics and Computers in Simulation, Vol. 216 | 1 Feb 2024
- Achieving high performance and portable parallel GMRES algorithm for compressible flow simulations on unstructured gridsThe Journal of Supercomputing, Vol. 79, No. 17 | 9 June 2023
- A simplified calculation for adaptive coefficients of finite-difference frequency-domain methodApplied Geophysics, Vol. 20, No. 3 | 5 December 2023
- Efficient Algorithm Design of Optimizing SpMV on GPUProceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing | 7 August 2023
- HeuriSPAI: a heuristic sparse approximate inverse preconditioning algorithm on GPUCCF Transactions on High Performance Computing, Vol. 5, No. 2 | 27 March 2023
- An enhanced implicit viscosity ISPH method for simulating free-surface flow coupled with solid-liquid phase changeJournal of Computational Physics, Vol. 474 | 1 Feb 2023
- parGeMSLR: A parallel multilevel Schur complement low-rank preconditioning and solution package for general sparse matricesParallel Computing, Vol. 113 | 1 Oct 2022
- Optimizing the sparse approximate inverse preconditioning algorithm on GPUBenchCouncil Transactions on Benchmarks, Standards and Evaluations, Vol. 2, No. 4 | 1 Oct 2022
- Quantum simulation with just-in-time compilationQuantum, Vol. 6 | 22 September 2022
- Focused wave interaction with a partially-immersed rectangular box using 2-D incompressible SPH on a GPU comparing with experiment and linear theoryEuropean Journal of Mechanics - B/Fluids, Vol. 95 | 1 Sep 2022
- Dynamic deformablesACM SIGGRAPH 2022 Courses | 2 August 2022
- Parallel, Portable Algorithms for Distance-2 Maximal Independent Set and Graph Coarsening2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS) | 1 May 2022
- Live user-guided depth map estimation for single imagesJournal of Real-Time Image Processing, Vol. 18, No. 6 | 13 January 2021
- Large scale simulation of pressure induced phase-field fracture propagation using UtopiaCCF Transactions on High Performance Computing, Vol. 3, No. 4 | 29 June 2021
- Toward performance-portable PETSc for GPU-based exascale systemsParallel Computing, Vol. 108 | 1 Dec 2021
- Consensus clustering of single-cell RNA-seq data by enhancing network affinityBriefings in Bioinformatics, Vol. 22, No. 6 | 23 June 2021
- PyPacho: A Python library that implements parallel basic operations on GPUs2021 IEEE 12th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) | 27 Oct 2021
- A parallel sparse approximate inverse preconditioning algorithm based on MPI and CUDABenchCouncil Transactions on Benchmarks, Standards and Evaluations, Vol. 1, No. 1 | 1 Oct 2021
- A Comparative Study of Block Incomplete Sparse Approximate Inverses Preconditioning on Tesla K20 and V100 GPUsAlgorithms, Vol. 14, No. 7 | 30 June 2021
- FEMS – A Mechanics-oriented Finite Element Modeling SoftwareComputer Physics Communications, Vol. 260 | 1 Mar 2021
- Point-block incomplete LU preconditioning with asynchronous iterations on GPU for multiphysics problemsThe International Journal of High Performance Computing Applications, Vol. 35, No. 2 | 28 December 2020
- Developing a Multi-GPU-Enabled Preconditioned GMRES with Inexact Triangular Solves for Block Sparse MatricesMathematical Problems in Engineering, Vol. 2021 | 27 Feb 2021
- Efficient Numerical Solution of the EMI Model Representing the Extracellular Space (E), Cell Membrane (M) and Intracellular Space (I) of a Collection of Cardiac CellsFrontiers in Physics, Vol. 8 | 13 January 2021
- Operator Splitting and Finite Difference Schemes for Solving the EMI ModelModeling Excitable Tissue | 31 October 2020
- Parallel Sorted Sparse Approximate Inverse Preconditioning Algorithm on GPUBenchmarking, Measuring, and Optimizing | 2 March 2021
- Matrix multiplication on batches of small matrices in half and half-complex precisionsJournal of Parallel and Distributed Computing, Vol. 145 | 1 Nov 2020
- BibliographyThe Big R‐Book | 26 October 2020
- Dynamic deformablesACM SIGGRAPH 2020 Courses | 17 August 2020
- scBatch: batch-effect correction of RNA-seq data through sample distance matrix adjustmentBioinformatics, Vol. 36, No. 10 | 13 February 2020
- Optimizing High Performance Markov Clustering for Pre-Exascale Architectures2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS) | 1 May 2020
- Automated OpenCL GPU kernel fusion for Stan MathProceedings of the International Workshop on OpenCL | 27 April 2020
- An efficient sparse approximate inverse preconditioning algorithm on GPUConcurrency and Computation: Practice and Experience, Vol. 32, No. 7 | 3 December 2019
- Preparing sparse solvers for exascale computingPhilosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 378, No. 2166 | 20 January 2020
- Statistical strategies for the analysis of massive data setsBiometrical Journal, Vol. 62, No. 2 | 12 September 2019
- "Equation missing" : A Cross-Platform Programming Framework for Quantum-Accelerated Scientific ComputingComputational Science – ICCS 2020 | 15 June 2020
- An Analysis for the Performance of Reservoir Simulations on a Multicore CPUProceedings of the International Field Exploration and Development Conference 2019 | 12 July 2020
- Hybrid OpenMP-CUDA parallel implementation of a deterministic solver for ultrashort DG-MOSFETsThe International Journal of High Performance Computing Applications, Vol. 34, No. 1 | 20 October 2019
- Performance optimization, modeling and analysis of sparse matrix-matrix products on multi-core and many-core processorsParallel Computing, Vol. 90 | 1 Dec 2019
- Development strategy and collaboration preference in S&T of enterprises based on funded papers: a case study of GoogleScientometrics, Vol. 121, No. 1 | 11 July 2019
- Time-Dependent Two-Fluid Magnetohydrodynamic Model and Simulation of the ChromosphereSolar Physics, Vol. 294, No. 9 | 18 September 2019
- A three-dimensional environmental hydrodynamic model, Fantom-Refined: Validation and application for saltwater intrusion in a meso-macrotidal estuaryOcean Modelling, Vol. 141 | 1 Sep 2019
- Anisotropic elasticity for inversion-safety and element rehabilitationACM Transactions on Graphics, Vol. 38, No. 4 | 12 July 2019
- Implicit Large Eddy Simulation of Low Pressure Turbine Airfoils Using a High Order Navier-Stokes SolverAIAA Aviation 2019 Forum | 14 June 2019
- An Introduction to hpxMPProceedings of the International Workshop on OpenCL | 13 May 2019
- Fast Batched Matrix Multiplication for Small Sizes Using Half-Precision Arithmetic on GPUs2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS) | 1 May 2019
- High-performance Processing of Covariance Matrices Using GPU ComputationsLobachevskii Journal of Mathematics, Vol. 40, No. 5 | 24 June 2019
- Extreme Scale FMM-Accelerated Boundary Integral Equation Solver for Wave ScatteringSIAM Journal on Scientific Computing, Vol. 41, No. 3 | 20 June 2019
- High-Performance Sparse Matrix-Matrix Products on Intel KNL and Multicore ArchitecturesProceedings of the 47th International Conference on Parallel Processing Companion | 13 August 2018
- ViennaCL++Proceedings of the International Workshop on OpenCL | 14 May 2018
- DM-HEOM: A Portable and Scalable Solver-Framework for the Hierarchical Equations of Motion2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) | 1 May 2018
- OuterSPACE: An Outer Product Based Sparse Matrix Multiplication Accelerator2018 IEEE International Symposium on High Performance Computer Architecture (HPCA) | 1 Feb 2018
- Memory-Efficient Sparse Matrix-Matrix Multiplication by Row Merging on Many-Core ArchitecturesSIAM Journal on Scientific Computing, Vol. 40, No. 4 | 3 July 2018
- Categorical Big Data ProcessingIntelligent Data Engineering and Automated Learning – IDEAL 2018 | 9 November 2018
- Control of accuracy in the Wang-Landau algorithmPhysical Review E, Vol. 96, No. 4 | 18 October 2017
- Adaptive co-simulation of functional-thermal behaviour of integrated circuits2017 23rd International Workshop on Thermal Investigations of ICs and Systems (THERMINIC) | 1 Sep 2017
- gpuR: GPU Functions for R Objects20 Nov 2015
View Options
- Access via your Institution
- Questions about how to access this content? Contact SIAM at [email protected].