The WY Representation for Products of Householder Matrices
Abstract
MSC codes
Keywords
Get full access to this article
View all available purchase options and get full access to this article.
References
Information & Authors
Information
Published In

Copyright
History
MSC codes
Keywords
Authors
Metrics & Citations
Metrics
Citations
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.
Cited By
- Probabilistic Rounding Error Analysis of Householder QR FactorizationSIAM Journal on Matrix Analysis and Applications, Vol. 44, No. 3 | 28 July 2023
- Efficient parallel reduction of bandwidth for symmetric matricesParallel Computing, Vol. 115 | 1 Feb 2023
- A Test for FLOPs as a Discriminant for Linear Algebra Algorithms2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) | 1 Nov 2022
- High-performance reconstruction of CT medical images by using out-of-core methods in GPUComputer Methods and Programs in Biomedicine, Vol. 218 | 1 May 2022
- Batch-Efficient EigenDecomposition for Small and Medium MatricesComputer Vision – ECCV 2022 | 28 October 2022
- Efficient Parallel Reduction of Bandwidth for Symmetric MatricesSSRN Electronic Journal | 1 Jan 2022
- Acceleration of Parallel-Blocked QR Decomposition of Tall-and-Skinny Matrices on FPGAsACM Transactions on Architecture and Code Optimization, Vol. 18, No. 3 | 10 May 2021
- Low synchronization Gram–Schmidt and generalized minimal residual algorithmsNumerical Linear Algebra with Applications, Vol. 28, No. 2 | 22 October 2020
- Rounding Error Analysis of Mixed Precision Block Householder QR AlgorithmsSIAM Journal on Scientific Computing, Vol. 43, No. 3 | 17 May 2021
- Parallel reduction of four matrices to condensed form for a generalized matrix eigenvalue algorithmNumerical Algorithms, Vol. 86, No. 1 | 2 March 2020
- Randomized Projection for Rank-Revealing Matrix Factorizations and Low-Rank ApproximationsSIAM Review, Vol. 62, No. 3 | 6 August 2020
- Parallelized QR decomposition using GPUs2019 IEEE Canadian Conference of Electrical and Computer Engineering (CCECE) | 1 May 2019
- Simultaneous band reduction of two symmetric matricesComputers & Mathematics with Applications, Vol. 77, No. 8 | 1 Apr 2019
- randUTVACM Transactions on Mathematical Software, Vol. 45, No. 1 | 14 March 2019
- Efficient Reduction of Banded Hermitian Positive Definite Generalized Eigenvalue Problems to Banded Standard Eigenvalue ProblemsSIAM Journal on Scientific Computing, Vol. 41, No. 1 | 19 February 2019
- Block Modified Gram--Schmidt Algorithms and Their AnalysisSIAM Journal on Matrix Analysis and Applications, Vol. 40, No. 4 | 29 October 2019
- The mathematical modeling of the incomplete algebraic eigenvector and eigenvalue problem for obtaining the reduced frequency equation and its solutionIOP Conference Series: Materials Science and Engineering, Vol. 456 | 31 December 2018
- An efficient solution of real-time data processing for multi-GNSS networkJournal of Geodesy, Vol. 92, No. 7 | 7 December 2017
- Computationally Efficient Orthogonal Precoding for Sidelobe Suppression of OFDM Signals2018 IEEE International Conference on Communications (ICC) | 1 May 2018
- QRkit: Sparse, Composable QR Decompositions for Efficient and Stable Solutions to Problems in Computer Vision2018 IEEE Winter Conference on Applications of Computer Vision (WACV) | 1 Mar 2018
- Distributed One-Stage Hessenberg-Triangular Reduction with Wavefront SchedulingSIAM Journal on Scientific Computing, Vol. 40, No. 2 | 13 March 2018
- The Singular Value Decomposition: Anatomy of Optimizing an Algorithm for Extreme ScaleSIAM Review, Vol. 60, No. 4 | 8 November 2018
- A Householder-Based Algorithm for Hessenberg-Triangular ReductionSIAM Journal on Matrix Analysis and Applications, Vol. 39, No. 3 | 14 August 2018
- Bidiagonalization and R-Bidiagonalization: Parallel Tiled Algorithms, Critical Paths and Distributed-Memory Implementation2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) | 1 May 2017
- Orthogonal precoding for sidelobe suppression in DFT-based systems using block reflectors2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | 1 Mar 2017
- High-performance generation of the Hamiltonian and Overlap matrices in FLAPW methodsComputer Physics Communications, Vol. 211 | 1 Feb 2017
- An efficient solver for large structured eigenvalue problems in relativistic quantum chemistryMolecular Physics, Vol. 115, No. 1-2 | 11 March 2016
- Randomized QR with Column PivotingSIAM Journal on Scientific Computing, Vol. 39, No. 4 | 1 August 2017
- Householder QR Factorization With Randomization for Column Pivoting (HQRRP)SIAM Journal on Scientific Computing, Vol. 39, No. 2 | 11 April 2017
- Least Squares ProblemsScientific Computing | 15 May 2018
- Linear algebra software for large-scale accelerated multicore computingActa Numerica, Vol. 25 | 23 May 2016
- Orthogonal Factorization and Linear Least Squares ProblemsParallelism in Matrix Computations | 26 July 2015
- Performance Analysis of Updating-QR Supported OLS Against Stochastic Gradient DescentIntelligent Systems Technologies and Applications | 29 August 2015
- Reconstructing Householder vectors from Tall-Skinny QRJournal of Parallel and Distributed Computing, Vol. 85 | 1 Nov 2015
- Group-theoretical vector space modelInternational Journal of Computer Mathematics, Vol. 92, No. 8 | 16 September 2014
- BLIS: A Framework for Rapidly Instantiating BLAS FunctionalityACM Transactions on Mathematical Software, Vol. 41, No. 3 | 1 June 2015
- Performance Evaluation of the Eigen Exa Eigensolver on Oakleaf-FX: Tridiagonalization Versus Pentadiagonalization2015 IEEE International Parallel and Distributed Processing Symposium Workshop | 1 May 2015
- Performance Analysis and Optimisation of Two-sided Factorization Algorithms for Heterogeneous PlatformProcedia Computer Science, Vol. 51 | 1 Jan 2015
- A Time-split Discontinuous Galerkin Transport Scheme for Global Atmospheric ModelProcedia Computer Science, Vol. 51 | 1 Jan 2015
- Computing Low-Rank Approximation of a Dense Matrix on Multicore CPUs with a GPU and Its Application to Solving a Hierarchically Semiseparable Linear System of EquationsScientific Programming, Vol. 2015 | 1 Jan 2015
- Accelerating Computation of Eigenvectors in the Dense Nonsymmetric Eigenvalue ProblemHigh Performance Computing for Computational Science -- VECPAR 2014 | 18 April 2015
- Linear Least Squares ProblemsNumerical Methods in Matrix Computations | 7 October 2014
- Performance analysis and structured parallelisation of the space–time adaptive processing computational kernel on multi-core architecturesInternational Journal of Parallel, Emergent and Distributed Systems, Vol. 29, No. 5 | 11 February 2014
- Communication lower bounds and optimal algorithms for numerical linear algebraActa Numerica, Vol. 23 | 12 May 2014
- A novel hybrid CPU–GPU generalized eigensolver for electronic structure calculations based on fine-grained memory aware tasksThe International Journal of High Performance Computing Applications, Vol. 28, No. 2 | 30 August 2013
- Restructuring the Tridiagonal and Bidiagonal QR Algorithms for PerformanceACM Transactions on Mathematical Software, Vol. 40, No. 3 | 1 April 2014
- A multicore solution to Block–Toeplitz linear systems of equationsThe Journal of Supercomputing, Vol. 65, No. 3 | 25 September 2012
- Scaling LAPACK panel operations using parallel cache assignmentACM Transactions on Mathematical Software, Vol. 39, No. 4 | 23 July 2013
- Efficient generalized Hessenberg form and applicationsACM Transactions on Mathematical Software, Vol. 39, No. 3 | 3 May 2013
- Frequency response computation of structures including non-proportional damping in a shared memory environmentProceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, Vol. 227, No. 2 | 29 May 2012
- Implementations of Main Algorithms for Generalized Symmetric Eigenproblem on GPU AcceleratorGPU Solutions to Multi-scale Problems in Science and Engineering | 9 January 2013
- I/O efficient QR and QZ algorithms2012 19th International Conference on High Performance Computing | 1 Dec 2012
- Families of Algorithms for Reducing a Matrix to Condensed FormACM Transactions on Mathematical Software, Vol. 39, No. 1 | 1 November 2012
- Low-rank incremental methods for computing dominant singular subspacesLinear Algebra and its Applications, Vol. 436, No. 8 | 1 Apr 2012
- Analysis of dynamically scheduled tile algorithms for dense linear algebra on multicore architecturesConcurrency and Computation: Practice and Experience, Vol. 24, No. 3 | 22 August 2011
- Divide and Conquer on Hybrid GPU-Accelerated Multicore SystemsSIAM Journal on Scientific Computing, Vol. 34, No. 2 | 12 April 2012
- I/O Efficient Algorithms for Block Hessenberg Reduction Using Panel ApproachBig Data Analytics | 1 Jan 2012
- Parallel solution of partial symmetric eigenvalue problems from electronic structure calculationsParallel Computing, Vol. 37, No. 12 | 1 Dec 2011
- Efficient implementation of QR decomposition on intel multi-core processors2011 seventh International Computer Engineering Conference (ICENCO'2011) | 1 Dec 2011
- High-performance up-and-downdating via householder-like transformationsACM Transactions on Mathematical Software, Vol. 38, No. 1 | 7 December 2011
- Algorithm 915, SuiteSparseQRACM Transactions on Mathematical Software, Vol. 38, No. 1 | 7 December 2011
- Developing algorithms and software for the parallel solution of the symmetric eigenvalue problemJournal of Computational Science, Vol. 2, No. 3 | 1 Aug 2011
- Minimizing Communication in Numerical Linear AlgebraSIAM Journal on Matrix Analysis and Applications, Vol. 32, No. 3 | 8 September 2011
- A Class of Hybrid LAPACK Algorithms for Multicore and GPU Architectures2011 Symposium on Application Accelerators in High-Performance Computing | 1 Jul 2011
- QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators2011 IEEE International Parallel & Distributed Processing Symposium | 1 May 2011
- Condensed forms for the symmetric eigenvalue problem on multi‐threaded architecturesConcurrency and Computation: Practice and Experience, Vol. 23, No. 7 | 8 November 2010
- Implementing Matrix Factorizations on the Cell B. E.Scientific Computing with Multicore and Accelerators | 28 January 2011
- Scaling LAPACK panel operations using parallel cache assignmentACM SIGPLAN Notices, Vol. 45, No. 5 | 9 January 2010
- Scheduling dense linear algebra operations on multicore processorsConcurrency and Computation: Practice and Experience, Vol. 22, No. 1 | 1 Jan 2010
- A class of parallel tiled linear algebra algorithms for multicore architecturesParallel Computing, Vol. 35, No. 1 | 1 Jan 2009
- Blocked algorithms for the reduction to Hessenberg-triangular form revisitedBIT Numerical Mathematics, Vol. 48, No. 3 | 15 July 2008
- Updating the QR decomposition of block tridiagonal and block Hessenberg matricesApplied Numerical Mathematics, Vol. 58, No. 6 | 1 Jun 2008
- Parallel block tridiagonalization of real symmetric matricesJournal of Parallel and Distributed Computing, Vol. 68, No. 5 | 1 May 2008
- Cache efficient bidiagonalization using BLAS 2.5 operatorsACM Transactions on Mathematical Software, Vol. 34, No. 3 | 16 May 2008
- Scheduling of QR Factorization Algorithms on SMP and Multi-Core Architectures16th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2008) | 1 Feb 2008
- Fast linear algebra is stableNumerische Mathematik, Vol. 108, No. 1 | 10 October 2007
- Block and Parallel Versions of One-Sided BidiagonalizationSIAM Journal on Matrix Analysis and Applications, Vol. 29, No. 3 | 31 August 2007
- Accumulating Householder transformations, revisitedACM Transactions on Mathematical Software, Vol. 32, No. 2 | 1 June 2006
- Algorithm 854ACM Transactions on Mathematical Software, Vol. 32, No. 2 | 1 June 2006
- Parallel Algorithms for the Determination of Lyapunov Characteristics of Large Nonlinear Dynamical SystemsApplied Parallel Computing. State of the Art in Scientific Computing | 1 Jan 2006
- An Implementation of the Block Householder MethodIPSJ Digital Courier, Vol. 2, No. 0 | 1 Jan 2006
- Fast direct solvers for some complex symmetric block Toeplitz linear systemsLinear Algebra and its Applications, Vol. 404 | 1 Jul 2005
- An Efficient Parallel Algorithm to Solve Block?Toeplitz SystemsThe Journal of Supercomputing, Vol. 32, No. 3 | 1 Jun 2005
- Parallel out-of-core computation and updating of the QR factorizationACM Transactions on Mathematical Software, Vol. 31, No. 1 | 1 March 2005
- Stewart's pivoted QLP decomposition for low‐rank matricesNumerical Linear Algebra with Applications, Vol. 12, No. 2-3 | 29 September 2004
- A Parallel Eigensolver for Dense Symmetric Matrices Based on Multiple Relatively Robust RepresentationsSIAM Journal on Scientific Computing, Vol. 27, No. 1 | 25 July 2006
- Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library SoftwareSIAM Review, Vol. 46, No. 1 | 4 August 2006
- Performance Evaluation of Parallel Gram-Schmidt Re-orthogonalization MethodsHigh Performance Computing for Computational Science — VECPAR 2002 | 15 April 2003
- Efficient Computation of the Riemannian SVD in Total Least Squares Problems in Information RetrievalTotal Least Squares and Errors-in-Variables Modeling | 1 Jan 2002
- Lanczos, Householder transformations, and implicit deflation for fast and reliable dominant singular subspace computationNumerical Linear Algebra with Applications, Vol. 8, No. 4 | 26 March 2001
- High-Performance Library Software for QR FactorizationApplied Parallel Computing. New Paradigms for HPC in Industry and Academia | 10 April 2001
- A framework for symmetric band reductionACM Transactions on Mathematical Software, Vol. 26, No. 4 | 1 December 2000
- Algorithm 807ACM Transactions on Mathematical Software, Vol. 26, No. 4 | 1 December 2000
- PARALLEL GIVENS SEQUENCES FOR SOLVING THE GENERAL LINEAR MODEL ON A EREW PRAM∗Parallel Algorithms and Applications, Vol. 15, No. 1-2 | 1 Jun 2000
- Blocked algorithms and software for reduction of a regular matrix pair to generalized Schur formACM Transactions on Mathematical Software, Vol. 25, No. 4 | 1 December 1999
- Efficient parallel reduction to bidiagonal formParallel Computing, Vol. 25, No. 8 | 1 Sep 1999
- Efficient eigenvalue and singular value computations on shared memory machinesParallel Computing, Vol. 25, No. 7 | 1 Jul 1999
- Multi-sweep Algorithms for the Symmetric EigenproblemVector and Parallel Processing – VECPAR’98 | 1 Jan 1999
- Using Pentangular Factorizations for the Reduction to Banded FormEuro-Par’99 Parallel Processing | 6 August 1999
- Massive Parallelism: The Hardware for Computational Chemistry?High-Performance Computing | 1 Jan 1999
- Orthogonal reduction of dense matrices to bidiagonal form on computers with distributed memory architecturesParallel Computing, Vol. 24, No. 2 | 1 Feb 1998
- New serial and parallel recursive QR factorization algorithms for SMP systemsApplied Parallel Computing Large Scale Scientific and Industrial Problems | 20 October 2006
- Fast Diagonalization of Large and Dense Complex Symmetric Matrices, with Applications to Quantum Reaction DynamicsSIAM Journal on Scientific Computing, Vol. 18, No. 5 | 25 July 2006
- Efficient householder QR factorization for superscalar processorsACM Transactions on Mathematical Software, Vol. 23, No. 3 | 1 September 1997
- Sparse Multifrontal Rank Revealing QR FactorizationSIAM Journal on Matrix Analysis and Applications, Vol. 18, No. 1 | 31 July 2006
- A block representation for products of hyperbolic householder transformApplied Mathematics Letters, Vol. 10, No. 1 | 1 Jan 1997
- A multishift Hessenberg method for pole assignment of single-input systemsIEEE Transactions on Automatic Control, Vol. 41, No. 12 | 1 Dec 1996
- On Tridiagonalizing and Diagonalizing Symmetric Matrices with Repeated EigenvaluesSIAM Journal on Matrix Analysis and Applications, Vol. 17, No. 4 | 31 July 2006
- Multifrontal Computation with the Orthogonal Factors of Sparse MatricesSIAM Journal on Matrix Analysis and Applications, Vol. 17, No. 3 | 17 February 2012
- High performance algorithms for Toeplitz and block Toeplitz matricesLinear Algebra and its Applications, Vol. 241-243 | 1 Jul 1996
- The design of a parallel dense linear algebra software library: Reduction to Hessenberg, tridiagonal, and bidiagonal formNumerical Algorithms, Vol. 10, No. 2 | 1 Sep 1995
- Fast rectangular matrix multiplication and QR decompositionLinear Algebra and its Applications, Vol. 221 | 1 May 1995
- Stability of blockLU factorizationNumerical Linear Algebra with Applications, Vol. 2, No. 2 | 1 Mar 1995
- Portable Parallel implementation of BLAS 3Concurrency: Practice and Experience, Vol. 6, No. 5 | 1 Aug 1994
- A parallel block implementation of Level-3 BLAS for MIMD vector processorsACM Transactions on Mathematical Software, Vol. 20, No. 2 | 1 June 1994
- On Solving Block Toeplitz Systems Using a Block Schur Algorithm1994 International Conference on Parallel Processing Vol. 3 | 1 Jan 1994
- Parallel tridiagonalization through two-step band reductionProceedings of IEEE Scalable High Performance Computing Conference | 1 Jan 1994
- Calculations of magnetic propertiesMolecular Physics, Vol. 80, No. 1 | 23 August 2006
- Parallel algorithm for solving some spectral problems of linear algebraLinear Algebra and its Applications, Vol. 188-189 | 1 Jul 1993
- An Algorithm for the Banded Symmetric Generalized MatrixSIAM Journal on Matrix Analysis and Applications, Vol. 14, No. 2 | 17 July 2006
- Reducing the Symmetric Matrix Eigenvalue Problem to Matrix MultiplicationsSIAM Journal on Scientific Computing, Vol. 14, No. 1 | 13 July 2006
- BibliographyScientific Computing | 1 Jan 1993
- The International Journal of Supercomputer Applications—The International Journal of Supercomputing Applications, Vol. 6, No. 4 | 16 September 2016
- Parallel Solution of Large Lyapunov EquationsSIAM Journal on Matrix Analysis and Applications, Vol. 13, No. 4 | 17 July 2006
- A block algorithm for computing rank-revealing QR factorizationsNumerical Algorithms, Vol. 2, No. 3 | 27 December 2013
- Stability of block algorithms with fast level-3 BLASACM Transactions on Mathematical Software, Vol. 18, No. 3 | 1 September 1992
- Parallel Block Matrix Factorizations on the Shared-Memory Multiprocessor Ibm 3090 VF/600JThe International Journal of Supercomputing Applications, Vol. 6, No. 1 | 12 September 2016
- Chapter 6 A survey of matrix computationsComputing | 1 Jan 1992
- LAPACK: A portable linear algebra library for high-performance computersConcurrency: Practice and Experience, Vol. 3, No. 6 | 1 Dec 1991
- Structure-Preserving and Rank-Revealing QR-FactorizationsSIAM Journal on Scientific and Statistical Computing, Vol. 12, No. 6 | 31 July 2006
- Block Gauss Reduction to Hessenberg FormSIAM Journal on Scientific and Statistical Computing, Vol. 12, No. 5 | 13 July 2006
- Parallel algorithms for QR decomposition on a shared memory multiprocessorParallel Computing, Vol. 17, No. 6-7 | 1 Sep 1991
- Use of Level 3 Blas in Lu Factorization in a Multiprocessing Environment On Three Vector Multiprocessors: the Alliant Fx/80, the Cray-2, and the Ibm 3090 VfThe International Journal of Supercomputing Applications, Vol. 5, No. 3 | 16 September 2016
- Chasing Algorithms for the Eigenvalue ProblemSIAM Journal on Matrix Analysis and Applications, Vol. 12, No. 2 | 17 July 2006
- A Parallel QR Factorization Algorithm with Controlled Local PivotingSIAM Journal on Scientific and Statistical Computing, Vol. 12, No. 1 | 13 July 2006
- Storage Schemes for Parallel Eigenvalue AlgorithmsNumerical Linear Algebra, Digital Signal Processing and Parallel Algorithms | 1 Jan 1991
- On GR Algorithms for the Eigenvalue ProblemNumerical Linear Algebra, Digital Signal Processing and Parallel Algorithms | 1 Jan 1991
- Parallel Algorithms for Singular Value ProblemsNumerical Linear Algebra, Digital Signal Processing and Parallel Algorithms | 1 Jan 1991
- Algorithms for the Polar DecompositionSIAM Journal on Scientific and Statistical Computing, Vol. 11, No. 6 | 13 July 2006
- Communication and matrix computations on large message passing systemsParallel Computing, Vol. 16, No. 1 | 1 Nov 1990
- Use of parallel level 3 BLAS in LU factorization on three vector multiprocessors the ALLIANT FX/80, the CRAY-2, and the IBM 3090 VFACM SIGARCH Computer Architecture News, Vol. 18, No. 3b | 1 June 1990
- Fast Polar Decomposition of an Arbitrary MatrixSIAM Journal on Scientific and Statistical Computing, Vol. 11, No. 4 | 13 July 2006
- Parallel Algorithms for Dense Linear Algebra ComputationsSIAM Review, Vol. 32, No. 1 | 18 July 2006
- A set of level 3 basic linear algebra subprogramsACM Transactions on Mathematical Software, Vol. 16, No. 1 | 1 March 1990
- An adaptive blocking strategy for matrix factorizationsCONPAR 90 — VAPP IV | 2 June 2005
- Gauß—EliminationLösung linearer Gleichungssysteme auf Parallelrechnern | 1 Jan 1990
- Fundamental Linear Algebra Computations on High-Performance ComputersSupercomputer ’90 | 1 Jan 1990
- Least-squares multiple updating algorithms on a hypercubeJournal of Parallel and Distributed Computing, Vol. 8, No. 1 | 1 Jan 1990
- Block reduction of matrices to condensed forms for eigenvalue computationsParallel Algorithms for Numerical Linear Algebra | 1 Jan 1990
- LAPACK: A portable linear algebra library for high-performance computersProceedings SUPERCOMPUTING '90 | 1 Jan 1990
- Distributed Orthogonal Factorization: Givens and Householder AlgorithmsSIAM Journal on Scientific and Statistical Computing, Vol. 10, No. 6 | 13 July 2006
- Adaptive blocking in the QR factorizationThe Journal of Supercomputing, Vol. 3, No. 3 | 1 Sep 1989
- Block reduction of matrices to condensed forms for eigenvalue computationsJournal of Computational and Applied Mathematics, Vol. 27, No. 1-2 | 1 Sep 1989
- An efficient parallel scheme for minimizing a sum of Euclidean normsLinear Algebra and its Applications, Vol. 121 | 1 Aug 1989
- Computing the singular value decomposition on a distributed system of vector processorsParallel Computing, Vol. 11, No. 2 | 1 Aug 1989
- Level 3 Blas in Lu Factorization On the Cray-2, Eta-10P, and Ibm 3090-200/VfThe International Journal of Supercomputing Applications, Vol. 3, No. 2 | 16 September 2016
- A note on the parallel Cholesky factorization of wide banded matricesParallel Computing, Vol. 10, No. 2 | 1 Apr 1989
- A Storage-Efficient $WY$ Representation for Products of Householder TransformationsSIAM Journal on Scientific and Statistical Computing, Vol. 10, No. 1 | 13 July 2006
- A fast algorithm for equality-constrained quadratic programming on the alliant FX/8Annals of Operations Research, Vol. 14, No. 1 | 1 Dec 1988
- Dense linear systems FORTRAN solvers on the IBM 3090 vector multiprocessorParallel Computing, Vol. 8, No. 1-3 | 1 Oct 1988
- SUPRENUM software for the symmetric eigenvalue problemParallel Computing, Vol. 7, No. 3 | 1 Sep 1988
- Solution of large, dense symmetric generalized eigenvalue problems using secondary storageACM Transactions on Mathematical Software, Vol. 14, No. 3 | 1 September 1988
- New software for large dense symmetric generalized eigenvalue problems using secondary storageJournal of Computational Physics, Vol. 77, No. 1 | 1 Jul 1988
- Designing linear algebra algorithms on the IBM 3090 vector multiprocessor with a hierarchical memory systemCalcolo, Vol. 25, No. 1-2 | 1 Mar 1988
- Block Reflectors: Theory and ComputationSIAM Journal on Numerical Analysis, Vol. 25, No. 1 | 14 July 2006
- The LINPACK Benchmark: An explanationSupercomputing | 27 May 2005
- Parallel Linear Algebra in Statistical ComputationsCompstat | 1 Jan 1988
- A proposal for a set of level 3 basic linear algebra subprogramsACM SIGNUM Newsletter, Vol. 22, No. 3 | 1 July 1987
- Skew-Hamiltonian and Hamiltonian Eigenvalue Problems: Theory, Algorithms and ApplicationsProceedings of the Conference on Applied Mathematics and Scientific Computing
- Parallel Tiled QR Factorization for Multicore ArchitecturesParallel Processing and Applied Mathematics
- Implementing Linear Algebra Routines on Multi-core Processors with Pipelining and a Look AheadApplied Parallel Computing. State of the Art in Scientific Computing
- A three-parameter fast Givens QR algorithm for superscalar processorsProceedings of the 1996 ICPP Workshop on Challenges for Parallel Processing
- Parallel compact WY QR factorization for asynchronous message passingConference Proceedings of the IEEE International Performance, Computing, and Communications Conference (Cat. No.02CH37326)
- Parallel out-of-core cholesky and QR factorizations with POOCLAPACKProceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001
- Performance evaluation of two emerging media processors: VIRAM and ImagineProceedings International Parallel and Distributed Processing Symposium
- A parallel QR factorization algorithm using local pivotingProceedings. SUPERCOMPUTING '88
- On block Householder algorithms for the reduction of a matrix to Hessenberg formProceedings Supercomputing Vol.II: Science and Applications