Research Article
Computing Low-Rank Approximation of a Dense Matrix on Multicore CPUs with a GPU and Its Application to Solving a Hierarchically Semiseparable Linear System of Equations
Figure 5
Illustration of the hybrid QP3 implementation.
(a) Column-wise (left) and blocked (right bar) algorithm
(b) Ratio of exact to estimated condition number of the leading triangular factors after the QPR factorization
(c) StruMF solution time using task-1 (left), task-2 (middle), and task-1 and task-2 (right bar) postprocessing
(d) StruMF solution time with QP3 (left) and QPR (right bar)
(e) Hybrid paradigm
(f) Algorithmic flow