Skip to content

Kokkos Kernels is a package which provides Blas, Sparse and Graph kernels based on Kokkos implementations.

License

Notifications You must be signed in to change notification settings

dholladay00/kokkos-kernels

 
 

Repository files navigation

KokkosKernels provides dense and sparse linear algebra as well graph computing
local computational kernels using the Kokkos programming model.  
"Local" means no MPI or other distributed-memory communication.  
"Computational kernels" are coarse-grained operations; they take a lot of work 
and make sense to parallelize inside using Kokkos.  

Computational kernels in this subpackage include the following:

  - (Multi)vector dot products, norms, and updates (AXPY-like
    operations that add vectors together entry-wise)
  - Sparse matrix-vector multiply and other sparse matrix / dense
    vector kernels
  - Other operations that the Core subpackage of Tpetra (TpetraCore)
    needs

Kokkos is licensed under standard 3-clause BSD terms of use. For specifics
see the LICENSE file contained in the repository or distribution.

We organize this directory as follows:

  1. Public interfaces to computational kernels live in the src/
     subdirectory (tpetra/kernels/src):

     - Kokkos_Blas1_MV.hpp: (Multi)vector operations that
       Tpetra::MultiVector uses
     - Kokkos_CrsMatrix.hpp: Sparse matrix data structure used for the
       computational kernels below
     - Kokkos_Sparse.hpp: Sparse matrix-vector multiply with a
       single vector, stored in a 1-D View
     - Kokkos_Sparse_MV.hpp: Sparse matrix-vector multiply with
       multiple vectors at a time (multivectors), stored in a 2-D View

  2. Implementations of computational kernels live in the src/impl/
     subdirectory (tpetra/kernels/src/impl)

  3. Correctness tests live in the unit_test/ subdirectory, and
     performance tests live in the perf_test subdirectory

Do NOT use or rely on anything in the KokkosBlas::Impl namespace, or
on anything in the impl/ subdirectory.

This separation of interface and implementation lets the interface
assign the users' Views to View types with the desired attributes
(e.g., read-only, RandomRead).  This also makes it easier to provide
full specializations of the implementation.  "Full specializations"
mean that all the template parameters are fixed, so that the compiler
can actually compile the code.  This technique keeps Tpetra build
times down, since kernels are already precompiled for certain Scalar
types.  It also improves performance, since compilers have an easier
time optimizing code in shorter .cpp files.




About

Kokkos Kernels is a package which provides Blas, Sparse and Graph kernels based on Kokkos implementations.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 98.5%
  • Other 1.5%