rocBLAS User Guide¶
Contents:
- 1. Getting Started Guide for Linux
- 2. API Reference Guide
- 2.1. Introduction
- 2.2. rocBLAS API and Legacy BLAS functions
- 2.3. Using rocBLAS API
- 2.3.1. rocBLAS Datatypes
- 2.3.2. rocBLAS Enumeration
- 2.3.3. rocBLAS Helper functions
- 2.3.4. rocBLAS Level-1 functions
- 2.3.4.1. rocblas_iXamax + batched, strided_batched
- 2.3.4.2. rocblas_iXamin + batched, strided_batched
- 2.3.4.3. rocblas_Xasum + batched, strided_batched
- 2.3.4.4. rocblas_Xaxpy + batched, strided_batched
- 2.3.4.5. rocblas_Xcopy + batched, strided_batched
- 2.3.4.6. rocblas_Xdot + batched, strided_batched
- 2.3.4.7. rocblas_Xnrm2 + batched, strided_batched
- 2.3.4.8. rocblas_Xrot + batched, strided_batched
- 2.3.4.9. rocblas_Xrotg + batched, strided_batched
- 2.3.4.10. rocblas_Xrotm + batched, strided_batched
- 2.3.4.11. rocblas_Xrotmg + batched, strided_batched
- 2.3.4.12. rocblas_Xscal + batched, strided_batched
- 2.3.4.13. rocblas_Xswap + batched, strided_batched
- 2.3.5. rocBLAS Level-2 functions
- 2.3.5.1. rocblas_Xgbmv + batched, strided_batched
- 2.3.5.2. rocblas_Xgemv + batched, strided_batched
- 2.3.5.3. rocblas_Xger + batched, strided_batched
- 2.3.5.4. rocblas_Xsbmv + batched, strided_batched
- 2.3.5.5. rocblas_Xspmv + batched, strided_batched
- 2.3.5.6. rocblas_Xspr + batched, strided_batched
- 2.3.5.7. rocblas_Xspr2 + batched, strided_batched
- 2.3.5.8. rocblas_Xsymv + batched, strided_batched
- 2.3.5.9. rocblas_Xsyr + batched, strided_batched
- 2.3.5.10. rocblas_Xsyr2 + batched, strided_batched
- 2.3.5.11. rocblas_Xtbmv + batched, strided_batched
- 2.3.5.12. rocblas_Xtbsv + batched, strided_batched
- 2.3.5.13. rocblas_Xtpmv + batched, strided_batched
- 2.3.5.14. rocblas_Xtpsv + batched, strided_batched
- 2.3.5.15. rocblas_Xtrmv + batched, strided_batched
- 2.3.5.16. rocblas_Xtrsv + batched, strided_batched
- 2.3.5.17. rocblas_Xhemv + batched, strided_batched
- 2.3.5.18. rocblas_Xhbmv + batched, strided_batched
- 2.3.5.19. rocblas_Xhpmv + batched, strided_batched
- 2.3.5.20. rocblas_Xher + batched, strided_batched
- 2.3.5.21. rocblas_Xher2 + batched, strided_batched
- 2.3.5.22. rocblas_Xhpr + batched, strided_batched
- 2.3.5.23. rocblas_Xhpr2 + batched, strided_batched
- 2.3.6. rocBLAS Level-3 functions
- 2.3.6.1. rocblas_Xgemm + batched, strided_batched
- 2.3.6.2. rocblas_Xsymm + batched, strided_batched
- 2.3.6.3. rocblas_Xsyrk + batched, strided_batched
- 2.3.6.4. rocblas_Xsyr2k + batched, strided_batched
- 2.3.6.5. rocblas_Xsyrkx + batched, strided_batched
- 2.3.6.6. rocblas_Xtrmm + batched, strided_batched
- 2.3.6.7. rocblas_Xtrsm + batched, strided_batched
- 2.3.6.8. rocblas_Xhemm + batched, strided_batched
- 2.3.6.9. rocblas_Xherk + batched, strided_batched
- 2.3.6.10. rocblas_Xher2k + batched, strided_batched
- 2.3.6.11. rocblas_Xherkx + batched, strided_batched
- 2.3.6.12. rocblas_Xtrtri + batched, strided_batched
- 2.3.7. rocBLAS Extension
- 2.3.7.1. rocblas_axpy_ex + batched, strided_batched
- 2.3.7.2. rocblas_dot_ex + batched, strided_batched
- 2.3.7.3. rocblas_dotc_ex + batched, strided_batched
- 2.3.7.4. rocblas_nrm2_ex + batched, strided_batched
- 2.3.7.5. rocblas_rot_ex + batched, strided_batched
- 2.3.7.6. rocblas_scal_ex + batched, strided_batched
- 2.3.7.7. rocblas_gemm_ex + batched, strided_batched
- 2.3.7.8. rocblas_gemm_ext2
- 2.3.7.9. rocblas_trsm_ex + batched, strided_batched
- 2.3.7.10. rocblas_Xgeam + batched, strided_batched
- 2.3.7.11. rocblas_Xdgmm + batched, strided_batched
- 2.3.8. Device Memory Allocation in rocBLAS
- 2.3.9. Logging in rocBLAS
- 2.3.10. References
- 3. Programmer’s Guide
- 3.1. Library Source Code Organization
- 3.2. Handle, Stream, and Device Management
- 3.3. Device Memory Allocation
- 3.4. Thread Safe Logging
- 3.5. rocBLAS Numerical Checking
- 3.6. rocBLAS order of argument checking and logging
- 3.6.1. Legacy BLAS
- 3.6.2. rocBLAS
- 3.6.3. rocBLAS has the following differences when compared to Legacy BLAS
- 3.6.4. To accommodate the additions
- 3.6.5. Device Memory Size Queries
- 3.6.6. rocBLAS control flow
- 3.6.7. Legacy L1 BLAS “single vector”
- 3.6.8. Legacy L1 BLAS “two vector”
- 3.6.9. Legacy L2 BLAS
- 3.6.10. Legacy L3 BLAS
- 3.7. rocBLAS Benchmarking & Testing
- 4. Contributor’s Guide
- 5. Acknowledgement
- 6. Disclaimer