CUDA v4.3.0
Closed issues:
- Multidimensional reverse (#1126)
- Test errors on master (#1866)
- Integer overflow error with svd for large matrix (#1880)
- Erratic behaviour of CUDA.jl if used in the REPL of VSCode. (#1892)
- QR decomposition requires scalar indexing (#1893)
- BSOD during package tests (#1898)
- Insufficient coverage of CuArrays in the documentation (#1901)
- Failed to compile with Julia v1.9 on PowerPC (#1911)
- CUDA test failed in wmma.jl (#1914)
- Fix deprecation warnings (#1920)
Merged pull requests:
- CUSOLVER: Fix workspace size passing. (#1890) (@maleadt)
- Lovelace fixes (#1894) (@maleadt)
- Update manifest (#1897) (@github-actions[bot])
- Reverse with multiple dimensions (#1899) (@RainerHeintzmann)
- Restrict number of test jobs based on available memory. (#1900) (@maleadt)
- Avoid unneeded macros to cut down on generated code (#1905) (@maleadt)
- Avoid unneeded macros to cut down on generated code (#1906) (@maleadt)
- Update manifest (#1907) (@github-actions[bot])
- Bump GPUCompiler. (#1908) (@maleadt)
- Don't use Float64 atomics on unsupported platforms. (#1912) (@maleadt)
- Report package versions as part of versioninfo(). (#1913) (@maleadt)
- Align variables in constant memory by 256 bit (#1915) (@Zentrik)
- Add norm functions for 3 floats (#1916) (@Zentrik)
- cuDNN: only choose conv algorithms if they match descriptor mathType (#1917) (@ToucheSir)
- Update manifest (#1918) (@github-actions[bot])
- Skip Integer WMMA tests on older devices. (#1919) (@maleadt)