Skip to content

Releases: JuliaGPU/AMDGPU.jl

v0.7.1

23 Oct 14:05
Compare
Choose a tag to compare

AMDGPU v0.7.1

Diff since v0.7.0

Merged pull requests:

Closed issues:

  • Support for multi-GPU nodes broken in 0.7 (#527)

v0.7.0

23 Oct 11:11
Compare
Choose a tag to compare

AMDGPU v0.7.0

Diff since v0.6.1

Merged pull requests:

Closed issues:

  • Runtime Locking (#64)
  • 2x slower AMDGPU.jl kernel compared to HIP (#331)
  • sincos() x3.5 slower than separate sin()/cos() calls (#341)
  • HSA memory fault using AMDGPU.rand() on device β‰  1 (#386)
  • WARNING: could not import AMDGPU.device_libs_path into Compiler (#434)
  • sincos intrinsic is broken with GPUCompiler 0.24 (#502)
  • Navi 3 causes malloc(): unsorted double linked list corrupted (#518)

v0.6.1

23 Sep 22:01
Compare
Choose a tag to compare

AMDGPU v0.6.1

Diff since v0.6.0

Merged pull requests:

Closed issues:

  • Precompilation fails (#499)
  • Synchronization in REPL (#500)

v0.6.0

09 Sep 10:38
Compare
Choose a tag to compare

AMDGPU v0.6.0

Diff since v0.5.7

Closed issues:

  • Functions to map to/from HIP agent IDs (#5)
  • Use refcounting for memory management (#207)
  • Make unsafe_copy3d! TLS compatible (#421)

Merged pull requests:

v0.5.7

04 Sep 16:24
Compare
Choose a tag to compare

AMDGPU v0.5.7

Diff since v0.5.6

Merged pull requests:

v0.5.6

02 Sep 21:00
Compare
Choose a tag to compare

AMDGPU v0.5.6

Diff since v0.5.5

Closed issues:

  • Implement exponential back-off for signal wait (#84)
  • Implement occupancy estimator (#112)
  • AMDGPU test errors on gfx908 (Ubuntu 20.04, ROCm 4.2, Julia 1.6.1) (#138)
  • randn(Float32, 111) and rand(Float32, 111) fail (#161)
  • Feature request: allow hsa_amd_memory_copy_async to pick a queue (#204)
  • HSA memory test hang the GPU in CI (#226)
  • AMDGPU.agents() doesn't see GPU (#236)

Merged pull requests:

v0.5.5

21 Aug 15:25
Compare
Choose a tag to compare

Diff since v0.5.4

Merged pull requests:

v0.5.4

12 Aug 13:30
Compare
Choose a tag to compare

AMDGPU v0.5.4

Diff since v0.5.3

Closed issues:

  • accumulate function missing? (#317)

Merged pull requests:

v0.5.3

08 Aug 22:16
Compare
Choose a tag to compare

AMDGPU v0.5.3

Diff since v0.5.2

Closed issues:

  • AMDGPU.jl master is broken on Julia 1.7 (#372)
  • Failure calling upon calling Enzyme autodiff_deferred (#444)
  • Segmentation fault on hipStreamDestroy (#449)
  • Setting HIP_VISIBLE_DEVICES to an invalid ID fails in an unhelpful way (#450)
  • hipErrorSharedObjectInitFailed (#451)
  • Unexpected error: ccall requires compiler when using QR (#461)

Merged pull requests:

v0.5.2

30 Jul 14:03
Compare
Choose a tag to compare

AMDGPU v0.5.2

Diff since v0.5.1

Merged pull requests: