NxIREE.Backend
Mix.install([
  {:nx_iree, "~> 0.0"},
  {:benchee, "~> 1.3"}
])
Getting the device
# NxIREE.list_devices/1 returns {:ok, devices}; take the first Metal device
dev =
  NxIREE.list_devices("metal")
  |> elem(1)
  |> hd()
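If you want to see everything the Metal driver exposes before picking one, a minimal sketch such as the following prints the full list wrapped by the `{:ok, _}` tuple above (`devices` is just a local name introduced here):

# Sketch: list every Metal device reported by the IREE runtime
{:ok, devices} = NxIREE.list_devices("metal")
IO.inspect(devices, label: "Metal devices")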
Compiling the softmax function
# Target Apple GPUs through the Metal/SPIR-V backend, taking StableHLO as input
flags = [
  "--iree-hal-target-backends=metal-spirv",
  "--iree-input-type=stablehlo_xla",
  "--iree-execution-model=async-internal"
]

# Route defn compilation through IREE on the device selected above
Nx.Defn.default_options(
  compiler: NxIREE.Compiler,
  iree_compiler_flags: flags,
  iree_runtime_options: [device: dev]
)
softmax = fn tensor ->
  Nx.divide(
    Nx.exp(tensor),
    Nx.sum(Nx.exp(tensor), axes: [-1], keep_axes: true)
  )
end
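As an aside, `Nx.exp/1` can overflow for large inputs; a common remedy is to subtract the row-wise maximum before exponentiating. The variant below is only a sketch using standard `Nx` primitives (`stable_softmax` is a name introduced here, not part of the original notebook) and is not used in the benchmark:

# Sketch: numerically stable softmax that subtracts the per-row max before exp
stable_softmax = fn tensor ->
  shifted = Nx.subtract(tensor, Nx.reduce_max(tensor, axes: [-1], keep_axes: true))
  Nx.divide(Nx.exp(shifted), Nx.sum(Nx.exp(shifted), axes: [-1], keep_axes: true))
end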
iree_input =
  {1000, 1000, 5}
  |> Nx.iota(type: :f32, backend: NxIREE.Backend)
  |> Nx.divide(1024 * 1024)

softmax.(iree_input)
# Compile once for the given input template, then call the compiled function
compiled_softmax = Nx.Defn.compile(softmax, [Nx.template({1000, 1000, 5}, :f32)])

compiled_softmax.(iree_input)
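To sanity-check the compiled IREE result, one option (not in the original notebook; `reference` and `result` are illustrative names) is to copy both tensors to the host and compare them with `Nx.all_close/3`:

# Sketch: compare the IREE output against a plain BinaryBackend computation
reference = softmax.(Nx.backend_copy(iree_input, Nx.BinaryBackend))
result = Nx.backend_copy(compiled_softmax.(iree_input), Nx.BinaryBackend)

# Returns a u8 scalar tensor: 1 when every element is within tolerance
Nx.all_close(result, reference, rtol: 1.0e-4)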
Speed comparison
binary_input =
  {1000, 1000, 5}
  |> Nx.iota(type: :f32, backend: Nx.BinaryBackend)
  |> Nx.divide(1024 * 1024)

exla_input =
  {1000, 1000, 5}
  |> Nx.iota(type: :f32, backend: EXLA.Backend)
  |> Nx.divide(1024 * 1024)
Benchee.run(%{
  "nx" => fn -> softmax.(binary_input) end,
  "exla" => fn -> softmax.(exla_input) end,
  "nx_iree" => fn -> compiled_softmax.(iree_input) end
})
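Benchee also accepts tuning options as a second argument; the values below are only illustrative. Note that `memory_time` tracks memory allocated on the BEAM, so device-side buffers will not show up:

# Sketch: same comparison with explicit warmup, measurement time, and memory stats
Benchee.run(
  %{
    "nx" => fn -> softmax.(binary_input) end,
    "exla" => fn -> softmax.(exla_input) end,
    "nx_iree" => fn -> compiled_softmax.(iree_input) end
  },
  warmup: 2,
  time: 10,
  memory_time: 2
)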