Object Detection: YOLOv4

demo_yolov4/YOLOv4.livemd

Shozo Fukuda

@shoz-f

tfl_interp

Share to X

Share to Bluesky

More notebooks

Object Detection: YOLOv4

File.cd!(__DIR__)
# for windows JP
# System.shell("chcp 65001")
System.put_env("NNCOMPILED", "YES")

Mix.install([
  {:tfl_interp, path: ".."},
  {:cimg, "~> 0.1.14"},
  {:postdnn, "~> 0.1.5"},
  {:kino, "~> 0.7.0"}
])

0.Original work

Pytorch-YOLOv4

https://github.com/Tianxiaomo/pytorch-YOLOv4

This note usee the pretraind model converted from above project ;-)

Thanks a lot!!!

Implementation with TflInterp in Elixir

1.Defining the inference module: YOLOv4

Model

yolov4_1_3_608_608_static.tflite
Pre-processing

Resize the input image to the size {608,608}, normalize to {0.0,1.0} and transpose NCHW.
Post-processing

Filtering Boxes(output[0]) and Scores(output[1]) with Multi-class Non Maximum Suppression.

defmodule YOLOv4 do
  @moduledoc """
  Original work:
    Pytorch-YOLOv4 - https://github.com/Tianxiaomo/pytorch-YOLOv4
  """

  alias TflInterp, as: NNInterp

  use NNInterp,
    model: "./model/yolov4_1_3_608_608_static.tflite",
    url:
      "https://drive.google.com/uc?authuser=0&export=download&confirm=t&id=1z02aAUJ_phk3mom_w7J1fKbe9F_UM1HT",
    inputs: [f32: {1, 3, 608, 608}],
    outputs: [f32: {1, 22743, 1, 4}, f32: {1, 22743, 80}]

  def apply(img) do
    # preprocess
    input0 =
      img
      |> CImg.resize({608, 608})
      |> CImg.to_binary([{:range, {0.0, 1.0}}, :nchw])

    # prediction
    outputs =
      session()
      |> NNInterp.set_input_tensor(0, input0)
      |> NNInterp.invoke()

    # postprocess
    boxes = NNInterp.get_output_tensor(outputs, 0)
    scores = NNInterp.get_output_tensor(outputs, 1)

    PostDNN.non_max_suppression_multi_class(
      {22743, 80},
      boxes,
      scores,
      boxrepr: :corner,
      label: "./model/coco.label"
    )
  end
end

Launch YOLOv4.

# OnnxInterp.stop(Resnet18)
YOLOv4.start_link([])

Display the properties of the YOLOv4 model.

TflInterp.info(YOLOv4)

2.Defining execution module LiveYOLOv4

defmodule LiveYOLOv4 do
  @palette CImg.Util.rand_palette("./model/coco.label")

  def run(path) do
    img = CImg.load(path)

    with {:ok, res} <- YOLOv4.apply(img) do
      IO.inspect(res)

      Enum.reduce(res, CImg.builder(img), &amp;draw_item(&amp;1, &amp;2))
      |> CImg.display_kino(:jpeg)
    end
  end

  defp draw_item({name, boxes}, canvas) do
    color = @palette[name]

    Enum.reduce(boxes, canvas, fn [_score, x1, y1, x2, y2, _index], canvas ->
      [x1, y1, x2, y2] = PostDNN.clamp([x1, y1, x2, y2], {0.0, 1.0})

      CImg.fill_rect(canvas, x1, y1, x2, y2, color, 0.35)
    end)
  end
end

3.Let’s try it

LiveYOLOv4.run("dog.jpg")

Appendix

□

Other notebooks:

Yejun Su
@goofansu

ogp

ogp

ogp.livemd

tutorial intermediate ogp kino

2022-8-18
@TomBers

livebookNotes

Attractors

attractors.livemd

advanced data-science decimal vega_lite kino

2022-8-18
Kevin Pan
@feng19

spider_man

ElixirJobs

elixirjobs.livemd

tutorial advanced spider_man floki nimble_csv kino

2022-8-18
@TomBers

livebookNotes

Fun with Graphs

graphs.livemd

tutorial advanced intermediate vega_lite kino math

2022-8-18
Todd Pickell
@tapickell

nx_ml_learning

Chapt 9 notebook

chapt9.livemd

advanced data-science ai benchee explorer axon_onnx axon nx exla stb_image kino bumblebee kino_vega_lite scholar scidata table_rex tucan vega_lite

2024-9-9
Livebook
@livebook-dev

kino_db

Components

components.livemd

intermediate sql postgrex myxql kino_db

2022-8-18
Ryo Wakabayashi
@RyoWakabayashi

elixir-learning

Tellus から台風情報を取得する

typhoon.livemd

advanced apis req kino kino_maplibre

2022-12-17

Back