Object detection by YOLOv7

YoloV7.livemd

Shozo Fukuda

@shoz-f

tinyML_livebook

Share to X

Share to Bluesky

More notebooks

Object detection by YOLOv7

Mix.install([
  {:nx, "~> 0.2.1"},
  {:kino, "~> 0.6.2"},
  {:onnx_interp, github: "shoz-f/onnx_interp"},
  {:cimg, github: "shoz-f/cimg_ex"}
])

0.Original work

Chien-Yao Wang, Alexey Bochkovskiy, Hong-Yuan Mark Liao
“YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors”

Ibai Gorordo (@ibai_gorordo)
“YOLOv7 ONNX Converson(Google Colab)”

https://colab.research.google.com/drive/1733xwaETLhAJguRKDqhjgPWf7a2NaOro?usp=sharing

Thanks a lot!!!

Implementation with OnnxInterp in Elixir

1.Prepare the onnx model

Use Ibai’s jupyter notebook (URL above) to get the converted YOLOv7 onnx model from the Pytorch model. You put the model into the livebook home directory. And also you download the coco.label file and put it in the livebook directory.

> cd livebook
> cp down-load-directory/yolov7.onnx .
> wget https://raw.githubusercontent.com/shoz-f/onnx_interp/main/demo_yolo7/coco.label

2.Defining the inference module: DemoYolo7

Model
Standard Model: YOLOv7.onnx converted from Pytorch model.
Pre-processing:
Resize the input image to the size of @yolo7_shape and create a Float32 binary sequence normalized to the range {0.0, 1.0}, NCHW.
Post-processing:
Split the output tensor f32[18900][85] into class scores and bounding boxes and sieve the inference results by the score value threshold and NMS.

defmodule DemoYolo7 do
  # use OnnxInterp, model: Helper.model(), label: Helper.label()
  use OnnxInterp, model: "./yolov7.onnx", label: "./coco.label"

  @yolo7_shape {640, 480}

  def apply(img) do
    # preprocess
    bin = img
      |> CImg.resize(@yolo7_shape)
      |> CImg.to_binary([{:range, {0.0, 1.0}}, :nchw])

    # prediction
    outputs =
      __MODULE__
      |> OnnxInterp.set_input_tensor(0, bin)
      |> OnnxInterp.invoke()
      |> OnnxInterp.get_output_tensor(0)
      |> Nx.from_binary({:f, 32}) |> Nx.reshape({:auto, 85})

    # postprocess
    boxes  = extract_boxes(outputs, scale(img))
    scores = extract_scores(outputs)

    OnnxInterp.non_max_suppression_multi_class(__MODULE__,
      Nx.shape(scores), Nx.to_binary(boxes), Nx.to_binary(scores)
    )
  end

  defp extract_boxes(tensor, scale) do
    Nx.slice_along_axis(tensor, 0, 4, axis: 1) |> Nx.multiply(scale)
  end

  defp extract_scores(tensor) do
    Nx.multiply(Nx.slice_along_axis(tensor, 4, 1, axis: 1), Nx.slice_along_axis(tensor, 5, 80, axis: 1))
  end
  
  defp scale(img) do
    {w, h, _, _}   = CImg.shape(img)
    {wsize, hsize} = @yolo7_shape
    max(w/wsize, h/hsize)
  end
end

Launch DemoYolo7.

DemoYolo7.start_link([])

Displays the properties of the YOLOv7 model.

OnnxInterp.info(DemoYolo7)

3.Let’s try it

draw_object = fn builder, {name, boxes} ->
  Enum.reduce(boxes, builder, fn [_score | box], canvas ->
    [x0, y0, x1, y1] = Enum.map(box, &amp;round(&amp;1))

    CImg.draw_rect(canvas, x0, y0, x1, y1, {255, 0, 0})
    |> CImg.draw_text(x0, y0 - 16, name, 16, :red)
  end)
end

img = CImg.load("dog.jpg")

with {:ok, res} <- DemoYolo7.apply(img) do
  # draw result box
  Enum.reduce(Map.to_list(res), CImg.builder(img), &amp;draw_object.(&amp;2, &amp;1))
  |> CImg.run()
else
  _ -> img
end
|> CImg.resize({640, 480})
|> CImg.display_kino(:jpeg)

4.TIL ;-)

□

Other notebooks:

Michal Slaski
@michalslaski

livebook_examples

Salary predictions

salary_prediction.livemd

exla axon nx

2022-8-18
Dr. Christian Geuer-Pollmann
@chgeuer

livebook_on_azure

Christian's first LiveBook test

notebook1.livemd

axon exla nx

2022-8-18
@andyl

elix_util

MNIST

mnist.livemd

req axon exla nx

2022-8-18
@TomBers

livebookNotes

Attractors

attractors.livemd

decimal vega_lite kino

2022-8-18
Ryan Young
@ryoung786

AdventOfCode

Day 10

10.livemd

req vega_lite kino_vega_lite

2022-12-11
@DockYard-Academy

curriculum

Games: Rock Paper Scissors

games_rock_paper_scissors.livemd

jason kino youtube hidden_cell

2023-3-21
Hugo Baraúna
@hugobarauna

livebook-notebooks

How to query and visualize data from Google BigQue...

livebook_google_big_query.livemd

kino_db req_bigquery kino_vega_lite

2022-8-18

Back