Generative Inpainting

demo_deepfillv2/DeepFillV2.livemd

Shozo Fukuda

@shoz-f

tfl_interp

Share to X

Share to Bluesky

More notebooks

Generative Inpainting

File.cd!(__DIR__)
# for windows JP
System.shell("chcp 65001")
System.put_env("NNCOMPILED", "YES")

Mix.install([
  {:tfl_interp, path: ".."},
  {:cimg, "~> 0.1.19"},
  {:kino, "~> 0.7.0"}
])

0.Original work

“Generative Image Inpainting with Contextual Attention”

https://arxiv.org/abs/1801.07892

“Free-Form Image Inpainting with Gated Convolution”

https://arxiv.org/abs/1806.03589

GitHub: Generative Image Inpainting

https://github.com/JiahuiYu/generative_inpainting

The tflite model deepfillv2.tflite is converted from their pretraind model.

Thanks a lot!!!

Implementation with TflInterp in Elixir

1.Defining the inference module: DeepFillV2

Model

deepfillv2.tflite: get from “https://github.com/shoz-f/tfl_interp/releases/download/0.0.1/deepfillv2.tflite” if not existed.
Pre-processing

Combine the original and mask images into a single image, then resize it to {@width, @height} and normalize it to a range of {0.0, 255.0} for further manipulation.
Post-processing

The inpainted image is outputted directly by this model.

defmodule DeepFillV2 do
  @width 680
  @height 512

  alias TflInterp, as: NNInterp

  use NNInterp,
    model: "./model/deepfillv2.tflite",
    url: "https://github.com/shoz-f/tfl_interp/releases/download/0.0.1/deepfillv2.tflite",
    inputs: [f32: {1, @height, 2 * @width, 3}],
    outputs: [u8: {1, @height, @width, 3}]

  def apply(img, mask) do
    # preprocess
    input0 =
      CImg.builder(img)
      |> CImg.append(mask, :x)
      |> CImg.resize({2 * @width, @height})
      |> CImg.to_binary(range: {0.0, 255.0})

    # prediction
    {w, h, _, _} = CImg.shape(img)

    session()
    |> NNInterp.set_input_tensor(0, input0)
    |> NNInterp.invoke()
    |> NNInterp.get_output_tensor(0)
    |> CImg.from_binary(@width, @height, 1, 3, [{:dtype, " CImg.resize({w, h})
  end
end

Launch DeepFillV2.

# TflInterp.stop(DeepFillV2)
DeepFillV2.start_link([])

Display the properties of the DeepFillV2 model.

TflInterp.info(DeepFillV2)

2.Let’s try it

Load a photo and apply DeepFillV2 to it.

origin = CImg.load("sample_raw.jpg")
mask = CImg.load("sample_mask.jpg")

result = DeepFillV2.apply(origin, mask)

Enum.map([origin, mask, result], &amp;CImg.display_kino(&amp;1, :jpeg))
|> Kino.Layout.grid(columns: 2)

□

Other notebooks:

@TomBers

livebookNotes

Attractors

attractors.livemd

decimal vega_lite kino

2022-8-18
Kevin Pan
@feng19

spider_man

ElixirJobs

elixirjobs.livemd

spider_man floki nimble_csv kino

2022-8-18
@TomBers

livebookNotes

Fun with Graphs

graphs.livemd

vega_lite kino math

2022-8-18
@TomBers

livebookNotes

Epicycloid - draw Curves with Straight Lines

Epicycloid.livemd

vega_lite kino math

2022-8-18
Callum McIntyre
@mcintyre94

solana-transaction-livebo...

Helius Transaction Render

helius.livemd

req jason kino

2023-12-25
@andyl

livebooks

Matrix multiplication on GPU - XLA

exla.livemd

nx scidata axon exla

2023-12-4
Tchadel Icard
@tchadelicard

fiab

IMT TAF Fiabilité : Test de charge

simulation_analysis.livemd

kino kino_vega_lite nimble_csv explorer

2025-2-25

Back