Examples

notebooks/examples.livemd

Numerical Elixir (Nx)

@elixir-nx

nx_image

Share to X

Share to Bluesky

More notebooks

Examples

Mix.install([
  {:nx, "~> 0.6.0"},
  {:nx_image, "~> 0.1.2"},
  {:kino, "~> 0.12.0"}
])

Upload your test image

Using a Livebook v0.8.0 or higher, we can add the new Kino.Input.image/1 to upload an image to our notebook.

image_input = Kino.Input.image("Uploaded Image")

We can use Input.read/1 to retrieve the information about our image.

%{file_ref: file_ref, format: :rgb, height: height, width: width} = Kino.Input.read(image_input)

content = file_ref |> Kino.Input.file_path() |> File.read!()

NxImage requires that the images be tensors in either HWC (default) or CHW order, with an arbitrary number of leading batch axes. The input data is already HWC, so creating a tensor is straightforward:

image_tensor =
  Nx.from_binary(content, :u8)
  |> Nx.reshape({height, width, 3})

Now that we have a tensor in the shape of {height, width, channels} we operate on it using the NxImage module.

Center crop

The first capability we’ll look at is center cropping.

center_cropped_tensor = NxImage.center_crop(image_tensor, {300, 300})

We’ve transformed the image from its original size to 300 x 300 by taking the pixels 150 above and below the image center. Similarly we have the 150 pixels to the left and right of the center point.

Visualization

Numbers are great, but most of us are visual focused. Let’s see what center crop did to our uploaded image:

Kino.Layout.grid(
  [
    Kino.Image.new(center_cropped_tensor),
    Kino.Markdown.new("**Center of the image**")
  ],
  boxed: true
)

Resize

We’ll resize the image. Whether this resized image is shrunk or enlarged is dependent upon the original image size. Resizing to a standard size can be useful when training visual models on a diverse set of source images.

resized_tensor = NxImage.resize(image_tensor, {768, 768}, method: :nearest)

Let’s display the original image and the resized image. If you can’t tell the difference, try a non-square image, or resizing to a very small resolution instead.

original_image = Kino.Image.new(image_tensor)
original_label = Kino.Markdown.new("**Original image**")

resized_image = Kino.Image.new(resized_tensor)
resized_label = Kino.Markdown.new("**Resized image**")

Kino.Layout.grid([
  Kino.Layout.grid([original_image, original_label], boxed: true),
  Kino.Layout.grid([resized_image, resized_label], boxed: true)
])

Let’s double check the shape of both images.

{image_tensor.shape, resized_tensor.shape}

We can see that the resized image has a different shape from the original image shape.

You can try other resize strategies: :bilinear, :bicubic, :lanczos3, :lanczos5. How do they affect the resulting image?

Further exploration

There are other functions in the NxImage module.

For example, NxImage.normalize/3 can be useful for transfer learning, where an original model is further trained on a set of images from your custom domain. The original images had a particular mean and standard deviation. When transfer learning from the base model, your source images are normalized in the same manner as the distribution of the original set of images.

> Note: in this notebook we were using the default Nx.BinaryBackend for all the operations. To speed up the operations you can configure an optimised backend or compiler, such as EXLA.

Other notebooks:

Michal Slaski
@michalslaski

livebook_examples

Salary predictions

salary_prediction.livemd

data-science advanced exla axon nx

2022-8-18
Dr. Christian Geuer-Pollmann
@chgeuer

livebook_on_azure

Christian's first LiveBook test

notebook1.livemd

data-science advanced tutorial axon exla nx

2022-8-18
@andyl

elix_util

MNIST

mnist.livemd

data-science advanced tutorial req axon exla nx

2022-8-18
Yejun Su
@goofansu

ogp

ogp

ogp.livemd

tutorial intermediate ogp kino

2022-8-18
@solidsnakecase

livebook

Iris Classification

iris.livemd

data-science advanced tutorial axon nx explorer kino

2024-8-26
Bret Kim
@chitacan

livebooks

Membrane and Kino

membrane-and-kino.livemd

advanced gen-server tutorial kino membrane_h264_ffmpeg_plugin membrane_hackney_plugin turbojpeg

2022-8-18
@DeSchoel

Elixir_Curriculum

Document Tools

document_tools.livemd

tutorial intermediate jason kino youtube hidden_cell

2026-1-10

Back