Bumblebee

oss_survey/bumblebee.livemd

Yuki Hisae

@hisaway

codex_fragments

Mix.install([
  {:bumblebee, "~> 0.6.0"},
  {:nx, "~> 0.9.0"},
  {:exla, "~> 0.9.0"},
  {:axon, "~> 0.7.0"},
  {:kino, "~> 0.14.0"}
])

Nx.global_default_backend(EXLA.Backend)

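Structure of Bumblebee

In short, Bumblebee is a library that provides end-to-end (E2E) tasks suited to various processing domains.
In addition to supporting models built with Axon and models from sources such as Hugging Face, Bumblebee provides the preprocessing needed to use a model (such as text tokenization) and the postprocessing of results (such as extracting and labeling outputs) in a form that can be invoked through Nx.Serving.run.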
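
As a rough illustration of how preprocessing and postprocessing attach to a serving, the sketch below builds a toy Nx.Serving by hand. It assumes the Nx version pinned in the setup cell (0.9, where the postprocessing callback receives an {output, metadata} tuple and the client info). The doubling function is a hypothetical stand-in for a model, and this is not a description of how Bumblebee implements its own servings.

```elixir
# Toy "model": doubles every element (a hypothetical stand-in for a real network).
serving =
  Nx.Serving.new(fn opts -> Nx.Defn.jit(fn x -> Nx.multiply(x, 2) end, opts) end)
  # Preprocessing: wrap a plain list of numbers into a batch holding one tensor.
  |> Nx.Serving.client_preprocessing(fn numbers ->
    {Nx.Batch.stack([Nx.tensor(numbers)]), :no_client_info}
  end)
  # Postprocessing: convert the batched output tensor back into a plain list.
  |> Nx.Serving.client_postprocessing(fn {output, _metadata}, _client_info ->
    Nx.to_flat_list(output)
  end)

# The caller only deals with plain Elixir data on both ends.
result = Nx.Serving.run(serving, [1, 2, 3])
```

A Bumblebee serving such as the fill-mask task plays the same role: the caller passes raw text and gets labeled predictions back, with tokenization and output extraction hidden inside the serving.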

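Layout

lib/bumblebee.ex: code needed in the preparation stage before a model can be used
lib/bumblebee/**.ex: code that provides servings for each domain, including their preprocessing and postprocessing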

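Selected libraries used internally

Bumblebee.HuggingFace.*: used when loading a model from an external source. Local loading uses the File module (this is the path to focus on).
Jason: used when reading the model specification.
Unpickler: used when loading model parameters. It reads Python pickle files.
Axon: builds the model and runs prediction with it (this is the part to run in CPU mode).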
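
The local loading path mentioned above can be exercised through the {:local, directory} repository form that Bumblebee accepts. The following is a minimal sketch: LocalLoader is a hypothetical helper (not part of Bumblebee), and it assumes the directory already holds the model configuration, parameter file, and tokenizer files downloaded beforehand.

```elixir
defmodule LocalLoader do
  # Hypothetical helper for illustration only; not part of Bumblebee.
  # Loads a model and tokenizer from a local directory instead of the Hub.
  def load(dir) do
    if File.dir?(dir) do
      # Both calls return {:ok, value} or {:error, reason};
      # `with` passes the first error through unchanged.
      with {:ok, model_info} <- Bumblebee.load_model({:local, dir}),
           {:ok, tokenizer} <- Bumblebee.load_tokenizer({:local, dir}) do
        {:ok, model_info, tokenizer}
      end
    else
      {:error, :missing_dir}
    end
  end
end
```

Usage would look like `LocalLoader.load("/path/to/bert-base-uncased")`, where the path is a placeholder for wherever the files were saved.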

ElixirChip adoption policy (code-reading policy)

It is best to narrow down the AI prediction domain and take on part of the preprocessing and postprocessing.

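Future direction

The model-related entry points are summarized below.

Model loading: on the assumption that the model is read from local files, loading goes through get_repo_files
Model prediction: Axon.Compiler.build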
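
To see the prediction entry point in isolation, the sketch below builds and runs a tiny Axon model without Bumblebee. It assumes Axon 0.7 as pinned in the setup cell, and it assumes that the public Axon.build/2 is the supported way to reach the internal Axon.Compiler.build step. The two-input dense model is a made-up example, not a Bumblebee model.

```elixir
# A made-up model: one dense layer over a 2-feature input named "x".
model =
  Axon.input("x", shape: {nil, 2})
  |> Axon.dense(1, activation: :sigmoid)

# Axon.build/2 returns an init function and a predict function.
{init_fn, predict_fn} = Axon.build(model)

# Initialize parameters from an input template (shapes and types only).
params = init_fn.(%{"x" => Nx.template({1, 2}, :f32)}, Axon.ModelState.empty())

# Run prediction on one sample; the result has shape {batch, units}.
output = predict_fn.(params, %{"x" => Nx.tensor([[1.0, 2.0]])})
```

Axon.predict/3, used in the sample code, combines the build and predict steps in a single call, which is convenient for one-off predictions.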

Sample code

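# Load the BERT model and its tokenizer from the Hugging Face Hub
{:ok, bert} = Bumblebee.load_model({:hf, "google-bert/bert-base-uncased"})
{:ok, tokenizer} = Bumblebee.load_tokenizer({:hf, "google-bert/bert-base-uncased"})

# Build a fill-mask serving that bundles tokenization and result labeling
serving = Bumblebee.Text.fill_mask(bert, tokenizer)

text_input = Kino.Input.text("Sentence with mask", default: "The capital of [MASK] is Paris.")

text = Kino.Input.read(text_input)

# Run the end-to-end task on the input sentence
Nx.Serving.run(serving, text)

# Lower-level path: tokenize manually and call the Axon model directly
inputs = Bumblebee.apply_tokenizer(tokenizer, "Hello Bumblebee!")
outputs = Axon.predict(bert.model, bert.params, inputs)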
