Chapter 9: Designing the Network
Chaining Perceptrons
A neural network can be built by chaining two perceptrons in series (the outputs of the first become the inputs of the second), where each perceptron has its own weights and its own sigmoid operation.
This structure is called a multilayer perceptron, or artificial neural network.
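A minimal sketch of this chaining in NumPy; the weight values, layer sizes, and the names sigmoid, w1, and w2 are made up for illustration, not taken from the chapter:

```python
import numpy as np

def sigmoid(z):
    # Squash any real number into the (0, 1) range
    return 1 / (1 + np.exp(-z))

# First perceptron: 3 inputs -> 2 outputs (arbitrary weights)
w1 = np.array([[ 0.2, -0.5],
               [ 0.7,  0.1],
               [-0.3,  0.8]])

# Second perceptron: 2 inputs -> 1 output
w2 = np.array([[ 0.6],
               [-0.4]])

x = np.array([[1.0, 0.5, -1.2]])  # one example with 3 features

h = sigmoid(x @ w1)  # output of the first perceptron...
y = sigmoid(h @ w2)  # ...becomes the input of the second
print(y)             # a single value in (0, 1)
```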
Some definitions:
Nodes - the inputs of each layer
Layer - an arrangement of nodes. In a three-layer neural network, the layers are called the input layer, the hidden layer, and the output layer.
Activation functions - the functions applied between layers (here, the sigmoid)
How many nodes?
Node counts for classifying MNIST:
Input layer:
784 pixels (one per pixel of a 28 × 28 image) + 1 bias = 785 nodes
Hidden layer:
A simple rule of thumb is that the number of hidden nodes falls somewhere between the number of input nodes and the number of output nodes
200 nodes + 1 bias = 201 nodes
Output layer:
10 classes (digits 0-9) = 10 nodes
Weights:
The neural network has two matrices of weights: one between the input and hidden layers, and the other between the hidden and output layers.
A simple general rule for the weight matrix dimensions:
rows of the weight matrix = number of input elements
columns of the weight matrix = number of output elements
Shape of the weights between the input layer (M, 785) and the hidden layer (M, 201) => (785, 201)
Shape of the weights between the hidden layer (M, 201) and the output layer (M, 10) => (201, 10)
(M is the number of examples.)
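A quick dimensional check of those two matrices, sketched in NumPy with random values; M = 60000 (the size of the MNIST training set) and the variable names are assumptions for illustration:

```python
import numpy as np

M = 60000                       # number of training examples (assumed)
X = np.random.rand(M, 785)      # input layer: 784 pixels + 1 bias column

w1 = np.random.randn(785, 201)  # between input and hidden layer
w2 = np.random.randn(201, 10)   # between hidden and output layer

hidden = X @ w1                 # (M, 785) @ (785, 201) -> (M, 201)
output = hidden @ w2            # (M, 201) @ (201, 10)  -> (M, 10)
print(hidden.shape, output.shape)  # (60000, 201) (60000, 10)
```

The rows of each weight matrix match the columns of the matrix feeding into it, which is exactly what matrix multiplication requires.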
Enter the Softmax
Most neural networks replace the last sigmoid function with a function called softmax.
Softmax takes an array of numbers called logits and returns an array of the same size.
$$\text{softmax}(l_i) = \frac{e^{l_i}}{\sum_j e^{l_j}}$$
Can be read as: “The exponential of each logit divided by the sum of exponentials of all the logits”. The sum of its outputs is always 1.
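A sketch of softmax in NumPy; subtracting the maximum logit before exponentiating is a standard numerical-stability trick (it leaves the result unchanged), not part of the formula above:

```python
import numpy as np

def softmax(logits):
    # exp of large logits can overflow; shifting all logits by a
    # constant does not change the ratios, so subtract the max first
    exponentials = np.exp(logits - np.max(logits))
    return exponentials / np.sum(exponentials)

l = np.array([2.0, 1.0, 0.1])
p = softmax(l)
print(p)         # [0.659 0.242 0.099] (rounded)
print(p.sum())   # 1.0
```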