TensorFlow Day16: Implementing an Autoencoder

Today's Goals

Implement an autoencoder
Compare the input with the output

Full, easier-to-read version: Github Ipython Notebook

Implementation

Defining the weight and bias functions

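import tensorflow as tf

def weight_variable(shape, name):
    # Small random initial weights. name is passed by keyword because the
    # second positional argument of tf.Variable is trainable, not name.
    return tf.Variable(tf.truncated_normal(shape = shape, stddev = 0.1), name = name)

def bias_variable(shape, name):
    # Slightly positive constant initial bias.
    return tf.Variable(tf.constant(0.1, shape = shape), name = name)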
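For readers curious about what a truncated normal initializer does, here is a minimal pure-Python sketch of the idea: draw from a normal distribution and re-draw any value that falls more than two standard deviations from the mean. The function name truncated_normal and its seed parameter are my own illustrative choices, and this is not how TensorFlow implements it internally.

```python
import random

def truncated_normal(n, stddev=0.1, mean=0.0, seed=None):
    """Draw n samples from N(mean, stddev^2), re-drawing any sample
    that lands more than two standard deviations from the mean."""
    rng = random.Random(seed)
    samples = []
    while len(samples) < n:
        value = rng.gauss(mean, stddev)
        if abs(value - mean) <= 2.0 * stddev:
            samples.append(value)
    return samples

# With stddev = 0.1, every initial weight ends up inside [-0.2, 0.2].
weights = truncated_normal(1000, stddev=0.1, seed=0)
print(min(weights), max(weights))
```

The point of the truncation is simply that no weight starts out unusually large.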

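Autoencoder architecture

My initial idea was to build a seven-layer neural network: in the encoder the dimension goes from 784 down to 300, 100, and 5 (the code layer), and in the decoder it goes back up through 100 and 300 to 784. Now that I am more familiar with TensorFlow, this is easy to implement, and my code is shown below. Note that the code as written uses a code layer of 20 units rather than 5.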

input -> 784 -> 300 -> 100 -> 5 (code layer) -> 100 -> 300 -> 784 -> output

# Input: a batch of flattened 28x28 images
x = tf.placeholder(tf.float32, shape = [None, 784])

# Encoder: 784 -> 300 -> 100 -> 20
e_W_1 = weight_variable([784, 300], "e_W_1")
e_b_1 = bias_variable([300], "e_b_1")
e_layer1 = tf.nn.relu(tf.matmul(x, e_W_1) + e_b_1)

e_W_2 = weight_variable([300, 100], "e_W_2")
e_b_2 = bias_variable([100], "e_b_2")
e_layer2 = tf.nn.relu(tf.matmul(e_layer1, e_W_2) + e_b_2)

e_W_3 = weight_variable([100, 20], "e_W_3")
e_b_3 = bias_variable([20], "e_b_3")
code_layer = tf.nn.relu(tf.matmul(e_layer2, e_W_3) + e_b_3)

# Decoder: 20 -> 100 -> 300 -> 784
d_W_1 = weight_variable([20, 100], "d_W_1")
d_b_1 = bias_variable([100], "d_b_1")
d_layer1 = tf.nn.relu(tf.matmul(code_layer, d_W_1) + d_b_1)

d_W_2 = weight_variable([100, 300], "d_W_2")
d_b_2 = bias_variable([300], "d_b_2")
d_layer2 = tf.nn.relu(tf.matmul(d_layer1, d_W_2) + d_b_2)

d_W_3 = weight_variable([300, 784], "d_W_3")
d_b_3 = bias_variable([784], "d_b_3")
output_layer = tf.nn.relu(tf.matmul(d_layer2, d_W_3) + d_b_3)
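As a quick sanity check on the layer sizes in the code above (with the 20-unit code layer that the code actually uses), the following pure-Python sketch lists the weight and bias shapes for each layer and counts the trainable parameters. The helper names dense_shapes and count_parameters are hypothetical and only serve this illustration.

```python
# Layer widths taken from the network above: input, encoder, code layer, decoder, output.
layer_sizes = [784, 300, 100, 20, 100, 300, 784]

def dense_shapes(sizes):
    """Return (weight_shape, bias_shape) for each pair of consecutive layers."""
    return [((n_in, n_out), (n_out,)) for n_in, n_out in zip(sizes, sizes[1:])]

def count_parameters(sizes):
    """Total number of weights plus biases across all fully connected layers."""
    return sum(n_in * n_out + n_out for n_in, n_out in zip(sizes, sizes[1:]))

shapes = dense_shapes(layer_sizes)
for w_shape, b_shape in shapes:
    print(w_shape, b_shape)
print("total trainable parameters:", count_parameters(layer_sizes))
```

Each weight shape matches the corresponding weight_variable call, and the total comes to 536,004 parameters, almost all of them in the first encoder layer and the last decoder layer.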

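Loss

For the loss function I used mean squared error. The optimizer was originally GradientDescentOptimizer, but the decoded results turned out very poor (shown below). After searching online I switched to RMSPropOptimizer. Below, several digits are picked at random to look at the results it produces.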
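To make the two ingredients above more concrete, here is a minimal pure-Python sketch (not the TensorFlow code used in this post) of the mean squared error between an input and its reconstruction, next to a plain gradient descent step and an RMSProp-style step on a single parameter. The function names, the decay of 0.9, and the epsilon value are illustrative assumptions; TensorFlow's RMSPropOptimizer has its own defaults and extra options such as momentum.

```python
import math

def mean_squared_error(original, reconstructed):
    """Average of the squared element-wise differences between two equal-length vectors."""
    assert len(original) == len(reconstructed)
    return sum((o - r) ** 2 for o, r in zip(original, reconstructed)) / len(original)

def sgd_step(param, grad, lr):
    """Plain gradient descent: the step is proportional to the raw gradient."""
    return param - lr * grad

def rmsprop_step(param, grad, cache, lr, decay=0.9, eps=1e-10):
    """RMSProp-style step: scale the gradient by a running RMS of recent gradients."""
    cache = decay * cache + (1.0 - decay) * grad ** 2
    param = param - lr * grad / (math.sqrt(cache) + eps)
    return param, cache

# Toy "image" of four pixels and an imperfect reconstruction of it.
x = [0.0, 0.5, 1.0, 0.25]
x_hat = [0.0, 0.25, 1.0, 0.25]
print(mean_squared_error(x, x_hat))

# One update of each kind, starting from the same parameter and gradient.
print(sgd_step(1.0, 2.0, lr=0.1))
print(rmsprop_step(1.0, 2.0, cache=0.0, lr=0.1))
```

Because RMSProp divides by the running gradient magnitude, its step size depends much less on the raw scale of the gradient than plain gradient descent does, which may be part of why it behaved better here.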