Question 1

I am aiming to do big things with TensorFlow, but I'm trying to start small.

I have small greyscale squares (with a little noise) and I want to classify them according to their colour (e.g. 3 categories: black, grey, white). I wrote a little Python class to generate the squares and their one-hot label vectors, and modified TensorFlow's basic MNIST example to feed them in.

But it won't learn anything: e.g. for 3 categories it always guesses ≈33% correct, which is no better than chance.

import tensorflow as tf
import generate_data.generate_greyscale

data_generator = generate_data.generate_greyscale.GenerateGreyScale(28, 28, 3, 0.05)
ds = data_generator.generate_data(10000)
ds_validation = data_generator.generate_data(500)
xs = ds[0]
ys = ds[1]
num_categories = data_generator.num_categories

x = tf.placeholder("float", [None, 28*28])
W = tf.Variable(tf.zeros([28*28, num_categories]))
b = tf.Variable(tf.zeros([num_categories]))
y = tf.nn.softmax(tf.matmul(x,W) + b)
y_ = tf.placeholder("float", [None,num_categories])
cross_entropy = -tf.reduce_sum(y_*tf.log(y))
train_step = tf.train.GradientDescentOptimizer(0.01).minimize(cross_entropy)
init = tf.initialize_all_variables()
sess = tf.Session()
sess.run(init)

# let batch_size = 100 --> therefore there are 100 batches of training data
xs = xs.reshape(100, 100, 28*28) # reshape into 100 minibatches of size 100
ys = ys.reshape((100, 100, num_categories)) # reshape into 100 minibatches of size 100

for i in range(100):
    batch_xs = xs[i]
    batch_ys = ys[i]
    sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys})

correct_prediction = tf.equal(tf.argmax(y,1), tf.argmax(y_,1))
accuracy = tf.reduce_mean(tf.cast(correct_prediction, "float"))

xs_validation = ds_validation[0]
ys_validation = ds_validation[1]
print sess.run(accuracy, feed_dict={x: xs_validation, y_: ys_validation})

My data generator looks like this:

import numpy as np
import random

class GenerateGreyScale():
    def __init__(self, num_rows, num_cols, num_categories, noise):
        self.num_rows = num_rows
        self.num_cols = num_cols
        self.num_categories = num_categories
        # set a level of noisiness for the data
        self.noise = noise

    def generate_label(self):
        lab = np.zeros(self.num_categories)
        lab[random.randint(0, self.num_categories-1)] = 1
        return lab

    def generate_datum(self, lab):
        i = np.where(lab==1)[0][0]
        frac = float(1)/(self.num_categories-1) * i
        arr = np.random.uniform(max(0, frac-self.noise), min(1, frac+self.noise), self.num_rows*self.num_cols)
        return arr

    def generate_data(self, num):
        data_arr = np.zeros((num, self.num_rows*self.num_cols))
        label_arr = np.zeros((num, self.num_categories))
        for i in range(0, num):
            label = self.generate_label()
            datum = self.generate_datum(label)
            data_arr[i] = datum
            label_arr[i] = label
        #data_arr = data_arr.astype(np.float32)
        #label_arr = label_arr.astype(np.float32)
        return data_arr, label_arr
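
As a quick sanity check on what the generator produces, here is a minimal standalone sketch of the same per-square logic. The function name make_square and its defaults are hypothetical (they are not part of the class above), and it is written for Python 3 with NumPy's Generator API rather than the Python 2 style used in the question.

```python
import numpy as np

def make_square(category, num_categories=3, noise=0.05, num_pixels=28 * 28, rng=None):
    """Hypothetical standalone version of generate_datum: one flattened noisy square."""
    rng = np.random.default_rng() if rng is None else rng
    # Target brightness for this category: 0.0 = black, 0.5 = grey, 1.0 = white.
    frac = category / (num_categories - 1)
    # Clip the noise band to the valid [0, 1] brightness range.
    low, high = max(0.0, frac - noise), min(1.0, frac + noise)
    return rng.uniform(low, high, num_pixels)

rng = np.random.default_rng(0)
for category in range(3):
    square = make_square(category, rng=rng)
    # Print the brightness range actually covered by each category.
    print(category, square.min(), square.max())
```

With three categories and noise 0.05, the bands do not overlap: black pixels lie in [0, 0.05], grey in [0.45, 0.55], and white in [0.95, 1].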

Answer

For starters, try initializing your W matrix with random values rather than zeros. When the output is all zeros for every input, you're not giving the optimizer anything to work with.

Instead of:

W = tf.Variable(tf.zeros([28*28, num_categories]))

Try:

W = tf.Variable(tf.truncated_normal([28*28, num_categories],stddev=0.1))
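
To make the suggestion concrete, here is a rough NumPy sketch of the kind of values a truncated-normal initializer produces. It assumes the documented behaviour that samples more than two standard deviations from the mean are dropped and redrawn; the helper below is a hypothetical stand-in for illustration, not TensorFlow's implementation.

```python
import numpy as np

def truncated_normal(shape, stddev=0.1, mean=0.0, seed=None):
    """Hypothetical stand-in: normal samples, redrawn if beyond 2 stddev."""
    rng = np.random.default_rng(seed)
    values = rng.normal(mean, stddev, size=tuple(shape))
    # Find samples further than two standard deviations from the mean.
    out_of_range = np.abs(values - mean) > 2 * stddev
    while out_of_range.any():
        # Redraw only the offending samples, then check again.
        values[out_of_range] = rng.normal(mean, stddev, size=int(out_of_range.sum()))
        out_of_range = np.abs(values - mean) > 2 * stddev
    return values

num_categories = 3  # black, grey, white, as in the question
W_init = truncated_normal([28 * 28, num_categories], stddev=0.1, seed=0)
print(W_init.shape)
print(np.abs(W_init).max())
```

With stddev=0.1, every starting weight is a small non-zero value within ±0.2, so the weights no longer all begin at the same value.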

TensorFlow - why doesn't this softmax regression learn anything?
