Description
To start, download the following code files:
• learn_perceptron.m
• plot_perceptron.m
And the following datasets:
• dataset1.mat • dataset2.mat • dataset3.mat
• dataset4.mat
Attention: some people have notified us that the provided datasets do not load under some versions of Octave. We are providing the same datasets in a different format that will hopefully work with more versions. You can find these files below.
And the following datasets:
• dataset1_ancient_octave.mat • dataset2_ancient_octave.mat • dataset3_ancient_octave.mat
• dataset4_ancient_octave.mat
For those who want to download all of the files together in a zip archive, you get get them here: Assignment1.zip
To run the code, you first need to load a dataset. To do so enter the following command in the Octave console (to load dataset 1): load dataset1
This should load 4 variables:
• neg_examples_nobias – The matrix containing the examples belonging to class 0.
• pos_examples_nobias – The matrix containing the examples belonging to class 1.
• w_init – Some initial weight vector.
• w_gen_feas – A generously feasible weight vector (empty if one doesn’t exist).
The variables have _nobias appended to their names because they do not have an additional column of 1’s appended to them. This is done automatically in the learn_perceptron.m code already.
Now that you have loaded a dataset, you can run the algorithm by entering the following at the Octave console:
w = learn_perceptron(neg_examples_nobias,pos_examples_nobias,w_init,w_gen_feas)
This will start the algorithm and plot the results as it proceeds. Until the algorithm converges you can keep pressing enter to run the next iteration. Pressing ‘q’ will terminate the program. At each iteration it should produce a plot that looks something like this.
The top left plot shows the data points. The circles represent one class while the squares represent the other. The line shows the decision boundary of the perceptron using the current set of weights.
The green examples are those that are correctly classified while the red are incorrectly classified. The top-right plot will show the number of mistakes made by the perceptron. If a generously feasible weight vector is provided (and not empty), then the bottom left plot will show the distance of the learned weight vectors to the generously feasible weight vector.
Currently, the code doesn’t do any learning. It is your job to fill this part in. Specifically, you need to fill in the lines under learn_perceptron.m marked %YOUR CODE HERE (lines 114 and 122). When you are finished, use this program to help you answer the questions below.
Question 1
Which of the provided datasets are not linearly separable? Check all that apply. Dataset 1
Dataset 2 y
Dataset 3
Dataset 4 y
Question 2
True or false: if the dataset is not linearly separable, then it is possible for the number of classification errors to increase during learning.
True y
False
Question 3
True or false: If a generously feasible region exists, then the distance between the current weight vector and a weight vector in the generously feasible region will monotonically decrease as the learning proceeds.
True y
False
Question 4
The perceptron algorithm as implemented and described in class implicitly uses a learning rate of 1. We can modify the algorithm to use a different learning rate α so that the update rule for an input x and target t becomes: w(t)←w(t−1)+α(t−prediction)x,
where prediction is the decision made by the perceptron using the current weight vector w(t−1), given by:
prediction={1 if wTx≥00 otherwise
True or false: if we use a learning rate of 0.5, then the perceptron algorithm will always converge to a solution for linearly separable datasets.
True y
False
Question 5
According to the code, how many iterations does it take for the perceptron to converge to a solution on dataset 1 using the provided initial weight vector w_init?
Note: the program will output
Number of errors in iteration x: 0
You simply need to report x.
1
3
5 y
It doesn’t converge.




Reviews
There are no reviews yet.