If you take a deeper look at the steps of the PLA algorithm, you'll find that setting the learning rate to any positive value gives you an equivalent result (subject to the same sequence of point selections and starting weights scaled by the same factor, of course). The reason is that the misclassification test depends only on the sign of the dot product w·x, and that sign is unchanged when w is multiplied by a positive constant; scaling the learning rate therefore scales every update, and with it the final weight vector, without altering which points get updated or the resulting decision boundary. For instance, if you start with the zero vector, the final weights that you get for learning rate 1 are simply twice the final weights that you get for learning rate 0.5.
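A minimal sketch of this scaling effect is shown below, assuming a deterministic PLA variant that cycles through the points in a fixed order (so both runs see the same update sequence); the `pla` function and the toy data are hypothetical, not from the original text:

```python
import numpy as np

def pla(X, y, eta, max_epochs=1000):
    """Hypothetical PLA sketch: cycle through the points in a fixed
    order, updating w whenever a point is misclassified."""
    w = np.zeros(X.shape[1])               # start from the zero vector
    for _ in range(max_epochs):
        errors = 0
        for xi, yi in zip(X, y):
            if yi * np.dot(w, xi) <= 0:    # misclassified (sign test only)
                w += eta * yi * xi         # update scales linearly with eta
                errors += 1
        if errors == 0:                    # converged: every point correct
            break
    return w

# Tiny linearly separable toy set; first column is a bias term.
X = np.array([[1.0,  2.0,  1.0],
              [1.0,  1.5,  2.0],
              [1.0, -1.0, -1.5],
              [1.0, -2.0, -1.0]])
y = np.array([1, 1, -1, -1])

w_full = pla(X, y, eta=1.0)
w_half = pla(X, y, eta=0.5)
print(w_full, 2 * w_half)                  # identical vectors
assert np.allclose(w_full, 2 * w_half)     # halving eta halves every weight
```

Because the sign test `yi * np.dot(w, xi) <= 0` is invariant to positive scaling of `w`, both runs perform exactly the same sequence of updates, and the assertion holds.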