generate an artificial data set for context bandit algorithm

Question

I want to generate the following artificial dataset to test a contextual bandit algorithm. What is the easiest way to get it done in python may be? Can anyone point me to a link which demonstrates a code for it?

The unit vectors θ1 , ..., θK for K actions are drawn uniformly from Rd . in each iteration t of T complete iterations, a context xt is first sampled from an uniform distribution within ∥x| ≤ 1.

score 0 · Answer 1 · answered May 14 '15 at 06:11

If I understand your question right, you want to generate:

context xt from uniform distribution
a unit vector of K elements indicating which arm to choose with only a single value being set to one, again from a uniform distribution

Both tasks can be easily achieved with the numpy package:

Use numpy.random.uniform to generate values from uniform distribution within any range.
Use numpy.random.randint to generate integers from uniform distribution and then use the generated values to set certain list element to 1.

would this kind of sampling ensure that the l2-norm of the context vector x is less than or equal to 1? — user77005, Jan 15 '16 at 07:29

generate an artificial data set for context bandit algorithm

1 Answers1