How to get distributions of LinearThompsonagent in tf_Agents

Asked Apr 03 '22 at 19:55

Active Nov 21 '22 at 14:49

Viewed 76 times

I am working on contextual bandits in tf_Agents and using the linearUCB agent and leanr thompson sampling agent.

I can get the actions, but not sure how to get the distributions (over actions) out of the agents for a given timestep.

I know linearUCB is deterministic and hence no distribution, but couldn't get the distribution from thompson sampling even with linearthompsonsamplingagent.policy.distribution(timestep). It says distribution are deterministic and the log_probability is blank. Can someone please explain how to get distributions out of it.

asked Apr 03 '22 at 19:55

tjt

We're also facing a similar problem using the tf-agents library for Lin-UCB. Just checking if you were able to find any workaround for this. Did any other policy help with providing a distribution? – sreeraag Nov 22 '22 at 12:40

How to get distributions of LinearThompsonagent in tf_Agents

0 Answers0

Linked