I'm wondering what methods I can use for predicting the probability distribution. The information about the AI model I want to learn is as follows:
Input: Feature vector, Weight vector, Result value (Initial vector is generated by random sampling of the uniform distribution, the Result value is a real number like 120.12)
Assuming that 'score' is assigned by ':= dot(Feature vector, Weight vector)', I want to train an AI model that predicts the probability distribution of the weight vector so that the score value can be approximated. (For minimizing the difference between the score value and result value) From the generated probability distribution, the program calculates the score value using a weight vector by random sampling.
In this case, what AI model can I use to train the most appropriate probability distribution of the weight vector? I want to use reinforcement learning, but I'm not sure if reinforcement learning matches the current situation. What methods can I use?