
This is a semi-broad question, but it's one that I feel on some level is answerable or at least approachable.

I've spent the last month or so building a fairly extensive simulation. To protect the interests of my employer, I won't state specifically what it does... but it can be explained by analogy to... a high school dance.

A girl or boy enters the dance floor, and an optimal partner is chosen from the dancers who are currently free. After a period of time, two dancers finish dancing and become free for a new partnership.

I've been writing partner-selection algorithms designed to maximize the average match outcome without sacrificing too much wait time for a partner.
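To make the analogy concrete, here's a minimal toy sketch of the kind of greedy pairing loop I mean (the dancer representation, scoring function, and fixed dance length are all made up for this question; the real system is much larger and also tracks wait time):

```python
import heapq
import itertools

def match_score(a, b):
    # Placeholder compatibility score: closer parameter vectors score higher.
    return -sum(abs(x - y) for x, y in zip(a, b))

def simulate(arrivals, dance_length=10.0):
    """arrivals: list of (time, params) pairs, sorted by arrival time."""
    free, finishing, outcomes = [], [], []
    counter = itertools.count()  # heap tiebreaker so tuples never compare params
    for t, params in arrivals:
        # Free up couples whose dance ended before this arrival.
        while finishing and finishing[0][0] <= t:
            _, _, a, b = heapq.heappop(finishing)
            free.extend([a, b])
        if free:
            # Greedy choice: best-scoring partner among those currently free.
            best = max(free, key=lambda other: match_score(params, other))
            free.remove(best)
            outcomes.append(match_score(params, best))
            heapq.heappush(finishing, (t + dance_length, next(counter), params, best))
        else:
            free.append(params)  # wait until the next arrival
    return outcomes
```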

I want a way to gauge and compare versions of my algorithms so I can select the optimal algorithm for any situation. This is difficult, however, since the inputs to my simulation are extremely large matrices of input parameters (2-5 per dancer), and the simulation takes several minutes to run (which makes it hard to test a large number of simulation inputs). I have a few output metrics, but linking them to the large number of inputs is extremely hard. I'm also interested in finding which algorithms fail completely under certain input conditions...

Any pro tips / online resources that might help me define input constraints and output variables that would give clarity on an optimal algorithm?

jameselmore
  • I don't get it. Do you have to find the optimal input or the optimal algorithm? Also, your example doesn't seem to add any value to the question. – ElKamina Jul 16 '13 at 19:03
  • In this sort of situation I've generally resorted to brute force. Run hundreds of tests, and extract the relevant numbers from their output files to create charts etc. Parallel make can be very useful. – Patricia Shanahan Jul 16 '13 at 19:09
  • @ElKamina, I've made algorithms and I'd like to test them... but I have way too many inputs to find a relationship between them and each output for each algorithm. That information would be considerably useful in comparing algorithms and their relative flexibility. – jameselmore Jul 16 '13 at 19:51
  • @ElKamina, I guess what I'm getting at is... in the real world I won't have control over the inputs, and they will vary considerably from implementation to implementation... so I'd like to see where my algorithms break as well as test their effectiveness across all scenarios. – jameselmore Jul 16 '13 at 20:13

1 Answer


I might not understand exactly what you want, but here is my suggestion. Let me know if it is inaccurate or irrelevant and I will edit/delete accordingly.

Assume you have a certain metric (say compatibility of the pairs, or waiting time). If all you have is the average or total of this metric over all the users, that is of limited use. Instead, you want the distribution of this metric over all users; at a minimum, keep track of the variance. Once you have the distribution, you can calculate the probability that algorithm A is better than algorithm B on that metric.
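For example, here is a rough sketch (the metric arrays and names are placeholders) of estimating the probability that algorithm A beats algorithm B on the mean of a metric, given per-user samples from a run of each:

```python
import numpy as np

def prob_a_beats_b(metric_a, metric_b, n_boot=10_000, seed=None):
    """Estimate P(mean metric of A > mean metric of B) by bootstrapping.

    metric_a, metric_b: 1-D arrays of the per-user metric (e.g. match
    quality) collected from runs of algorithm A and algorithm B.
    """
    rng = np.random.default_rng(seed)
    a = np.asarray(metric_a, dtype=float)
    b = np.asarray(metric_b, dtype=float)
    wins = 0
    for _ in range(n_boot):
        # Resample each run with replacement and compare the bootstrap means.
        mean_a = rng.choice(a, size=a.size, replace=True).mean()
        mean_b = rng.choice(b, size=b.size, replace=True).mean()
        wins += mean_a > mean_b
    return wins / n_boot

# Hypothetical usage with per-user match-quality samples from two runs:
# p = prob_a_beats_b(quality_A, quality_B)
# print(f"P(A beats B on mean match quality) ~ {p:.3f}")
```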

If you cannot get the distribution of the metric within a single experiment, you can always run multiple experiments; the number of experiments you need depends on the variance of the metric and the size of the difference between the two algorithms.
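As a rough guide, assuming a normal approximation is adequate and that SciPy is available, something like the following estimates how many runs per algorithm you need once you have a few pilot runs (all names here are placeholders):

```python
import math
from statistics import stdev
from scipy.stats import norm

def runs_needed(pilot_a, pilot_b, min_diff, alpha=0.05, power=0.80):
    """Rough estimate of how many simulation runs per algorithm are needed
    to detect a mean difference of `min_diff` in the metric, based on the
    spread observed in a handful of pilot runs of each algorithm.

    Normal-approximation formula: n ~ 2 * (z_{alpha/2} + z_power)^2 * s^2 / d^2
    """
    z_alpha = norm.ppf(1 - alpha / 2)
    z_power = norm.ppf(power)
    s = (stdev(pilot_a) + stdev(pilot_b)) / 2  # crude pooled spread estimate
    return math.ceil(2 * (z_alpha + z_power) ** 2 * s ** 2 / min_diff ** 2)

# Hypothetical usage with the mean match quality from a few pilot runs each:
# print(runs_needed(pilot_means_A, pilot_means_B, min_diff=0.05))
```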

ElKamina
  • That sounds pretty close to what I need to do. I've been tracking the mean and variance of match quality, but an overall distribution would be very effective (those two stats don't tell the whole story). It's looking like I'm going to need to run a lot of simulations no matter what... thanks for the input! – jameselmore Jul 16 '13 at 20:53