Intervals for term structures

Question

For a student learning platform (mathematics) we have managed to include Maxima and evaluate terms/equations/numbers on equivalence. For this we have programmed an algorithm randomly choosing numbers for all the variables and then comparing the two results whether they lead to the same values or not (more mathematically speaking we are seeing the terms as functions and comparing them at specific places).

Now the problem comes: Unfortunately, there must be the possibility to define ranges for coefficients of variables. So e.g. the correct solution [4,5]x^2-[3,4]x at the position x=10 leads to [4,5]*10^2-[3,4]*10. Here we have to find the minimum/maximum of this expression with e.g. the range of 4 to 5 as the coefficient before x^2. I have not been able to do this with native Maxima functions, so I am asking here for help. I am also wondering whether this is possible to combine with other functions such as sin, e etc. or whether this makes the whole optimisation problem too complex (and we should only allow polynomials).

Your help is greatly appreciated!

Best, Leon

Interesting problem. Is the position of x always given (e.g. x = 10 in the example), or is x also a free variable? — Robert Dodier, Dec 30 '21 at 02:24
For x or any other variables randomised numbers are used. So 10 would be an example, but we also use e.g. -23, 49 etc. — Michael Frischauf, Dec 30 '21 at 14:16
Okay, great. Is it also the case that coefficients always appear in a linear combination, e.g. `a1 * f1(x) + a2 * f2(x) + a3 * f3(x) + ...` where f1, f2, f3, etc are expressions in x and not in any coefficients a1, a2, a3, ... . If so, the problem is to maximize a linear combination of coefficients. For general constraints, the maximum value of the linear combination is somewhere on the constraint boundary, and algorithms such as linear programming are applied to find it. — Robert Dodier, Dec 30 '21 at 16:49
However, in this case the constraint region is just the rectangle (or hypercube, more generally) which is the Cartesian product I(a1) x I(a2) x I(a3) x ... where I(ak) is the interval for ak, and "x" means Cartesian product (sorry for the limitations of ASCII notation), and the maximum is reached at a corner of the rectangle or hypercube. In summary, we can find the maximum by just looking at the endpoints of the intervals. I can help you work out a way to do that, if it seem like this is a path that could work for you. (All this is contingent on whether I correctly understood your problem.) — Robert Dodier, Dec 30 '21 at 16:54
There is a linear programming package in Maxima, but it's not clear to me that it would be any simpler to set up the problem and solve it that way, than by creating a solution which works just for this problem. — Robert Dodier, Dec 30 '21 at 16:56
yeah this sound reasonable! However f1(x), f2(x) etc. are just numbers as values are set in there. If this would not been done, we would have a much more complex situation to also allow structures such as [4,5]x^2*y^2-[3,4]x*z etc. But if we e.g. use x=y=z=10, it gets to a more simple structure. — Michael Frischauf, Dec 30 '21 at 18:21
And thank you very much for the help - I have got quite desperate — Michael Frischauf, Dec 30 '21 at 18:21

score 2 · Accepted Answer · answered Dec 31 '21 at 00:29

To summarize what we said in the comments, we have something like sum(a[k]*e[k], k, 1, n) where coefficients a[k] are constrained by intervals I[k] and e[k] is an expression in x. Given that x is a specific value, then the sum is a linear combination of the a[k] and the extreme values are at the corners of the hypercube given by the Cartesian product of the intervals.

A simple solution is to just enumerate the corners of the hypercube and evaluate the sum at each corner, and see which is greatest. (If there are ties, that means that the sum is not actually a function of some coefficient. Given the problem statement, that means the corresponding e[k] is zero. Let's look for and omit such coefficients, then there can only be a unique maximum.)

Here's my attempt at a solution, hope I've understood what's going on and what needs to happen. Assume without checking that a, e, and I are all the same length, namely n.

find_maximum_corner (a, e, I, x, x1) :=
    block ([n, ee, ii_omit, a_omit, ii_keep, a_keep, e_keep, I_keep,
            corners_positions, corners_equations, corners_values,
            maximum_value, ii_maximum_value],
           n: length(a),
           ee: subst (x = x1, sum (a[k]*e[k], k, 1, n)),
           ii_omit: sublist_indices (e, lambda ([e1], subst (x = x1, e1) = 0)),
           a_omit: makelist (a[i], i, ii_omit),
           ii_keep: sublist (makelist (i, i, 1, n), lambda ([i1], not member (i1, ii_omit))),
           a_keep: makelist (a[i], i, ii_keep),
           e_keep: makelist (e[i], i, ii_keep),
           I_keep: makelist (I[i], i, ii_keep),
           corners_positions: apply (cartesian_product_list, I_keep),
           corners_equations: map (lambda ([l], map (lambda ([a1, l1], a1 = l1), a_keep, l)), corners_positions),
           corners_values: map (lambda ([eqs], subst (eqs, ee)), corners_equations),
           maximum_value: lmax (corners_values),
           ii_maximum_value: sublist_indices (corners_values, lambda ([v], v = maximum_value)),
           [maximum_value, corners_equations[ii_maximum_value[1]], a_omit]);

That returns a list comprising the maximum value, the corner at which the sum reaches its maximum, and the list of variables omitted because the corresponding e[k] is zero at x = x1.

This solution makes use of cartesian_product_list which was recently added (in Maxima 5.43). If you are working with a version older than 5.43, I can write out a simple implementation of it.

With this solution I get:

(%i6) find_maximum_corner ([a, b, c], [x, -x^2, x^3], [[3, 4], [-2, 2], [4, 5]], x, 3);
(%o6)          [165, [a = 4, b = - 2, c = 5], []]
(%i7) find_maximum_corner ([a, b, c], [x, -(x - 3)^2, x^3], [[3, 4], [-2, 2], [4, 5]], x, 3);
(%o7)              [147, [a = 4, c = 5], [b]]

the second example showing a variable that drops out because the corresponding expression is zero.

It's not necessary for the expressions e[k] to be polynomials; they can be any functions of x (provided that subst(x = x1, e[k]) simplifies to a number when x1 is a number -- this is the case for most or all of the built-in math functions).

Thank you very much! I will directly look through it :) – Michael Frischauf Jan 01 '22 at 15:33 — Michael Frischauf, Jan 01 '22 at 15:33

Intervals for term structures

1 Answers1