Divide-And-Conquer Algorithm for Trees

Question

I am trying to write a divide & conquer algorithm for trees. For the divide step I need an algorithm that partitions a given undirected Graph G=(V,E) with n nodes and m edges into sub-trees by removing a node. All subgraphs should have the property that they don't contain more than n/2 nodes (the tree should be split as equal as possible). First I tried to recursively remove all leaves from the tree to find the last remaining node, then I tried to find the longest path in G and remove the middle node of it. The given graph below shows that both approaches don't work:

Graph

Is there some working algorithm that does what I want (returns the node H in the above case).

Yes sorry for being unclear here, I can get many subtrees but I don't want one to be bigger than half of the graph to assure I only do a logarithmic count of divide steps. — Listing, Nov 19 '11 at 11:04
One more thing, how do you put "the tree should be split as equal as possible" into a computable value? — Shahbaz, Nov 19 '11 at 11:17
The maximal size of the connected components in G without the removed node should be minimized. — Listing, Nov 19 '11 at 11:19
An algorithm is still considered O(logN) even if it does not exactly halve the tree at each step. What is the importance of exactly halving it? — Jonathan Schneider, Nov 19 '11 at 11:20
I want to divide the tree at most logN times and if you always remove one leaf as the divide step you would have to divide N times which will slow the algorithm down. — Listing, Nov 19 '11 at 11:22
@Listing suppose the graph was such that L moves under D, while M and N move under L, doesn't seem like there would be any possible 8/7 division in that case, is there? — ALTN, Jan 19 '17 at 18:57

score 2 · Accepted Answer · answered Nov 19 '11 at 12:13

2

I think you could do it with an algorithm like this:

Start in the root (if the tree isn't rooted, pick any node).
In each step, try to descend into the child node that has the largest subtree (the number of nodes “below” it is the biggest).
If doing that would make the number of nodes “above” bigger than n/2, stop, otherwise continue with that child.

This algorithm should be O(log n) if the tree is reasonably balanced and we have sizes of subtrees precomputed for each node. If one of those conditions doesn't apply, it would be O(n).

answered Nov 19 '11 at 12:13

svick

236,525
50
385
514

What is a root in an undirected tree? Also how do I know how big the subtrees are? – Listing Nov 19 '11 at 12:38
Like I said, if you don't have a root given, you can pick any node as the root. And to know how big the subtrees are, you have to calculate that, ideally caching the result, so that you don't have to count it multiple times. – svick Nov 19 '11 at 12:47
This is certainly more than O(n), imagine you start from node A in the example. You would first scan the whole subtree, then move to B then to C and so on and each time you scan the whole subtree which gives higher times. – Listing Nov 19 '11 at 12:55
You don't need to scan the whole subtree, you just need to know the counts. And you can compute all of them in O(n) the first time you need one. – svick Nov 19 '11 at 13:09
Now I get it, I accepted this because it seems much easier to implement. – Listing Nov 19 '11 at 13:36

Saeed Amiri · Answer 2 · 2011-11-19T12:47:21.327

One exact algorithm is as this,

Start from leafs and create disjoint graphs (in fact all are K1), in each step find the parent of this leafs, and merge them into new tree, in each step if node x has r known child and degree of node is j such that j = r+1, simply a node which is not in child node of x is parent of current node in this case we say node x is nice, else, there are some child such that related rooted subtree of them not constructed, in this case we say the node x is bad.

So in each step connect nice nodes to their related parent, and it's obvious each step takes sum of {degree of parent nice nodes} also in each step you have at least one nice node (cause you start from leaf), So the algorithm is O(n), and it will be done completely, but for finding node which should be removed, In fact in each step is required to check size of a dijoint list (subtree lists), this can be done in O(1) in construction, Also if the size of list is equal or bigger than n/2, then select related nice node. (in fact find the nice node in minimum list which satisfies this condition).

Obvious thing is that if is possible to divide tree in good way (each part has at most n/2 node) you can done it by this algorithm, but if is not so ( in fact you cant divide it in two or more part of size smaller than n/2) this gives you good approximation for it. Also as you can see there is no assumption in input tree.

note: I don't know is possible to have a tree such that it's impossible to partition it into some parts of size smaller than n/2 by removing one node.

score 0 · Answer 3 · answered Jan 31 '19 at 17:12

Here is a method that I used and tested.

Begin by finding the root of your tree, you can do this by creating a set with all the nodes in them and then making another array NeighboursNumber[] with the number of neighbours of each node stored the corresponding index. Then iterate through the set and eliminate the leafs ( nodes i which have NeighboursNumber[i] == 1 ) , make sure you add those nodes to another set RemovedSet ( to avoid updating problems ) and then after each iteration go through the RemovedSet and decrease the NeighboursNumber[] entry for all neighbours of each element of the set. At the end you will have the root node . ( make sure you implement the case were 2 nodes with 1 neighbours are left ) . After finding the root we proceed to find the size of each subtree. The trick here is to do this while looking for the root :Before eliminating the leafs at the first iteration, start by creating an array SubTreeSize[] and each time we removing a node from the our set, we add the value of that node + 1 to the value of the parent : SubTreeSize[parent] = SubTreeSize[parent] + SubTreeSize[removedNode] + 1 ; this way, when we find the root , we also have the size of each subtree. Then we start from the root and check each neighbour it its subtree's size + 1 > nodes / 2 , if yes then we choose that node and start again. When all the children nodes sizes are <= nodes / 2 , break your loop and output that node.

This method took less than a second for a tree with 10^5 nodes.

moooeeeep · Answer 4 · 2011-11-19T13:36:08.577

This problem seems similar to finding the center of mass of an object. Assume each of your nodes is a point mass of equal mass (weight) and its position is given by the position in the graph. Your algorithm tries to find the center of mass, i.e. the node that has a similar accumulated weight of nodes in all connected sub-trees.

You may compute the accumulated weights on all sub-trees for each node. Then choose the one that is most balanced, s.t. no sub-tree weighs more than n/2. Probably this is a task for some dynamic programming.

Divide-And-Conquer Algorithm for Trees

4 Answers4

Linked