Solving problems recursively

Recursion offers a direct way to implement an exhaustive search that considers, one step at a time, all possibilities for arriving at a solution. As such, it is a natural tool for solving search problems.

Here we introduce the basic concepts of recursive search by pursuing one case study. Search problems of this type are studied in more detail in, for example, the course CS-E3200, Discrete Models and Search.

The subset sum problem

As our case study, let us consider the following mathematical problem:

Note

Subset Sum. Given as input a set \(W = \{w_1,w_2,...,w_n\}\) of \(n\) integers (the steps) and an integer \(t\) (the target), is there a subset \(S \subseteq W\) with \(\sum_{z\in S}z = t\)?

Example 1. Suppose the steps are \(W = \{−9,−3,7,12\}\) and the target is \(t = 10\). Then, we have a solution \(S=\{-9,7,12\}\). Indeed, \(-9+7+12=10\).

Example 2. If the steps are \(W = \{−9,−3,7,12\}\) and the target is \(t = 6\), then there is no solution.

The examples above were of course very small and easily solvable even by hand calculation. So let us look at a somewhat larger example:

Example 3. Suppose that \(W = \{4,-5,9,17,20,11,-17,63,255,-97,66,42,-33\}\) and \(t = 27\).

This already looks like something that will take some effort to solve manually, so let us rather automate the process of searching for a solution.

Clearly, to find a solution to a given instance \((W,t)\), it suffices to exhaustively search through all possible subsets of \(W\) for a subset \(S\subseteq W\) with \(\sum_{z\in S}z = t\). It turns out that recursion is a very natural programming strategy to implement such an exhaustive search.

Recursive search for a solution

Suppose we are searching for a solution to an instance consisting of the steps \(W=\{w_1,w_2,\ldots,w_n\}\) and the target \(t\). For \(n=0\), there is a solution (the empty set) if \(t=0\); otherwise there is no solution. For \(n\geq 1\), take any step, say, take \(w_n\in W\). We now observe that if there is a solution \(S\), then either

  1. \(w_n\) does not occur in \(S\) and the set \(W \setminus \{w_n\} = \{w_1,w_2,\ldots,w_{n-1}\}\) has a subset that sums to \(t\), or

  2. \(w_n\) does occur in \(S\) and the set \(W \setminus \{w_n\} = \{w_1,w_2,\ldots,w_{n-1}\}\) has a subset that sums to \(t - w_n\).

That is, we can reduce the original problem instance (of size \(n\geq 1\)) to two new instances (of size \(n-1\)). The original has a solution if either (or both) of the new instances has a solution. If neither of the new instances has a solution, then the original does not have a solution either.
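
In symbols, the decomposition reads as follows: a solution exists for the instance \((W,t)\) if and only if a solution exists for at least one of the two smaller instances,

\[\exists\, S\subseteq W:\ \sum_{z\in S}z = t \quad\Longleftrightarrow\quad \exists\, S'\subseteq W\setminus\{w_n\}:\ \sum_{z\in S'}z = t \quad\text{or}\quad \exists\, S'\subseteq W\setminus\{w_n\}:\ \sum_{z\in S'}z = t-w_n.\]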

We do not know in advance which of the two cases occurs (if either), so we try out both, that is, we exhaustively search both new instances. These new instances are then solved, in turn, recursively, in the very same way; for example, \(W \setminus \{w_n\} = \{w_1,w_2,\ldots,w_{n-1}\}\) has a subset \(S\) that sums to \(t - w_n\) if either

  1. \(w_{n-1}\) does not occur in \(S\) and the set \(W \setminus \{w_{n-1},w_n\} = \{w_1,w_2,\ldots,w_{n-2}\}\) has a subset that sums to \(t-w_n\), or

  2. \(w_{n-1}\) does occur in \(S\) and the set \(W \setminus \{w_{n-1},w_n\} = \{w_1,w_2,\ldots,w_{n-2}\}\) has a subset that sums to \(t - w_{n-1} - w_n\).

And so on, recursively, until we are at a base case with \(n=0\), and it is trivial to decide whether a solution exists.

Implementation in Scala

Let us implement this procedure as follows. Again, we use an inner function so that the elementSelector function, which selects the element to branch on in the current set, does not need to be passed as an argument in every recursive call.

  /** Select an arbitrary element in w */
  def selectElementSimple(w: Set[Int], t: Int) =
    require(!w.isEmpty)
    w.head
  /**
   * Solve the subset sum problem with recursion.
   * The argument function elementSelector is a function that, when called
   * with a non-empty set w and value t, returns an element in w.
   */
  def solve(steps: Set[Int], target: Int, elementSelector: (Set[Int], Int) => Int = selectElementSimple): Option[Set[Int]] =
    def inner(w: Set[Int], t: Int): Option[Set[Int]] =
      if t == 0 then
        // An empty set sums up to t when t = 0
        Some(Set[Int]())
      else if w.isEmpty then
        // An empty set cannot sum up to t when t != 0
        None
      else // in every other case
        // Select one element in the set
        val e = elementSelector(w, t)
        val rest = w - e
        // Search for a solution without e
        val solNotIn = inner(rest, t)
        // If there is a solution return it
        if solNotIn.nonEmpty then solNotIn 
        else // otherwise
        // Search for a solution with e
          val solIn = inner(rest, t - e)
          // If there is one, make sure to include e
          if solIn.nonEmpty then Some(solIn.get + e)
          else None // No solution found here, backtrack
    end inner
    inner(steps, target)
  end solve

Now we have, as expected, the following:

scala> solve(Set(-9,-3,7,12), 10)
res: Option[Set[Int]] = Some(Set(12, 7, -9))

scala> solve(Set(-9,-3,7,12), 6)
res: Option[Set[Int]] = None

Here is an illustration of the tree of recursive function calls for the case \(W = \{-9,-3,7,12\}\) and \(t = 10\). Observe that the calls proceed from top to bottom, left branch first. A red node indicates that the search detected that the current set \(W\) cannot sum to the current target \(t\). A green node indicates that the current target value \(t\) is 0 and thus a solution was found. The nodes in gray depict calls that were never made because a solution was found earlier.

_images/round07-sss-noprune-yes.svg

When we consider the same set of steps \(W = \{-9,-3,7,12\}\) but with a different target \(t = 6\), there is no solution and the algorithm exhaustively searches through all the possible subsets:

_images/round07-sss-noprune-no.svg

We observe that when an instance has no solution, this algorithm makes \(2^{|W|+1}-1\) calls to the function inner. Thus, for such instances, the algorithm runs in time \(\Omega(2^n)\), where \(n\) is the number of steps in \(W\). (For instances that do have a solution, the algorithm can of course get lucky and find a solution after only a few calls.)
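
To see this count concretely, one can instrument the search with a call counter. Below is a sketch of our own (the helper countCalls is not part of the material above; it fixes w.head as the selector and otherwise mirrors the inner function of solve):

  /**
   * A sketch (our own helper) that counts the number of calls
   * the unpruned search makes on a given instance.
   */
  def countCalls(steps: Set[Int], target: Int): Long =
    var calls = 0L
    def inner(w: Set[Int], t: Int): Option[Set[Int]] =
      calls += 1
      if t == 0 then Some(Set[Int]())
      else if w.isEmpty then None
      else
        val e = w.head // arbitrary selection, as in selectElementSimple
        val rest = w - e
        // Search without e first; only if that fails, search with e
        inner(rest, t).orElse(inner(rest, t - e).map(_ + e))
    end inner
    inner(steps, target)
    calls
  end countCalls

For instance, on the unsolvable instance above:

scala> countCalls(Set(-9,-3,7,12), 6)
res: Long = 31

Indeed, \(31 = 2^{4+1}-1\), as the formula predicts.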

Pruning the recursion tree

We now illustrate how detecting early that the current instance has no solution can help in pruning the recursion tree, that is, in reducing the number of recursive calls the algorithm makes.

Let us take a closer look at the large search tree for the instance \(W = \{-9,-3,7,12\}\) and \(t = 6\) above. In some nodes it is easy to see that the current instance does not have a solution. For example, when \(W'=\{12\}\) and \(t'=-1\), we see immediately that there are no negative numbers and thus there is no way to reach the negative target \(-1\). Similarly, when \(W'=\{12\}\) and \(t'=15\), the positive number(s) in \(W'\) cannot sum to \(15\) even if we take all of them.

To translate these observations into general rules: if the sum of all the positive numbers in the current set is less than the current target, there is no solution. Similarly, if the sum of all the negative numbers is greater than the target, there is no solution. We can implement these rules in our method solve by adding this case right after the case “else if w.isEmpty ...”:

  /**
   * Solve the subset sum problem with recursion.
   * The argument function elementSelector is a function that, when called
   * with a non-empty set w and value t, returns an element in w.
   */
  def solve(steps: Set[Int], target: Int, elementSelector: (Set[Int], Int) => Int = selectElementSimple): Option[Set[Int]] =
    def inner(w: Set[Int], t: Int): Option[Set[Int]] =
      if t == 0 then
        // An empty set sums up to t when t = 0
        Some(Set[Int]())
      else if w.isEmpty then
        // An empty set cannot sum up to t when t != 0
        None
      else if w.filter(_ > 0).sum < t || w.filter(_ < 0).sum > t then
        // The positive (negative) numbers cannot add up (down) to t
        None
      else // in every other case
        // Select one element in the set
        val e = elementSelector(w, t)
        val rest = w - e
        // Search for a solution without e
        val solNotIn = inner(rest, t)
        // If there is a solution return it
        if solNotIn.nonEmpty then solNotIn 
        else // otherwise
        // Search for a solution with e
          val solIn = inner(rest, t - e)
          // If there is one, make sure to include e
          if solIn.nonEmpty then Some(solIn.get + e)
          else None // No solution found here, backtrack
    end inner
    inner(steps, target)
  end solve

We rerun the algorithm on the same instance and, much to our satisfaction, get a smaller search tree:

_images/round07-sss-prune-no.svg

Of course, these additional rules can also help when a solution exists. Indeed, here is the new smaller search tree for the case we considered earlier:

_images/round07-sss-prune-yes.svg

Note

As a general principle, such pruning rules can be used to reduce the number of recursive calls that a search algorithm makes.

However, the rules do not come for free: it takes time to evaluate them, and if this investment in time does not lead to savings elsewhere (less time spent searching), we end up with an algorithm that, as a whole, runs slower than the algorithm without pruning rules.

A good strategy is to use expensive rules near the top of the tree (where the savings can be substantial if large subtrees are eliminated) and only cheap rules, or no pruning at all, in the lower parts of the tree (where speed is of the essence, since there are typically far more nodes at the lower levels than at the higher levels).
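
As a concrete illustration of this strategy, here is a sketch of a size-aware variant of the search (our own illustrative variant, not part of the course code: it uses w.head as the selector, and the cutoff value 5 is arbitrary) that evaluates the relatively expensive sum-based rule only while the current set is still large, that is, near the top of the tree:

  /**
   * A sketch of size-aware pruning (our own illustrative variant): the
   * sum-based rule is evaluated only near the top of the search tree,
   * where the current set w is still large. The cutoff 5 is arbitrary.
   */
  def innerSizeAware(w: Set[Int], t: Int): Option[Set[Int]] =
    if t == 0 then Some(Set[Int]())
    else if w.isEmpty then None
    else if w.size > 5 && (w.filter(_ > 0).sum < t || w.filter(_ < 0).sum > t) then
      None // prune only while w is large; skip the check deeper down
    else
      val e = w.head
      val rest = w - e
      innerSizeAware(rest, t).orElse(innerSizeAware(rest, t - e).map(_ + e))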

Branching heuristics

Another common design consideration with recursive search is how to branch. For example, with Subset Sum, we can take any element from \(W\) and branch by (i) not including, and (ii) including that element. Since we have this freedom to choose any element we like, it is a good idea to put a little thought into how we choose our element and how this choice affects performance.

Indeed, rather than choosing an arbitrary element, perhaps one could choose more wisely. For example, we could choose an element that greedily moves toward the target value \(t\): if \(t\) is positive, then choose the maximum value in \(W\), otherwise choose the minimum value in \(W\).

This greedy choice in particular appears to be compatible with our earlier pruning rules; perhaps more pruning results if we greedily choose the element on which we branch?

Here is an implementation for greedy selection:

  /** Select an element in w in a greedy way */
  def selectElementGreedy(w: Set[Int], t: Int) =
    require(!w.isEmpty)
    if t > 0 then w.max
    else w.min
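
The selector is passed to solve as its third argument. For instance, on the unsolvable instance with \(t=6\) the answer is, of course, still None:

scala> solve(Set(-9,-3,7,12), 6, selectElementGreedy)
res: Option[Set[Int]] = None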

Here is the search tree when we run this new version of the algorithm on our old example with \(t=6\) (no solution):

_images/round07-sss-prune-no-greedy.svg

In this particular case it appears we were lucky and the greedy choice does in fact help. To systematically explore whether (or when) one selection strategy is better than another, however, one should conduct a far more extensive set of experiments on a large number of instances. Furthermore, one should try to identify (by reasoning and mathematical analysis) the worst-case instances for a particular selection strategy. For example, maybe being greedy is, for some instances, a horribly bad idea?

Note

Observe that we did not provide any sound mathematical reasoning to justify that greedy selection would always be good or optimal. More generally, functions that decide how we branch in a recursive search are called branching heuristics (“rules of thumb”) precisely because their general performance is difficult to analyse. (Indeed, if we completely understood how the search will behave before executing it, we would not need to search in the first place: we could aim directly for the solution!) In fact, in many cases the task of finding the provably best element to branch on is as difficult a problem as the whole original search problem.

Too complicated, and for no reason?

Observe that the scala.collection.immutable.Set class has the subsets method, which returns an iterator that can be used to enumerate all the subsets of the set and to test whether one of them sums up to \(t\). Using this method, we can write a very concise program for solving subset sum:

def solve(s: Set[Int], target: Int): Option[Set[Int]] =
  s.subsets.find(_.sum == target)

So why did we go through all the pain with recursion, given that such a wonderfully concise solution is available? (Apart, of course, from the fact that we are studying recursion and its applications.)

Observe that the find method of the iterator returned by the subsets method goes through all the subsets blindly until a solution (if one exists) is found. In contrast, as our latest search tree above shows, already our rather rudimentary pruning rule allows the recursive method to exclude many subsets from consideration.

Furthermore, recursive search is a programming pattern that, once mastered, enables you to attack many other problems far more efficiently than merely iterating over all subsets, tuples, or permutations, and for each checking whether you have accidentally stumbled on a solution.

A word on hard problems and NP-completeness

Although the pruning technique and the branching heuristics described above allow the recursive method to avoid some candidate solutions, they do not guarantee that the run time of the algorithm is small (or even reasonable) for all instances.

When we increase the instance size, the algorithm takes longer and longer to run. For example, you may want to try out what happens when

\[t = 24780, \, W = \left\{2245, 1585, 2245, 1024, 1321, 199, 2674, 2674, 1915, 133, 1222, 562, 2575, 1882, 2047, 2278, 2080, 1981, 2707, 232, 2938, 397, 2641, 1948, 2839, 727, 3136, 1486, 3268, 595\right\}\]

or when

\[t = 92105, \, W = \left\{316, 5356, 190, 1261, 4474, 4159, 6238, 442, 1702, 820, 3214, 2962, 1765, 5419, 6049, 4852, 4285, 5356, 4348, 1072, 5167, 2899, 1198, 6301, 2962, 2773, 4096, 5860, 3718, 6931, 1513, 4348, 1954, 5104, 4474, 3529, 4348, 3655, 5671, 1261, 6238, 3340, 1450, 1513, 1261, 1261, 5797, 4159, 3718, 5923\right\}.\]

(In case you get tired of waiting, remember that you can interrupt the current execution in the console with Ctrl+C, or simply shut it down.)

Of course, our small program is not completely bad: it can solve some very large instances very fast. For example, when \(t > \sum_{z \in W,\, z>0}z\), that is, when the target exceeds the sum of all the positive steps, the pruning rule fires at the very first call and reports immediately that there is no solution.
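
For instance (an instance of our own choosing, for illustration), the elements of the set \(\{1,2,\ldots,1000\}\) sum to \(500500\), so the pruned solve rejects the target \(1000000\) already at the root of the search tree:

scala> solve((1 to 1000).toSet, 1000000)
res: Option[Set[Int]] = None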

So what gives, why did we put all this effort into studying a bad algorithm? Surely a better, more clever algorithm could solve any instance of subset sum that you can, say, comfortably print on a few pieces of paper, faster than you can blink your eye?

Currently, no one knows whether this is possible: no such algorithm is known.

NP-completeness. As innocent as the problem statement may appear, the subset sum problem is (when we allow an arbitrary number of arbitrarily large integers in the instances) one of the so-called NP-complete problems. These problems have in common the following features:

  1. given a candidate solution \(S\), it is easy to verify whether \(S\) really is a solution (for Subset Sum, see the sketch after this list),

  2. we do not know any method that would always (for an arbitrary instance given as input) solve the problem efficiently, which here means “in time that grows polynomially in the size of the instance [which is measured in bits under some very reasonable assumptions] for all instances”, and

  3. if we could solve any one of these problems efficiently, we could solve all of them efficiently.
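
To make feature 1 concrete for Subset Sum: verifying a candidate solution requires only a subset check and a summation, which is fast even for large instances. Here is a minimal sketch (verify is our own hypothetical helper, not part of the material above):

  /**
   * Check whether s is a subset of w that sums to t.
   * (A sketch of our own, for illustration only.)
   */
  def verify(w: Set[Int], t: Int, s: Set[Int]): Boolean =
    s.subsetOf(w) && s.sum == t

For example, on Example 1 above:

scala> verify(Set(-9,-3,7,12), 10, Set(-9,7,12))
res: Boolean = true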

A large number of NP-complete problems have been identified and extensively studied, yet no efficient algorithm (for any of the problems) has been found, and neither has a mathematical proof been found that would show that NP-complete problems do not admit efficient algorithms. Bluntly, we simply do not know, despite decades of effort, and despite the fact that a great number of problems of substantial practical relevance are NP-complete.

Hardness and undecidability. Beyond NP-completeness, there are problems that are even more difficult to solve by computer (which can be established by mathematical proof!) and also many problems that provably cannot be solved by computers! (More on these topics in the courses CS-C2160 Theory of Computation and CS-E4530 Computational Complexity Theory.)