Algorithms

April 2021

An algorithm is a computational procedure that takes some values as input and produces some values as output. It is a tool for solving a well-specified computational problem. An instance of a problem consists of the input needed to compute a solution to the problem. An algorithm is correct if, for every input instance, it halts with the correct output. For example, an instance of the sorting problem is the sequence (31, 59, 41), and a correct sorting algorithm must halt with the output (31, 41, 59).

An algorithm may be specified in English or as a computer program.

What are some common problems that are solved by algorithms? Examples include designing data structures, cryptography, resource allocation, shortest-path finding, pattern matching, and topological sorting. All of these problems have many candidate solutions, but finding one that is both correct and efficient can be challenging.

One class of problems, called NP-complete problems, has no known efficient solution, and no one knows whether efficient solutions exist. NP-complete problems arise surprisingly often in practice; the traveling salesman problem is one example.

Modern processors have multiple cores. Algorithms that exploit parallelism are called multithreaded algorithms.
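
As a minimal sketch of exploiting multiple cores (my own example, not from these notes; it uses processes rather than threads, because CPython threads do not run Python bytecode in parallel):

    from concurrent.futures import ProcessPoolExecutor

    def partial_sum(chunk):
        """Sum one chunk of the data; runs in a worker process."""
        return sum(chunk)

    if __name__ == "__main__":
        data = list(range(1_000_000))
        mid = len(data) // 2
        with ProcessPoolExecutor() as pool:
            # Divide the work across cores and combine the partial results.
            total = sum(pool.map(partial_sum, [data[:mid], data[mid:]]))
        print(total)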

An efficient algorithm is one that runs quickly and uses little memory. Algorithms should be regarded as a technology: total system performance depends as much on choosing efficient algorithms as on choosing fast hardware.

Proving Correctness

One technique for proving correctness is to use loop invariants. A loop invariant is a property of a program loop that is true at the start of each iteration. To prove correctness using loop invariants, we must show three things (illustrated in the sketch below):

  1. Initialization: the invariant is true prior to the first iteration of the loop
  2. Maintenance: if the invariant is true before an iteration, it remains true before the next iteration
  3. Termination: when the loop terminates, the invariant gives a useful property that helps show the algorithm is correct
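
As a concrete sketch (in Python; the language is my choice, not specified in these notes), here is insertion sort annotated with the invariant it maintains:

    def insertion_sort(a):
        """Sort the list a in place and return it."""
        for j in range(1, len(a)):
            # Loop invariant: a[0:j] consists of the elements originally
            # in a[0:j], in sorted order.
            key = a[j]
            i = j - 1
            # Shift elements of the sorted prefix that are larger than
            # key one position to the right.
            while i >= 0 and a[i] > key:
                a[i + 1] = a[i]
                i -= 1
            a[i + 1] = key
        return a

Initialization holds because the one-element prefix a[0:1] is trivially sorted; maintenance holds because each iteration inserts a[j] into its correct place in the sorted prefix; and at termination j = len(a), so the invariant says the entire list is sorted.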

Algorithm Analysis

Analyzing an algorithm means predicting the time and space resources that the algorithm requires. Most discussions of algorithms assume a single-processor, random-access machine (RAM) model of computation.

The running time of an algorithm is the number of primitive operations executed, where each operation takes a constant amount of time. Running time is typically described as a function of the input size. The definition of input size depends on the problem: for sorting, it is the number of items; for multiplying integers, it is the number of bits needed to represent the input.
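
To make counting primitive operations concrete, here is an illustrative Python sketch (my own example, not from the original notes) that counts the comparisons a linear search performs:

    def linear_search(a, target):
        """Return (index or None, number of comparisons performed)."""
        comparisons = 0
        for i, x in enumerate(a):
            comparisons += 1  # one comparison per element examined
            if x == target:
                return i, comparisons
        return None, comparisons

    # The count grows linearly with the input size n: searching for a
    # missing element in a list of n items always costs n comparisons.
    _, count = linear_search(list(range(1000)), -1)
    print(count)  # prints 1000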

Most analyses of algorithms find the worst-case running time, because the worst case gives an upper bound on the running time for any input. Furthermore, we are usually interested in the rate of growth of the worst-case running time. This kind of worst-case, order-of-growth analysis is called asymptotic analysis: we ask how the running time behaves as the input size grows without bound. We therefore consider only the leading term of the function, ignoring its coefficient and lower-order terms; for example, a running time of 3n^2 + 10n + 5 grows asymptotically like n^2.

Big O denotes an asymptotic upper bound, big omega an asymptotic lower bound, and big theta an asymptotically tight bound. To say that the running time (with no modifier) of an algorithm is big omega of g(n) is equivalent to saying that its best-case running time is lower bounded by g(n). Similarly, the running time of an algorithm is big O of g(n) if and only if its worst-case running time is upper bounded by g(n). For example, insertion sort's running time is big omega of n (an already-sorted input) and big O of n^2 (a reverse-sorted input).

An algorithm is more efficient than another if its worst-case running time has a lower order of growth.

Divide-and-Conquer

Divide-and-conquer is a method for solving problems recursively. The technique has three steps (see the sketch after this list):

  1. Divide the problem into a number of subproblems that are smaller instances of the same problem
  2. Conquer the subproblems by solving them recursively (or directly, if they are small enough)
  3. Combine the solutions to the subproblems into a solution for the original problem
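
As an illustration (merge sort is a canonical example, though the notes do not spell one out), here is the three-step pattern in Python:

    def merge_sort(a):
        """Return a sorted copy of the list a."""
        if len(a) <= 1:
            return a                  # base case: already sorted
        mid = len(a) // 2
        left = merge_sort(a[:mid])    # divide, then conquer each half
        right = merge_sort(a[mid:])
        merged, i, j = [], 0, 0       # combine: merge the sorted halves
        while i < len(left) and j < len(right):
            if left[i] <= right[j]:
                merged.append(left[i])
                i += 1
            else:
                merged.append(right[j])
                j += 1
        merged.extend(left[i:])
        merged.extend(right[j:])
        return merged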

Recurrences give us a natural way of characterizing the running times of divide-and-conquer algorithms. To obtain asymptotic bounds on the solution of a recurrence, we can use one of three methods:

  1. Substitution method: guess and verify using induction
  2. Recursion-tree method: construct a tree and use bounding summations
  3. Master method: a cookbook technique for solving recurrences of the form T(n) = a*T(n/b) + f(n), where a >= 1 and b > 1 (see the worked example below)
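
As a quick worked example of the master method: merge sort satisfies T(n) = 2*T(n/2) + f(n) with f(n) proportional to n. Here a = 2 and b = 2, so n^(log_b a) = n, which matches the growth of f(n); this is the "tight" middle case of the master theorem, giving T(n) = big theta of n log n.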