Trees

Tree definitions and terminology

What is a tree?

A tree is an undirected graph that is:

Connected (every node can reach every other node)
Acyclic (contains no cycle)

Equivalent definitions which some problems use:

A graph with n nodes is a tree iff it is connected and has n - 1 edges.
A graph with n nodes is a tree iff it is acyclic and has n - 1 edges.
A graph is a tree iff between any two nodes in the graph, there is exactly one simple path.
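As a small illustration of the first equivalent definition, here is a minimal sketch that checks "connected and n - 1 edges". The function name `is_tree`, the 0-indexed node labels, and the edge-list input format are assumptions made for this example, and it assumes n is at least 1.

```python
from collections import deque

def is_tree(n, edges):
    """Return True if the undirected graph on nodes 0..n-1 is a tree."""
    # A tree on n nodes must have exactly n - 1 edges.
    if n == 0 or len(edges) != n - 1:
        return False
    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    # BFS from node 0 and count how many nodes we can reach.
    seen = [False] * n
    seen[0] = True
    queue = deque([0])
    reached = 1
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if not seen[v]:
                seen[v] = True
                reached += 1
                queue.append(v)
    # Connected with n - 1 edges means the graph is a tree.
    return reached == n
```

Checking the edge count first keeps the BFS from ever running on a graph with too many edges.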

A forest is a collection of trees.

When a problem explicitly gives you a tree, it usually comes as either an edge list or a parent array (which implies the tree is rooted).

Rooted vs unrooted trees

The definition above is that of an unrooted tree. However, it is often useful to root the tree at a specific node. You can think of this as drawing the root at the top, the root's neighbors (its children) on the level below, their remaining neighbors below that, and so on.

In contest problems, a very standard move is to root an unrooted tree to make the problem easier to reason about.
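Rooting can be sketched as a single traversal from the chosen root that records each node's parent. The name `root_tree`, the 0-indexed labels, and the use of `-1` for "no parent" are assumptions of this example, not something fixed by the notes.

```python
from collections import deque

def root_tree(n, edges, root=0):
    """Turn an edge list into a parent array and child lists for a given root."""
    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    parent = [-1] * n              # -1 marks the root (it has no parent)
    children = [[] for _ in range(n)]
    order = []                     # nodes in BFS order, parents before children
    visited = [False] * n
    visited[root] = True
    queue = deque([root])
    while queue:
        u = queue.popleft()
        order.append(u)
        for v in adj[u]:
            if not visited[v]:
                visited[v] = True
                parent[v] = u      # u is the node "directly above" v
                children[u].append(v)
                queue.append(v)
    return parent, children, order
```

The returned `order` lists every parent before its children, so iterating over it in reverse is one way to process children before parents without recursion.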

More Tree Terminology

Degree: number of edges incident to a node
Leaf:
- In an unrooted tree: a node with degree $1$
- In a rooted tree: a node with no children
Parent of a node: the unique node directly above it on the path from it to the root. The root has no parent.
Child: a is a child of b if b is a's parent
Ancestor/Descendant: these definitions extend naturally from those of parent/child. We say a is an ancestor of b iff a lies on the path from b to the root, and a is a descendant of b iff b is an ancestor of a.
Depth of a node (rooted): distance from the root in number of edges
Height of a tree (rooted): maximum depth of a node in the tree
Subtree of v (rooted): v plus all descendants of v
Diameter: the largest distance between any two nodes, i.e. the length of the longest simple path in the tree
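To make the diameter definition concrete, here is a sketch of the well-known "two BFS" technique, which these notes do not describe themselves: BFS from any node to find a farthest node a, then BFS from a, and the largest distance found is the diameter. The helper names `farthest` and `diameter`, the adjacency-list input, and unit edge lengths are assumptions of this example.

```python
from collections import deque

def farthest(adj, start):
    """Return (node, distance) for a node farthest from start, counting edges."""
    dist = [-1] * len(adj)
    dist[start] = 0
    queue = deque([start])
    last = start
    while queue:
        u = queue.popleft()
        last = u  # BFS pops nodes in nondecreasing distance, so the last one is farthest
        for v in adj[u]:
            if dist[v] == -1:
                dist[v] = dist[u] + 1
                queue.append(v)
    return last, dist[last]

def diameter(adj):
    """Diameter (in edges) of a tree given as an adjacency list."""
    a, _ = farthest(adj, 0)   # a is one endpoint of some longest path
    _, d = farthest(adj, a)   # the farthest node from a is the other endpoint
    return d
```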

In most tree problems, we traverse the tree with a DFS starting from the root. Some simple examples to get started:
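One such example, as a minimal sketch: a DFS from the root that computes the depth of every node, the size of every subtree, and the height of the tree. It assumes the tree is already rooted and given as child lists (`children[v]` is the list of v's children). The name `depths_and_sizes` is made up for this example, and the DFS uses an explicit stack to stay clear of Python's recursion limit.

```python
def depths_and_sizes(children, root=0):
    """Compute depth of each node, size of each subtree, and the tree height."""
    n = len(children)
    depth = [0] * n
    size = [1] * n          # every subtree contains at least its own root
    order = []              # DFS preorder: each parent appears before its children
    stack = [root]
    while stack:
        u = stack.pop()
        order.append(u)
        for c in children[u]:
            depth[c] = depth[u] + 1
            stack.append(c)
    # Walk the preorder backwards so children are finished before their parent.
    for u in reversed(order):
        for c in children[u]:
            size[u] += size[c]
    height = max(depth)
    return depth, size, height
```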

Tree DP (dynamic programming on trees)

The general pattern:

Most simple tree DPs follow the same steps:

Pick a root (or use the given root)
Solve each subtree
Combine child answers at the parent
We often store several DP values at each node and need an efficient way to compute a parent's values from those of its children

Key reason it works:

Subtrees do not overlap, so you can do clean “child → parent” transitions.
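The pattern above can be sketched on a classic illustrative problem, chosen here only as an example: the largest set of nodes with no two adjacent (maximum independent set). Each node stores two DP values, `take[v]` (best answer in v's subtree if v is chosen) and `skip[v]` (best answer if v is not chosen). The function name and the child-list input are assumptions of this sketch.

```python
def max_independent_set(children, root=0):
    """Size of the largest set of pairwise non-adjacent nodes in a rooted tree."""
    n = len(children)
    # Preorder via an explicit stack; reversed, it visits children before parents.
    order = []
    stack = [root]
    while stack:
        u = stack.pop()
        order.append(u)
        stack.extend(children[u])
    take = [1] * n   # v is chosen, so it contributes 1
    skip = [0] * n   # v is not chosen
    for u in reversed(order):
        for c in children[u]:
            take[u] += skip[c]                  # a chosen parent forces the child out
            skip[u] += max(take[c], skip[c])    # otherwise the child is free to choose
    return max(take[root], skip[root])
```

Because the subtrees of different children are disjoint, the child values can simply be added at the parent.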

Problems:

"Rerooting DP":

Binary jumping/Sparse Table

Binary jumping

Let's begin with: Company Queries I (CSES)

Precompute “jump pointers” so you can move up the tree in powers of two:

$1$ step up
$2$ steps up
$4$ steps up
$8$ steps up
… and so on

Then any jump of $k$ steps can be decomposed into powers of two (binary representation).
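A minimal sketch of the jump pointers, in the spirit of a "k-th ancestor" query: `up[j][v]` is the node $2^j$ steps above v, or `-1` if the jump leaves the tree. The names `build_up` and `kth_ancestor`, the 0-indexed labels, and the `-1` sentinel are assumptions of this example.

```python
def build_up(parent):
    """parent[v] is v's parent, or -1 for the root. Returns the jump table."""
    n = len(parent)
    levels = max(1, n.bit_length())   # 2**levels > n - 1, the largest possible depth
    up = [parent[:]]                  # up[0] is a jump of 1 step
    for _ in range(1, levels):
        prev = up[-1]
        # A jump of 2^j steps is two consecutive jumps of 2^(j-1) steps.
        up.append([-1 if prev[v] == -1 else prev[prev[v]] for v in range(n)])
    return up

def kth_ancestor(up, v, k):
    """Node k steps above v, or -1 if there is no such node."""
    j = 0
    while k > 0 and v != -1:
        if k & 1:                     # bit j of k is set: take the 2^j jump
            if j >= len(up):
                return -1             # k is larger than any depth in the tree
            v = up[j][v]
        k >>= 1
        j += 1
    return v
```

Building the table costs $O(n \log n)$ time and memory, and each query touches at most one table entry per bit of $k$.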

Trees are not the only place where binary jumping is useful: the same idea works whenever each state has a single outgoing pointer.

- Tree Queries (CF)
- Tractor Paths (USACO)

LCA

In a rooted tree, the Lowest Common Ancestor (LCA) of nodes u and v is:

The deepest node that is an ancestor of both u and v.

How can we use the previous idea to solve the following?
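One possible way to apply binary jumping to LCA, sketched under assumptions: first lift the deeper node until both nodes are at the same depth, then lift both together by the largest jumps that keep them apart, so they end up just below the LCA. The names `build_lca` and `lca`, the parent-array input with `-1` for the root, and the trick of making the root its own parent are choices made for this example.

```python
def build_lca(parent):
    """parent[v] is v's parent, or -1 for the root. Returns (up, depth)."""
    n = len(parent)
    children = [[] for _ in range(n)]
    root = -1
    for v, p in enumerate(parent):
        if p == -1:
            root = v
        else:
            children[p].append(v)
    depth = [0] * n
    stack = [root]
    while stack:
        u = stack.pop()
        for c in children[u]:
            depth[c] = depth[u] + 1
            stack.append(c)
    levels = max(1, n.bit_length())
    # The root points to itself so jumps past the root simply stay at the root.
    up = [[root if p == -1 else p for p in parent]]
    for _ in range(1, levels):
        prev = up[-1]
        up.append([prev[prev[v]] for v in range(n)])
    return up, depth

def lca(up, depth, u, v):
    if depth[u] < depth[v]:
        u, v = v, u
    diff = depth[u] - depth[v]
    j = 0
    while diff:                       # lift u to the depth of v
        if diff & 1:
            u = up[j][u]
        diff >>= 1
        j += 1
    if u == v:
        return u
    for j in reversed(range(len(up))):
        if up[j][u] != up[j][v]:      # jump only while the two nodes stay different
            u = up[j][u]
            v = up[j][v]
    return up[0][u]                   # u and v are now children of the LCA
```

Each query takes $O(\log n)$ time after $O(n \log n)$ preprocessing.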

Sparse Table

Suppose we need to answer range-minimum queries on an array that does not change. A segment tree answers each query in $O(\log n)$, but can we do better?

For each index $x$ of the array and each $k = 0, 1, \dots, \lfloor \log_2 n \rfloor$, start by computing the minimum of the range $[x,x+2^k-1]$ (whenever it fits inside the array) with the binary jumping trick: a range of length $2^k$ is made of two adjacent ranges of length $2^{k-1}$. To query in $O(1)$ instead of $O(\log n)$, we use the fact that the minimum of a queried range $[a,b]$ is not affected if we count a value twice.

Therefore, if $2^m$ is the largest power of $2$ that is less than or equal to $b-a+1$, then

$$\min[a,b] = \min\left(\min[a,a+2^m-1], \min[b-2^m+1,b]\right)$$

letting us query in $O(1)$.
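The construction and the query formula can be sketched as follows. Here `table[k][x]` holds the minimum of the range $[x, x+2^k-1]$. The names `build_sparse` and `range_min` and the 0-indexed inclusive ranges are assumptions of this example.

```python
def build_sparse(a):
    """table[k][x] = min of a[x .. x + 2^k - 1], for every range that fits."""
    n = len(a)
    table = [list(a)]                 # k = 0: ranges of length 1
    k = 1
    while (1 << k) <= n:
        prev = table[-1]
        half = 1 << (k - 1)
        # A range of length 2^k is two adjacent ranges of length 2^(k-1).
        table.append([min(prev[x], prev[x + half])
                      for x in range(n - (1 << k) + 1)])
        k += 1
    return table

def range_min(table, a, b):
    """Minimum of the inclusive range [a, b], using two overlapping blocks."""
    m = (b - a + 1).bit_length() - 1  # largest m with 2^m <= b - a + 1
    return min(table[m][a], table[m][b - (1 << m) + 1])
```

The two blocks may overlap, which is harmless for minimum but would double count for an operation like sum.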

Can we use this new data structure to query the LCA of two nodes in $O(1)$ instead of the $O(\log n)$ we achieved before? To do this, let's take a look at the Euler Tour:

More Tree Queries

Tree Flattening

We can flatten a tree by doing a DFS traversal and recording information about the in/out times of nodes in the tree. For example, consider what happens if we flatten a tree and record the in-times of each node.
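A minimal sketch of flattening with in-times: if we list the nodes in the order the DFS first enters them, every subtree occupies one contiguous block of that list. The name `flatten`, the child-list input, and the convention that `tout[v]` is the last in-time used inside v's subtree are assumptions of this example.

```python
def flatten(children, root=0):
    """Return (tin, tout, order): subtree of v is order[tin[v] : tout[v] + 1]."""
    n = len(children)
    tin = [0] * n
    tout = [0] * n
    order = []                        # nodes sorted by in-time
    stack = [(root, False)]
    while stack:
        u, done = stack.pop()
        if done:
            # All of u's descendants have been entered by now.
            tout[u] = len(order) - 1
            continue
        tin[u] = len(order)
        order.append(u)
        stack.append((u, True))       # come back to u after its subtree
        for c in reversed(children[u]):
            stack.append((c, False))
    return tin, tout, order
```

With this layout, a query over the subtree of v becomes a range query over positions `tin[v]` through `tout[v]`, and u is in the subtree of v exactly when `tin[v] <= tin[u] <= tout[v]`.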

There are many potentially useful variations on how we flatten our tree. For example, we can record both the in and out times, or we could even record every time we "visit" a node in the DFS traversal. Which variation would be useful for making LCA work in $O(1)$?
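One variation that works, sketched here under assumptions: record a node every time the DFS enters it or returns to it from a child. Between the first appearances of u and v in this tour, the shallowest node is their LCA, so a sparse table keyed by depth answers each query with two lookups. The names `build_euler_lca` and `lca` and the child-list input are made up for this example.

```python
def build_euler_lca(children, root=0):
    """Return (first, depth, table) for O(1) LCA queries on a rooted tree."""
    n = len(children)
    depth = [0] * n
    first = [-1] * n                  # first position of each node in the tour
    tour = []
    stack = [(root, 0)]               # (node, index of the next child to visit)
    while stack:
        u, i = stack.pop()
        if first[u] == -1:
            first[u] = len(tour)
        tour.append(u)                # recorded on entry and after each child returns
        if i < len(children[u]):
            stack.append((u, i + 1))
            c = children[u][i]
            depth[c] = depth[u] + 1
            stack.append((c, 0))
    # Sparse table over the tour; each entry is the shallowest node of its block.
    table = [tour[:]]
    k = 1
    while (1 << k) <= len(tour):
        prev = table[-1]
        half = 1 << (k - 1)
        table.append([prev[x] if depth[prev[x]] <= depth[prev[x + half]]
                      else prev[x + half]
                      for x in range(len(tour) - (1 << k) + 1)])
        k += 1
    return first, depth, table

def lca(first, depth, table, u, v):
    lo, hi = sorted((first[u], first[v]))
    m = (hi - lo + 1).bit_length() - 1
    a = table[m][lo]
    b = table[m][hi - (1 << m) + 1]
    return a if depth[a] <= depth[b] else b
```

The tour has $2n - 1$ entries, so preprocessing is $O(n \log n)$ while each query is $O(1)$.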

- Tree Requests (CF)
