Subtractive random forests

Authors: Nicolas Broutin, Luc P Devroye, Gábor Lugosi and Roberto Imbuzeiro Oliveira

Alea, Vol. 21, No 1, 575–591, May, 2024

Motivated by online recommendation systems, we study a family of random forests. The vertices of the forest are labeled by integers. Each non-positive integer i ≤ 0 is the root of a tree. Vertices labeled by positive integers n ≥ 1 are attached sequentially such that the parent of vertex n is n − Zn, where the Zn are i.i.d. random variables taking values in Z+. We study several characteristics of the resulting random forest. In particular, we establish bounds for the expected tree sizes, the number of trees in the forest, the number of leaves, the maximum degree, and the height of the forest. We show that for all distributions of the Zn, the forest contains at most one infinite tree, almost surely. If EZn < ∞, then there is a unique infinite tree and the total size of the remaining trees is finite, with finite expected value if EZ2n < ∞. If EZn = ∞ then almost surely all trees are finite.