Behavioral Phylogeny
How similar are attacker families, really? Each family is represented by its behavioral atom bag - the weighted distribution of primitive actions observed across all sessions. Pairwise cosine distances feed a UPGMA hierarchical clustering, producing this evolutionary tree. Families that cluster together share behavioral DNA. Families that stand alone are genuinely novel.
UPGMA Cladogram
Horizontal dendrogram. Root at left, leaf nodes at right. Branch position encodes merge height (behavioral distance). Leaf size scales with log(sessions). Color encodes phylum.
Behavioral Distance Matrix
Pairwise cosine distance (1 - similarity) between all family atom bags. Ordered by UPGMA leaf sequence - clusters appear as dark blocks on the diagonal. Hover for details.
Phylogenetic Insights
uname recon atom, producing maximum behavioral entropy. It shares nothing with any worm, botnet, or operator family.Method
Each session produces a sequence of behavioral atoms via the SessionDNA pipeline. Atoms are deduplicated and counted into a weighted atom bag per family variant. Family-level bags are computed as session-weighted averages across all variants.
Pairwise distance is defined as 1 - cosine(bag_i, bag_j). Distance 0 = identical behavioral profile; distance 1 = completely disjoint behavior. The hierarchical clustering uses UPGMA (Unweighted Pair Group Method with Arithmetic Mean) - at each step, the two closest clusters merge, and new distances are the weighted average of constituent distances.
Tree is rebuilt automatically when the genome registry is updated. Last build: 2026-04-04.