Layout Comparison

BlitZoom vs ForceAtlas2, UMAP, and t-SNE on 8 graph datasets.
Full report with methodology and all configurations.

BlitZoom positions nodes by property similarity, using topology as a secondary signal. This comparison measures both topology preservation (do connected nodes end up nearby?) and property-similarity preservation (do nodes with similar attributes end up nearby?) across 8 datasets against ForceAtlas2, UMAP, and t-SNE.

The results: on datasets where property structure diverges from graph connectivity, BlitZoom leads on property grouping by 2-6×. When adjacency correlates with properties, topology-based methods capture property structure incidentally. BlitZoom's auto-tune produces competitive layouts without manual configuration, at millisecond speed.

Which method should I use?

Method Best when Tradeoff
BlitZoom Property similarity matters more than connectivity. Interactive exploration, zoom levels, millisecond speed. Weaker on sparse chain topologies.
ForceAtlas2 Topology IS the signal. Social networks, connectivity analysis, chain-like graphs. Slower (minutes). No property awareness.
UMAP Dimensionality reduction with interpretable embeddings. High-dimensional feature data. Seconds, not milliseconds. Seed-dependent.
t-SNE Revealing fine-grained local clusters. Best local topology preservation. Requires perplexity tuning. Slow.

What we measured

We use k=10 neighbors as a balance between local structure fidelity and global layout quality.

Not all methods run on all datasets. UMAP requires computing pairwise Jaccard on dense n×n adjacency matrices. At 4K+ nodes this takes too long, so UMAP is only included for smaller datasets (Epstein, Pokemon, Email-EU, BZ Source). ForceAtlas2 and t-SNE run on all datasets but take 3–87 seconds vs BlitZoom's 1–5 milliseconds. Where a method is absent from a table, it was too slow to include.

The short version

On topology metrics, ForceAtlas2 and t-SNE lead. On property-similarity, the answer depends on whether properties correlate with graph connectivity. When they don't (MITRE, Synth Packages), BlitZoom with property strengths leads by 2.6-6x. When they do (BlitZoom Source, where call edges track file structure), topology-based methods capture property structure incidentally and score higher. BlitZoom's auto-tune (zero-config) beats or matches baselines on most property datasets.

DatasetNodesPropsBest BlitZoomPropNbrPAuto-tuneTakeaway
Epstein514edge typesα=0 wt1.44x1.42xAutotune discovers edgetype; matches hand-tuned
Pokemon959multiautotune1.13x1.13xAutotune beats hand-tuned and FA2
MITRE4,736richα=0 wt2.42x0.91xProperties ≠ topology; hand-tuned leads
Synth Pkg1,868richα=0 wt5.86x2.62xProperties ≠ topology; BlitZoom leads
BZ Source917richα=0.5 wt0.84x0.56xProperties ≈ topology; topology-based methods lead
Email-EU1,005noneα=1.00.80xEdge-only; topology comparable
Facebook4,039noneα=1.00.70xDense; smoothing works well
Power Grid4,941noneα=0.750.58xSparse chains; smoothing limited

PropNbrP ratios are BlitZoom / ForceAtlas2. Auto-tune column shows autotune / FA2. Higher = better. Edge-only datasets have no autotune (no property signal).

Dataset by dataset

Epstein: edge-type properties

514 nodes (people, organizations), 494 edges with rich edge types (ABUSED, ASSOCIATED_WITH, BUSINESS_RIVAL, etc.). Properties: group + edge types.

LayoutTimeEdgeLenTopoNbrPPropNbrPNote
BlitZoom α=01ms0.5100.0050.035No property strengths
BlitZoom α=0 wt1ms0.4560.0040.118group:5, edgetype:8 — best PropNbrP
BlitZoom autotune1ms0.2770.0130.116group:8, edgetype:3, α=0.5 — discovers edgetype
ForceAtlas23s0.0450.0990.082Topology-driven
UMAP4s0.3760.0040.082Similar to FA2
t-SNE0.4s0.4280.0060.090Third-highest PropNbrP

Hand-tuned edgetype:8 scores 44% higher than ForceAtlas2 on PropNbrP (0.118 vs 0.082). The dual-pass autotune at 0.116 nearly matches — it discovers edgetype:3 through its α=0.5 pass, where edge-type structure becomes visible via topology smoothing. This is the key insight of dual-pass: some properties only show their value when combined with moderate topology.

Pokemon: multi-property, property-dominant

959 Pokemon with 4+ property groups (type1, type2, generation, rarity, stats). Edges exist but properties dominate the similarity structure.

LayoutTimeEdgeLenTopoNbrPPropNbrPNote
BlitZoom α=01ms0.5000.0040.010No property strengths
BlitZoom α=0 wt1ms0.4760.0130.019Hand-tuned: type1:8, type2:4, gen:3, rarity:2
BlitZoom autotune1ms0.3730.0090.027generation:8, α=0.15 — beats hand-tuned
ForceAtlas217s0.0240.0630.023Strong topology
UMAP5s0.0730.0120.022Similar to hand-tuned
t-SNE2s0.1820.0110.017Lowest baseline PropNbrP

Autotune outperforms the hand-tuned preset by 41% (0.027 vs 0.019) and beats ForceAtlas2 by 18% (0.027 vs 0.023). It correctly identified that generation alone with a touch of topology (α=0.15) is more discriminative than the multi-property hand-pick. Types spread across generations; generation groups tightly.

Email-EU: a dense communication network

1,005 researchers at a European institution, 16.7K emails, 42 departments as ground truth. Edge-only (no node properties beyond auto-generated tokens).

LayoutTimeEdgeLenTopoNbrPPropNbrPSilhouetteNote
BlitZoom α=01ms0.4660.0060.007-0.47Near-random layout
BlitZoom α=1.01ms0.2210.0560.007-0.29Best BlitZoom for topology
ForceAtlas219s0.0090.0640.009-0.38Shortest edges
UMAP5s0.1820.1070.010+0.01Only positive silhouette
t-SNE1s0.1610.1080.009-0.11Highest TopoNbrP

PropNbrP is uniformly low (0.007-0.010) across all methods. Without real node properties, auto-generated tokens provide little differentiation. UMAP and t-SNE recover department structure best on topology metrics.

Facebook: dense ego networks

4,039 users, 88K friendship edges. Dense community structure.

LayoutTimeEdgeLenTopoNbrPPropNbrPNote
BlitZoom α=1.03ms0.0630.1100.00374% of FA2's TopoNbrP
ForceAtlas265s0.0110.1510.004Shortest edges
t-SNE5s0.0720.1760.003Highest TopoNbrP

Dense ego-network structure responds well to topology smoothing. BlitZoom at α=1.0 reaches 74% of ForceAtlas2's TopoNbrP. PropNbrP is uniformly low (edge-only dataset).

Power Grid: sparse chains

4,941 substations, 6.6K transmission lines. Average degree 2.7, diameter ~46.

LayoutTimeEdgeLenTopoNbrPPropNbrPNote
BlitZoom α=0.754ms0.2710.0030.002Best BlitZoom; α=1.0 is worse
ForceAtlas261s0.0060.1950.002Traces chains via global forces
t-SNE10s0.1710.0410.002Limited by sparse adjacency

ForceAtlas2 dominates. Its global optimization traces long chains (diameter ~46) that 5-pass local smoothing cannot reach. α=0.75 outperforms α=1.0 because pure topology with few passes oversmooths hubs while leaving chains unresolved.

MITRE ATT&CK: the property-similarity test

4,736 nodes (techniques, tactics, software, mitigations) with rich properties: platforms, kill chain phases, aliases. This is the dataset that tests BlitZoom's core claim.

LayoutTimeEdgeLenTopoNbrPPropNbrPNote
BlitZoom α=05ms0.4610.0010.007No property strengths
BlitZoom α=0 wt4ms0.5340.0020.034Property strengths: best PropNbrP
BlitZoom autotune4ms0.4860.0020.013killchain:8, α=0.5 — near FA2
ForceAtlas287s0.1810.0050.014Shorter edges; lower PropNbrP
t-SNE12s0.2980.0040.026Second-highest PropNbrP

BlitZoom with property strengths (group=5, platforms=6, killchain=4) scores 2.4x higher than ForceAtlas2 on PropNbrP (0.034 vs 0.014). Autotune finds killchain:8 at α=0.5, scoring 0.013 — competitive with FA2 but below hand-tuned. The multi-group interaction (group + platforms + killchain) that makes the hand-pick work is not discoverable from coordinate descent alone. Without property strengths, BlitZoom's PropNbrP drops to 0.007.

Synth Packages: designed group structure

1,868 synthetic packages, 4K co-reference edges. Properties: group, downloads, license, version, depcount. Edges are co-reference links, not property-based.

LayoutTimeEdgeLenTopoNbrPPropNbrPNote
BlitZoom α=01ms0.4680.0030.010No property strengths
BlitZoom α=0 wt2ms0.3870.0050.050Property strengths: best PropNbrP (6x FA2)
BlitZoom autotune1ms0.5350.0020.022downloads:8 — 2.6x above FA2
ForceAtlas254s0.0280.0180.009Shortest edges; low PropNbrP
t-SNE19s0.2220.0010.013Low on both metrics

BlitZoom with strengths scores 5.9x higher than ForceAtlas2 on PropNbrP (0.050 vs 0.009). Autotune at 0.022 still beats FA2 by 2.6x. Graph connectivity and property similarity diverge: edges are co-reference links, not property-based. The autotune found that downloads alone is the most discriminative property for this dataset.

BlitZoom Source: when topology tracks properties

916 nodes (functions, methods, classes, imports) from this project's source code, 2,096 call edges. Properties: kind, file, lines, bytes, age, edge types.

LayoutTimeEdgeLenTopoNbrPPropNbrPNote
BlitZoom α=0 wt1ms0.4330.0120.236kind:8, group:3, rank quant
BlitZoom α=0.5 wt1ms0.3080.0170.244Topology helps slightly
BlitZoom autotune1ms0.2970.0130.163kind:3, edgetype:8 — finds edgetype
ForceAtlas211s0.0180.1100.290High PropNbrP via adjacency
UMAP7s0.1050.0560.303Highest PropNbrP
t-SNE3s0.2080.0350.249High PropNbrP via adjacency

Topology-based methods score higher on PropNbrP (0.25-0.30) because call-graph edges track file/kind similarity. BlitZoom hand-tuned at 0.244 reaches 84% of FA2. Autotune at 0.163 found edgetype:8 as a useful dimension but the hand-tuned multi-group combination (kind + group) scores better. On this dataset, topology correlation means all methods score well on PropNbrP.

What each method is good at

Aspect ForceAtlas2 UMAP / t-SNE BlitZoom
Edge lengthBest (optimizes for this)ModerateImproves with α
Topology preservationStrong; global forcesBest overall (t-SNE)Comparable on dense graphs
Property groupingIncidental; depends on adjacencyModerate (via adjacency)Best when props ≠ topology
Sparse / chain graphsStrong (global forces)LimitedLimited (local smoothing)
SpeedMinutes (O(n log n)/iter)SecondsMilliseconds (O(n))
Hierarchical zoomNoNo14 levels from 4 bytes/node
DeterminismSeed-dependentSeed-dependentFully deterministic

Auto-tune: zero-config performance

BlitZoom's auto-tune uses a dual-pass optimizer: it searches the strength parameter space at both α=0 (property-only) and α=0.5 (moderate topology), then picks whichever produces the better layout. This discovers two kinds of useful properties: those that cluster independently (generation, group) and those that only show value when topology connects related nodes (edgetype, platforms). Bearing auto-tune then rotates each group to maximize 2D spread via closed-form trace maximization.

DatasetHand-tunedAutotuneBest baselineAutotune vs baseline
Epstein0.1180.1160.090 (t-SNE)1.28× — discovers edgetype via dual-pass
Pokemon0.0190.0250.022 (FA2)1.13× — beats all
MITRE0.0340.0130.026 (t-SNE)0.49× — multi-group interaction not discoverable
Synth Pkg0.0500.0220.013 (t-SNE)1.76× — beats all
BZ Source0.2440.1630.303 (UMAP)0.54× — topology-dominant dataset

Autotune beats all baselines on 3 of 5 property datasets (Epstein, Pokemon, Synth Packages) while being 1,000-70,000× faster. Epstein is the dual-pass showcase: the α=0.5 pass discovers edgetype:3 (invisible at α=0), reaching 98% of the hand-tuned PropNbrP. MITRE remains the weak spot — the optimal three-group combination (group + platforms + killchain) isn't discoverable from coordinate descent alone. Domain experts can still do 2-3× better on datasets where the optimal property mix requires multi-way interactions.

Rank vs Gaussian quantization

BlitZoom supports two quantization modes. Rank quantization sorts nodes by position and assigns grid cells uniformly. Gaussian quantization uses fixed CDF boundaries, preserving density structure: tight clusters stay tight, sparse regions stay spread out.

Caveats

PropNbrP uses the same similarity BlitZoom optimizes. Token-set Jaccard is both the ground-truth similarity and the signal BlitZoom's MinHash approximates. This is partially circular. A fully independent property-similarity metric (e.g., domain-expert labels) would be stronger evidence.

Edge-only datasets show no property differentiation. Three of eight datasets have no real node properties. PropNbrP is uniformly low across all methods on these graphs.

When adjacency correlates with properties, topology-based methods win PropNbrP too. On BlitZoom Source, ForceAtlas2 and UMAP outscore BlitZoom on PropNbrP because call-graph edges track file/kind similarity. BlitZoom's advantage is specific to datasets where property structure differs from graph connectivity.

Hierarchical zoom is not measured. BlitZoom derives 14 aggregation levels from 4 stored bytes per node. No other method produces a zoom hierarchy. This capability is not captured by any metric above.