Stanford CS448B 13 Network Layout

August 24, 2022 · 7 min read

TLDR

This article contains my notes from Stanford's CS448B (Data Visualization) course, specifically focusing on the thirteenth lecture about network layout. I'll discuss the importance of tree layout, node-link graph layout, spanning tree layout, Sugiyama-style graph layout, hierarchical edge bundles, force-directed layout, alternative layouts, attribute-driven layout, and summary.

Original

Notes

Tree Layout

Graphs with hierarchical structure
Connected graph with N-1 edges
Nodes as parents and children

indentation

Linear list, indentation encodes depth

Items along vertically spaced rows
Indentation shows parent/child relationships
Often used in interfaces
Breadth/depth contend for space
Often requires scrolling

Visualizing Large Hierarchies:
Single-Focus (Accordion) List:
Separate breadth & depth in 2D Focus on single path at a time

Node-Link diagrams

Nodes connected by lines/curves

Nodes distributed in space, connected by lines
Use 2D space to break apart breadth and depth
Space used to communicate hierarchical orientation
Typically towards authority or generality

Basic Recursive Approach Repeatedly divide space for subtrees by leaf count
Breadth of tree along one dimension
Depth along the other dimension
Problem: Exponential growth of breadth
Reingold & Tilford’s Tidier Layout Goal: maximize density and symmetry.
Originally for binary trees, extended by Walker to cover general case.
This extension was corrected by Buchheim et al. to achieve a linear time algorithm

Design concerns:
- Clearly encode depth level
- No edge crossings
- Isomorphic subtrees drawn identically
- Ordering and symmetry preserved
- Compact layout (don’t waste space)
Algorithm:
- Initial bottom-up (postorder) tree traversal
  1. Set y-coordinate based on depth
  2. Initialize x-coordinate to zero
- At each parent node, merge left and right subtrees
  1. Shift right subtree as close as possible to left
  2. Computed efficiently by maintaining subtree contours
  3. Center parent nodes above children
  4. Record “Shift”in position offset for right subtree
- Final top-down (preorder) traversal to set x-coordinates
  1. Sum aggregated shift

Radial Layout

Node-link diagram in polar coords
Radius encodes depth root at center
Angular sectors assigned to subtrees (recursive approach)
Reingold-Tilford approach can also be applied here

Problems with Node-Link Diagrams:
Scale

Tree breadth often grows exponentially
Even with tidier layout, quickly run out of space

Possible solutions

Filtering
Focus+Context
Scrolling or Panning
Zooming
Aggregation

Hyperbolic Layout

Layout in hyperbolic space, then project on to Euclidean plane
Why? Like tree breadth, the hyperbolic plane expands exponentially
Also computable in 3D, projected into a sphere

Degree-of-Interest Trees

Space-constrained, multi-focal tree layout

Cull “un-interesting”nodes on a per block basis until all blocks on a level fit within bounds Center child blocks under parents

Enclosure diagrams

Represent hierarchy by enclosure

Encode structure using spatial enclosure
Popularly known as TreeMaps
Benefits

Provides a single view of an entire tree
Easier to spot large/small nodes

Problems

Difficult to accurately read depth

Circle Packing Layout

Nodes represented as sized circles Nesting to show parent-child relationships
Problems:

Inefficient use of space
Parent size misleading

Treemaps

Hierarchy visualization that emphasizes values of nodes via area encoding
Partition 2D space such that leaf nodes have sizes proportional to data values
First layout algorithms proposed by Shneiderman et al. in 1990, with focus on showing file sizes on a hard drive

Squarified Treemaps

Greedy optimization for objective of square rectangles Slice/dice within siblings; alternate whenever ratio worsens

https://vega.github.io/vega/examples/treemap/

Why Squares? Posited Benefits of 1:1 Aspect Ratios

Minimize perimeter, reducing border ink.
Easier to select with a mouse cursor.
Validated by empirical research & Fitt’s Law!
Similar aspect ratios are easier to compare.
~~Seems intuitive, but is this true?~~
Extreme ratios & squares-only more inaccurate.
Balanced ratios better? Target golden ratio?

Notes

People's perception of area size is not very sensitive, making it difficult to obtain detailed ratios; they can only roughly distinguish big or small. If the aspect ratio is random, it may occasionally lead to misjudgment.

Comparison of squares has higher error!
Squarify works because it fails to meet its objective?

Voronoi Treemaps

Treemaps with arbitrary polygonal shape and boundary
Uses iterative, weighted
Voronoi tessellations to achieve cells with value-proportional areas

Layering

Layering and alignment

Tree layout is fast: O(n) or O(n log n),enabling real-time layout for interaction

Node-Link Graph Layout

Spanning Tree Layout

Many graphs are tree-like or have useful spanning trees

Websites, Social Networks
Use tree layout on spanning tree of graph

Trees created by BFS / DFS
Min/max spanning trees
Fast tree layouts allow graph layouts to be recalculated at interactive rates
Heuristics may further improve layout

Sugiyama-style graph layout

Evolution of the UNIX
operating system
Hierarchical layering
based on descent

Reverse some edges to remove cycles
Assign nodes in hierarchy layers -> Longest path layering

Create dummy nodes to “fill in”missing layers

Arrange nodes within layer, minimize edge crossings

Route edges – layout splines if needed

Produces hierarchical layout

Sugiyama-style layout emphasizes hierarchy However, cycles in the graph may mislead. Long edges can impede perception of proximity.

Hierarchical Edge Bundles

Trees with Adjacency Relations

Use radial tree layout for inner circle
Mirror to outside
Replace inner tree with hierarchical edge bundles

Bundle Edges along Hierarchy

Increasing Edge Tension

Force-Directed Layout

Nodes = charged particles with air resistance

Edges = springs

F=q_i*q_j / d_{ij}^2

F=-b*v_i

F=k*(L-d_{ij})

D3’s force layout uses velocity Verlet integration Assume uniform mass m and timestep Δt:

F = ma → F = a → F = Δv / Δt → F = Δv

Forces simplify to velocity offsets!

Repeatedly calculate forces, update node positions
Naïve approach O( $N^2$ )
Speed up to O( $NlogN$ ) using quadtree or k-d tree
Numerical integration of forces at each time step

Alternative Layouts

Linear node layout, circular arcs show connections.
Layout quality sensitive to node ordering!

Attribute-Driven Layout

Large node-link diagrams get messy!
Is there additional structure we can exploit?
Idea: Use data attributes to perform layout

e.g., scatter plot based on node values

Dynamic queries and/or brushing can be used to explore connectivity

The “Skitter”Layout

Internet Connectivity
Radial Scatterplot

Angle = Longitude

Geography

adius = Degree

# of connections
(a statistic of the nodes)

Summary

Tree Layout
- Indented / Node-Link / Enclosure / Layers
- How to address issues of scale?
  
  Filtering and Focus + Context techniques
Graph Layout
- Tree layout over spanning tree
- Hierarchical “Sugiyama”Layout
- Optimization (Force-Directed Layout)
- Attribute-Driven Layout

Original​

Notes​

Tree Layout​

indentation​

Node-Link diagrams​

Enclosure diagrams​

Layering​

Node-Link Graph Layout​

Spanning Tree Layout​

Sugiyama-style graph layout​

Produces hierarchical layout​

Hierarchical Edge Bundles​

Trees with Adjacency Relations​

Force-Directed Layout​

Alternative Layouts​

Attribute-Driven Layout​

Summary​

Original

Notes

Tree Layout

indentation

Node-Link diagrams

Enclosure diagrams

Layering

Node-Link Graph Layout

Spanning Tree Layout

Sugiyama-style graph layout

Produces hierarchical layout

Hierarchical Edge Bundles

Trees with Adjacency Relations

Force-Directed Layout

Alternative Layouts

Attribute-Driven Layout

Summary