Skip to main content

Paper Reading DMap

· 3 min read

TLDR

This article contains my notes from the paper "D-Map: Visual Analysis of Ego-centric Information Diffusion Patterns in Social Media". I'll discuss how the paper proposes a visualization method to analyze information propagation patterns on social media platforms, with a focus on the D-Map layout algorithm that transforms force-directed graphs into honeycomb diagrams.

Paper

Download PDF

.

Notes

The paper proposes a visualization analysis system designed to analyze how information is propagated and disseminated on Weibo. I am only noting the part I am interested in, which is the D-Map layout algorithm, focusing on the transformation from a force-directed graph to a honeycomb diagram. pseudocode:

Case

The diffusion structure of content (such as on Instagram or Twitter) should resemble the diagram, spreading over time among different groups and being more intensely disseminated within corresponding community types (such as travel, photography, anime, gaming). Overall, it forms a tree-like data structure. To ensure that nodes within the same community type are as close as possible, the author mentioned in the original text that he added new links to nodes that repost the same content. This means that the child nodes of any given tree node are interconnected. However, to avoid the formation of cycles, links are added according to the chronological order of reposting.

Assume we have a total of 500 points, of which 90% (450) conform to the community type rule, meaning the community type of the child nodes should be consistent with that of the parent nodes. The same community ensures the same interests. Then, 10% (50) random points are added, representing some random behaviors within the community. Based on the above data, attempt the layout algorithm described in the paper.

note

Note that the above data is severely distorted and is only for the purpose of reproducing the layout.

  • Use a half-normal distribution to model the reposting behavior of each child node, meaning that most posts should have zero reposts, with only a few posts going viral.
  • Randomly generate timestamps within a 3-day timeframe to simulate the timing of user reposts.

I attempted the layout method described in the original paper, but due to the excessive scene-specific processing and my persistent inability to understand how to assign values to the M variable in the fourth line of the algorithm's output, I faced difficulties.

In fact, I only need an algorithm that can convert a relational graph into a honeycomb graph while preserving the temporal structure.

Based on this requirement, I have re-implemented the following case:

It works well for me.It looks quite convincing, but I still need real social data to verify whether it is effective for me. Until then, it is merely a layout algorithm.