10 trang 34 lượt tải

Kelp Diagrams| Giáo trình quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội

Visualization of one or multiple sets, possibly sharing sev-eral elements, is a recurrent theme both inside and outside
the visualization community. Sometimes, depiction of the
sets is the main concern, excluding any other information
of contained elements besides some means of identification,
i.e., a label.

Môn: Quản trị dữ liệu và trực quan hóa 50 tài liệu

Trường: Đại học Bách Khoa Hà Nội 2.8 K tài liệu

Tác giả:

Trịnh Thảo Anh

11 tháng trước

Danh sách Quiz

Eurographics Conference on Visualization (EuroVis) 2012

S. Bruckner, S. Miksch, and H. Pﬁster

(Guest Editors)

Volume 31 (2012), Number 3

Kelp Diagrams: Point Set Membership Visualization

Kasper Dinkla

, Marc J. van Kreveld

, Bettina Speckmann

, and Michel A. Westenberg

Eindhoven University of Technology, The Netherlands.

Utrecht University, The Netherlands.

Figure 1: Kelp Diagrams applied to a metabolic network (left) and cities on a map (right).

Abstract

We present Kelp Diagrams, a novel method to depict set relations over points, i.e., elements with predeﬁned

positions. Our method creates schematic drawings and has been designed to take aesthetic quality, efﬁciency, and

effectiveness into account. This is achieved by a routing algorithm, which links elements that are part of the same

set by constructing minimum cost paths over a tangent visibility graph. There are two styles of Kelp Diagrams to

depict overlapping sets, a nested and a striped style, each with its own strengths and weaknesses. We compare Kelp

Diagrams with two existing methods and show that our approach provides a more consistent and clear depiction

of both element locations and their set relations.

Categories and Subject Descriptors

(according to ACM CCS): I.3.6 [Computer Graphics]: Picture/Image

Generation—Line and curve generation

1. Introduction

Visualization of one or multiple sets, possibly sharing sev-

eral elements, is a recurrent theme both inside and outside

the visualization community. Sometimes, depiction of the

sets is the main concern, excluding any other information

of contained elements besides some means of identiﬁcation,

i.e., a label. Here, simple visualizations sufﬁce for a small

number of sets, such as a table with a mapping of elements

to rows and sets to columns, or a more space-efﬁcient Eu-

ler diagram. Not every situation warrants such an approach.

Sometimes, other data aspects (partially) dictate the posi-

tions of the elements’ visual representations. For example,

geographic places are often best positioned at their real-

world coordinates because this is consistent with the existing

knowledge of the observer and improves visual orientation.

Furthermore, it allows spatial patterns to emerge.

The two state of the art approaches, Bubble Sets [CPC09]

and LineSets [ARRC11], use colored shapes to visually con-

nect elements that belong to the same set. Bubble Sets de-

rives an element density function for each set. Isolines are

extracted from each function to form shapes around the el-

ements. These shapes are then connected further with links,

routed along the elements. LineSets draws a thick colored

curve through the elements of each set, making sure that the

traversed path over the elements is relatively short. Both ap-

proaches generate visualizations that often appear complex

and sometimes even convey invalid set memberships. In con-

trast our method strictly controls where shapes of sets are

placed. It consists of three phases (see Fig. 2): the alloca-

tion of space around each element such that there is enough

room to depict its containing sets; the allocation of space for

connecting shapes between demarcation zones with a rout-

 2012 The Author(s)

Computer Graphics Forum

 2012 The Eurographics Association and Blackwell Publish-

ing Ltd. Published by Blackwell Publishing, 9600 Garsington Road, Oxford OX4 2DQ,

UK and 350 Main Street, Malden, MA 02148, USA.

DOI: 10.1111/j.1467-8659.2012.03080.x

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

Figure 2: The three phases of generating Kelp Diagrams.

ing algorithm; and the generation of actual visualizations, by

using the allocated space, in two distinct ways.

In summary, our contribution is:

• Two styles of diagram that emphasize different aspects of

set memberships and overlap for elements with a prede-

ﬁned position;

• a routing algorithm for linking elements in a set to support

the generation of such diagrams, where aesthetic quality,

efﬁciency, and effectiveness are taken into account.

2. Related Work

Conveying information about multiple sets is a longstand-

ing problem and exists in various forms. When the loca-

tion of the elements are not speciﬁed, then the input is

simply a set system. Each set can also by interpreted as

a hyperedge of a hypergraph deﬁned on the elements (the

vertices of the hypergraph). There are several papers from

the graph drawing community [JP87, DBETT99] that dis-

cuss how to draw hypergraphs. Most recent efforts have

focused on so called planar supports for hypergraphs and

the associated subdivision drawings [KvKS09] (see also

[BCPS10a, BCPS10b, BvKM

∗

10] for further theoretical re-

sults on speciﬁc types of planar supports). Unfortunately

most hypergraphs do not have planar supports and hence

subdivision drawings are of very limited practical use.

When only sets (or hyperedges) are of concern, the de-

piction possibilities are numerous. Euler diagrams are well-

known and use contours to denote areas that represent

sets, which is sometimes referred to as the subset stan-

dard. Such diagrams can be generated automatically; by

abstracting from individual elements [FRM03], or by in-

cluding representations of the elements and related infor-

mation [BE01, SAA09, HRD10]. Here, the positions of el-

ements either do not matter as no elements are displayed,

or are assigned to optimize the visualization of the sets. The

number of elements and sets inﬂuences the effectiveness of

a visualization. For many elements and groups, augment-

ing a contour-based visualization is a possibility [ST10],

or a highly-interactive analysis environment is a necessity

[FMH08].

When dealing with predeﬁned positions of elements, dis-

playing the sets becomes more difﬁcult. If contours are

used, like Euler diagrams, a fair division of display space

is an issue [BT06, BT09, CPC09]. Another option is to con-

nect highlighted elements with (colored) links [SA06,CC07,

SLK

∗

09]. This is referred to as visual linking, which can take

on sophisticated forms [SWS

∗

11]. However, visual linking

focuses on relating elements, not the comparison of sets in

a spatial setting. For both approaches the way in which con-

tours or links are placed affects the depiction of the sets but

also of the predeﬁned visualization.

3. Problem Analysis

An input problem instance consists of three aspects: posi-

tioned elements, the sets that contain them, and the prede-

ﬁned visualization that embeds the elements. We want to

depict these sets in combination with the predeﬁned visu-

alization. To determine what makes a good set depiction we

enumerate tasks that an observer may wish to perform by in-

terpreting the visualization and hence the data. These tasks

impose constraints that any visualization has to fulﬁll when-

ever possible, but also provide (conﬂicting) optimization cri-

teria to improve task performance of the observer.

Supported tasks. The composition of multiple elements

and sets—in a spatial setting—brings forth many questions

of a comparative nature: To what extent do sets overlap or

differ? How close to each other are the elements of a set? Is

the containment of an element in a set correlated to its po-

sition? We have compiled a list of primitive tasks that have

to be supported by a visualization such that an observer is

able to answer these questions, or which the observer can

perform ad-hoc to gain insight about the data:

T1a determine the position of an element

T1b ﬁnd an element by position (relative to landmarks)

T1c estimate the density of elements in an area

T2a determine which sets contain a speciﬁc element

T2b ﬁnd the elements that belong to a speciﬁc set

T2c estimate the spatial distribution of a speciﬁc set

T3 compose a (mental) set from existing sets with opera-

tions union, intersect and complement, and apply T1 – T2

The tasks can be composed to answer more complex ques-

tions. When dealing with cities on a map, with a set of large

cities and a set of industrial cities, the following questions

may be asked: Which cities are large but not industrial?

(T2b and T3); Are industrial cities clustered together? (T2c);

Is New York considered large and/or industrial, and which

neighboring cities are similar? (T1a, T2a, T3, and T2b). As

shown, answering common questions involves the combina-

tion and execution of these primitive tasks. Thus, improving

the efﬁciency at which tasks can be performed, improves the

ease at which complex questions can be answered.

Constraints. The visualization has to satisfy the following

constraints to enable T1 and T2:

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

876

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

C1 every element is clearly represented in the ﬁnal visual-

ization at its predeﬁned position (for T1)

C2 every element is clearly marked or contained by a rep-

resentation of every set that it is a part of (for T2)

These constraints are satisﬁable, provided that all ele-

ments are visually distinct, i.e., all elements are positioned

at a discernible distance from each other. We assume this to

be the case because any predeﬁned visualization has to sup-

port T1 to be of practical purpose and should therefore have

visually distinct elements in the ﬁrst place.

Aesthetic criteria. The aforementioned constraints guaran-

tee that all tasks can be performed but do not provide di-

rection towards an efﬁcient visualization that allows (com-

posite) tasks to be performed by an observer with little ef-

fort. Generation of aesthetic shapes has been a subject of

research before for Euler diagrams [SAA09], and the gener-

ation of effective graph representations, by reduction of in-

tersections for example, is a common theme in graph draw-

ing [DBETT99]. This has inspired us to list important prop-

erties that make for good shapes that depict sets.

Shapes should have low cognitive load:

A1a small area

A1b few and shallow bends

A1c few outline intersections

Not only do aesthetic shapes appeal to the observer, they

in general are accompanied by a low cognitive load, i.e., it

takes less effort for the observer to process shapes and thus

derive the information they are meant to convey. For a set de-

piction, the faster the shapes that convey sets are interpreted,

the faster T2 can be performed.

A small area (A1a) implies less surface to inspect and pro-

cess, thus less cognitive load. Few and shallow bends (A1b)

imply smooth outlines that are easy to distinguish from their

surroundings (the predeﬁned visualization). It also means

that outlines are short and therefore easier to process. In-

tersections of outlines (A1c) make them harder to tell apart

and discern the area they contain and the elements therein.

Shapes should be effective:

A2a large area

A2b large distance between outlines

A2c little overlap of shapes that depict different sets

A2d strong continuation of shapes that depict the same set

Shapes of large surface area (A2a) attract attention, mak-

ing the presence of a set explicit (T2). If outlines of shapes

are close to each other they are harder to tell apart, often

causing the overall shapes to be less pronounced. Likewise,

overlapping shapes (A2c) may obfuscate each other and the

information they should convey. Both impact T2 negatively.

For T2b—ﬁnding the elements that belong to a speciﬁc

set—to be performed efﬁciently, the elements have to be as-

sociated as a group by the observer. Otherwise, the observer

(a) (b) (c)

Figure 3: The added beneﬁt of linking: (a) Elements are as-

sociated solely by the colored shapes that contain them; (b)

Elements are associated both by color and a common shape;

has to scan all elements in a linear fashion to determine their

corresponding sets. Incorporating distinguishing features

such as color for the shapes is an effective approach. How-

ever, creation of a visual continuation of shapes (A2d) causes

an even stronger grouping effect (see Fig. 3), on which exist-

ing approaches rely as well [CPC09, ARRC11,SWS

∗

11]. In

certain situations, like the one shown in Fig. 3(c), strong con-

tinuation also emphasizes spatial patterns of elements and

sets. This includes improving the detection of spatial clus-

ters (T2c). As is the case for other criteria, continuation is

not a strict constraint, i.e., elements that belong to the same

set do not have to be connected.

Shapes should not distort element position and density:

A3a little obfuscation of the predeﬁned visualization

A3b strong correspondence between the presence and size

of a set’s shape, and the presence and density of elements

that belong to this set, in an area

Not only do the shapes affect the way in which the prede-

ﬁned visualization is perceived through partial occlusion or

obfuscation, they also affect the way in which the elements

and their locations are perceived (A3a and A3b). For exam-

ple, when the set containment of an element is depicted with

a large colored circle, this circle will form a stronger visual

cue to the presence of an element than the element’s own de-

piction (see Fig. 4(a)). However, this can also affect the per-

ceived position of the element (see Fig. 4(b)) and the density

of elements in an area (see Fig. 4(c)).

Many of the stated criteria are in conﬂict with each other:

A2a with A2c, A1a with A1b and A1c because of the routing

that is required, A1c with A2d, and A3b with all other crite-

ria. Deﬁning an optimal visualization therefore not only re-

(a) (b) (c)

Figure 4: The distorting effect of shapes on element depic-

tion: (a) An element contained in a shape attracts more at-

tention; (b) The expected position of the element lies at the

center of the circle, which conﬂicts with its actual position;

but the difference in size of the shapes suggests otherwise.

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

877

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

quires quantifying the individual criteria, it also requires the

criteria to be prioritized and combined into an overall deﬁni-

tion of an optimum. Such an approach has been used for the

creation of aesthetically pleasing Euler diagrams [FRM03]

and label placement on maps [vDvKSW02], where different

criteria are weighted and then used as a ﬁtness function. It

underlines the varying expectations that different observers

may have of visualizations.

Moreover, when criteria like A1c, which require routing,

are included in an overall optimization scheme, the combina-

torial complexity of the problem greatly increases and forces

the use of heuristic algorithms.

4. Approach

Our approach consists of three phases: allocation of element

space, allocation of additional link space, and the generation

of visualizations (see Fig. 2). The following pseudo-code

provides an overview of the approach, where Sections 4.1,

4.2, and 4.3 elaborate on lines 1–2, 3–10, and 11, respec-

tively. Here, the elements are denoted as E, the predeﬁned

element positions as p(e) for e ∈ E, and the collection of

sets as S, where for every S ∈ S we have S ⊆ E.

Algorithm Kelp(E,S)

1. Derive Voronoi diagram of {p(e)|e ∈ E}.

2. For every e ∈ E derive A(e) as the intersection of its

Voronoi face and a circle of radius r

, centered at p(e).

3. Derive embedded graph G

from {A(e)|e ∈ E}.

4. Derive tangent graph G

from G

5. while best-to-place link l between p, q ∈ S | S ∈ S has

beneﬁt b(p, q) > b

6. do Add edges of l in G

to subgraph G

(S).

7. Derive all-pair shortest paths of G

(S).

8. Update G

with intersections introduced by l.

9. Derive all-pair shortest paths of G

for all R ∈ S.

10. Derive next best-to-place link from shortest paths

in G

(R) and G

for all R ∈ S.

11. Derive visualization style from all G

(S) | S ∈ S.

4.1. Element space (phase 1)

T2 requires that for each element it is clear to which sets

it belongs, hence each element should have ample (and at

least equally divided) surrounding space to display its sets.

Taking the trade-off between A2a and A2c into account, we

want the observer to have a level of control on how much

space is used for the set visualization. Given such ﬁxed area

per element and considering A1a and A1b, the natural choice

of area to allocate around an element is a circle of radius r

It is not always possible to allocate a perfect circle around

each element. Sometimes the distance between two elements

is smaller than 2r

, causing the circles to overlap, or smaller

than r

, causing circles to overlap each other’s elements. To

resolve this space contention, we ﬁrst calculate the Voronoi

(a) (b) (c)

Figure 5: Allocation of space for three elements: (a) Over-

lapping circles, centered over elements; (b) The Voronoi

faces of the elements’ positions; (c) Intersection of circles

and Voronoi faces.

diagram of {p(e)|e ∈ E} and then, for every element e ∈ E,

intersect the allocated circle of e with the Voronoi face that

contains p(e) and use it as the new allocated space A(e) (see

Fig. 5). Hence, no allocated space intersects and elements

get a fair share of space.

The resulting space partition is stored as an embedded

graph G

= (V

, E

), with vertices V

for the elements, and

vertices V

for the intersection points between Voronoi faces

and circles, including the points that are shared by more

than two Voronoi faces and lie within an element circle. The

edges between these intersection points are either straight

(part of a Voronoi face) or circular.

The allocated space of each element is referred to as a fair

share—not an equal share—because sometimes elements do

not receive space that is equal in surface area to their neigh-

boring elements (Fig. 6(a)). This unequal allocation could in

certain cases affect the depiction of element density (A3b)

negatively. Applying a repulsive force between the elements’

circles, and shifting their positions accordingly, would in

many situations result in an equal division (Fig. 6(b)), but is

not applicable to all situations (Fig. 6(c)). Moreover, manip-

ulating the position or shape of the circles will almost always

harm the ability of an observer to ﬁnd elements by position

(T1a) and to determine element density in an area (A3b), see

also Fig. 4. The intersection of an element’s Voronoi face

and circle is simple and allows for visual reinforcement of

the element’s position, which is discussed in Section 5.

(a) (b) (c)

Figure 6: Area division between elements: (a) Our method

allocates an unequal share of space; (b) Possible outcome

of a force-based relocation of the circles; (c) Instance where

allocation of equal space for element e is not possible with-

out a more complex shape.

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

878

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

4.2. Link space (phase 2)

A2d states that we should visually connect or link those ele-

ments that belong to the same set such that elements from the

same set are more easily associated with each other. How-

ever, any additional link will result in a more complex visu-

alization, negatively affecting most other criteria. Therefore,

we have to allocate link space for every S ∈ S, such that the

advantages of the links in the ﬁnal visualization outweigh

their disadvantages, or cost, as much as possible.

We ﬁrst consider the cost c(l) of placing a link l between

p, q ∈ S, where S ∈ S, in the scene. It can be modeled in

a simple way, where c(l) = c

d(l) + c

(l) + c

I(l). Here,

d(l) is the distance covered by l;

(l) is the aggregate angu-

lar change of l in radians; I(l) is the number of intersections

between the contours of l and the contours of links already

placed in the scene; and c

, c

, and c

are weight parameters.

Distance is penalized in accordance with A1a, aggregate an-

gular change with A1b, and intersections with A1c.

No intersections should exist between l and any allocated

space A(e) where e ∈ E\{p, q}, as l has no right to use space

of A(e). Also, c(l) dictates that l runs directly adjacent to any

A(e) that it passes, because tightening l, without altering its

topology w.r.t. A(e), will always lower d(l) and

(l) without

changing I(l) (see Fig. 7).

(a) (b)

Figure 7: Two possible paths of a link l between p, q ∈ E (in

red): (a) l with unnecessarily high cost; (b) l with identical

topology but minimized cost.

As can be seen in Fig. 7, this tightness means that any de-

sirable link can be constructed from the edges in G

when

certain (tangent) edges are added to it, similar to robot mo-

tion planning with tangent visibility graphs [WvdBH07]. So

we extend G

to form G

= (V

, E

), which includes (tan-

gent) edges between A(p) and A(q), p and A(q), A(p) and

q, and p and q. In addition, for every v ∈ V

, we have edges

to every p ∈ E and tangent edges to A(p) (see Fig. 8). This

also adds vertices at the tangent points, which split up the

original circular edges.

(a) (b) (c)

Figure 8: Examples of the additional edges in E

(in red):

(a) Tangent edges between A(p) and A(q), p, q ∈ E; (b)

Edges from p to q and A(q); (c) Edges from v ∈ V

(a) (b)

Figure 9: Beneﬁt of placing a link between p, q ∈ S, S ∈ S is

dependent on already placed links: (a) The beneﬁt of placing

is low because already placed l

has low cost; (b) The

beneﬁt of placing l

is high because already placed chain of

links l

, l

, ..., l

has high cost.

Now, any link l of set S ∈ S can be decomposed into a se-

quence of touching edges {e

, e

, ..., e

} ⊆ E

, and c(l) can

then be derived from G

by summing the costs of the indi-

vidual edges. Based on these costs, it is possible to compute

a minimum cost path between p,q ∈ E in G

and thus get l.

Using only cost as a criterion for placing links is not

enough when we want to connect elements beyond a span-

ning tree. When spanning trees have been established for all

sets, additional low-cost links are not always of great beneﬁt

to the visualization when they are placed. Consider the situ-

ations depicted in Fig. 9. For Fig. 9(a) the beneﬁt of placing

link l

is low because a low cost link l

is already in place

to provide ample visual linking between the elements. For

Fig. 9(b) the beneﬁt of placing link l

is high because the

links that already connect p and q sum to a high cost, which

means that in the current situation it is hard for the observer

to follow links from p to q, while placement of l

would

make this task considerably easier.

Let G

(S), for every S ∈ S, be a subgraph of G

, which

contains only edges of links that have so far been added for S.

We deﬁne the beneﬁt of placing a lowest cost link l between

p, q ∈ S, as b(p, q) =

d(p,q)

c(l)

, where d(p, q) is the minimum

path distance between p and q in G

(S). Hence, the beneﬁt

is higher when the cost of placing l is lower or the smallest

distance via already placed links is higher. When p and q

are not connected in G

(S), then d(p, q) = ∞. In case the

beneﬁt of links has to be compared for disconnected vertices

of G

(S), we compare only by link cost.

Given these deﬁnitions, the algorithm places links in a

greedy manner: Determine S ∈ S and p, q ∈ S with highest

b(p, q). If b(p, q) > b

, where b

is a beneﬁt threshold pa-

rameter, add link l to G

(S) and repeat the algorithm. When

b(p, q) ≤ b

, the algorithm terminates.

In our approach so far, routed links have zero width. How-

ever, routing links of parameterized radius r

(width 2r

) is

achieved by increasing element space allocation from r

+ r

(see Fig. 10).

When a link is placed, intersection information is updated

for every edge e ∈ E

, such that we know the number of

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

879

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

(a) (b) (c)

Figure 10: Link radius: (a) Increase of element space allo-

cation by r

; (b) Link with allocated space of radius r

and its

contours c

and c

; (c) Two links routed beside each other.

intersections of e’s contours (dilation of e by r

) with con-

tours of already placed links, i.e., dilation of e

by r

for all

of G

(S), S ∈ S. Tracking intersections between just the

edges is not sound as it is possible for edges to be placed

next to each other within 2r

. Links routed over such edges

would therefore intersect without receiving the right inter-

section penalty. To accommodate routing multiple links ad-

jacent to each other and around the same element space (see

Fig. 10(c)), we also extend G

to include A(e), and corre-

sponding (tangent) edges, dilated with steps of 2r

Overlapping contours are not counted as intersections.

This allows two links l

and l

to share part of the same

path with few intersections I(l

) and I(l

). Low-cost sharing

of paths is exactly what we want to properly convey overlap-

ping sets, as explained in Section 5.

4.3. Visual styles (phase 3)

The space allocated for elements and their sets’ visual links

can be used in various ways. We devised two very different

diagram styles: nested and striped Kelp.

Nested style. Nested Kelp surrounds elements of every set

with a colored shape (see Fig. 11 (top)), where the shapes

of every set are stacked on top of each other. It relies on the

ability of the observer to mentally distinguish and complete

shapes that are partially overlapped by other shapes. The al-

located space for elements and links provides a framework

to deﬁne the diagram such that every shape is sufﬁciently

visible, in correspondence with constraint C2.

We construct the set of shapes N(S) for every S ∈ S as

follows: For every element e ∈ E, assume S

, S

, ..., S

con-

tain e, ordered such that |S

| ≤ |S

| where i < j. For every S

scale the allocated space A(e) around its centroid by a factor

(

)

, where s

is a parameter, and merge it into N(S

For every edge e ∈ E

, let Q be the set of edges reachable

from e in the union of all G

(S), S ∈ S, without passing be-

yond an element node. Assume that S

, S

, ..., S

contain an

edge in Q, ordered such that |S

| ≤ |S

| where i < j. Then,

for every S

, dilate e by r

(

)

, where s

is a parameter, and

merge it into N(S

). Here dilate and erode are the equivalents

of Minkowski sum and Minkowski subtraction with a circle

of a speciﬁed radius [dBCvKO08]. Thus, for every element,

set membership depictions are bound by the space that was

allocated for the element. In addition, depictions are scaled

to nest. Links and element circles thus do not become visu-

ally dominant when many sets share an element or path.

Links of a set are scaled to nest with links of other sets

that partially share a path in G

. The order in which sets are

nested is based on the size of sets, i.e., smaller sets nest in

larger sets, which is consistent throughout the diagram. The

scaling of both element circles and links, by their level of

nesting, depends on parameters s

and s

, respectively, such

that the share of space allocated to every set can be adjusted

to compensate for the effects of perceptual scaling [DTH08].

The shapes N(S), for S ∈ S, are drawn on top of each other,

ordered by |S|, where N(S) is ﬁlled with a color and given

a gray outline to enhance contrast between shapes of differ-

ent sets. In addition, the shapes are dilated and eroded to

smoothen the corners of the allocated element space and the

transition between links and elements. A strong erosion re-

sults in a clear separation between shapes that are not nested

(A2b), as seen at the top of Fig. 11.

Striped style. Striped Kelp uses alternating stripes for ar-

eas that contain elements of multiple sets (see bottom of

Fig. 11). The allocated space A(e) of e ∈ E is partitioned

into radial slices, where every set that contains e gets the

same number of slices in an alternating pattern. Edges of G

that belong to a set have link radius r

. When an edge belongs

to multiple sets (links partially share a path), it is partitioned

into stripes of ﬁxed length where the sets give an alternating

pattern.

Stripes are drawn as consistently as possible. If two links

Figure 11: Kelp Diagram of eleven elements and three sets.

Top: Nested style. Bottom: Striped style.

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

880

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

(a) (b) (c)

(d) (e) (f)

Figure 12: Kelp applied to the capital cities of the European Union, where the eurozone is blue, the EU founding members

(European Coal and Steel Community) are pink, and members with good, average, and bad credit rating are green, orange, and

red respectively (Standard & Poor’s, October 2011). The diagrams have various conﬁgurations: (a) Nested style without links;

(b) Nested style with links; (c) Striped style; (d) Nested style with large element radius r

and small link radius r

; (e) Nested

style with large intersection penalty c

; (f) Nested style with low link addition threshold b

share edges but eventually split up, the stripes will continue

along the edges at the split. This is achieved by drawing

the stripes via a depth-ﬁrst search in the G

graphs. Cer-

tain cyclic conﬁgurations of links do not allow for consistent

stripes, but these are rare enough in practice.

4.4. Implementation and performance

We implemented our approach in Java, using the Java Topol-

ogy Suite [Viv03] for geometric operations such as dilation

and erosion. One advantage of our purely geometric routing

and diagram deﬁnitions is that vector graphics can be gen-

erated and merged with the predeﬁned (vector) visualization

to get images that can be rendered with arbitrary resolution.

The running time of the algorithm is dominated by the it-

erative placement of links. The tangent graph G

contains

O(|E|

) edges and nodes. To place one link, shortest paths

are computed to all nodes in G

from every element and

for every set. The same applies to all G

graphs. This com-

putation, including selection of the best link, amounts to

O(|S||E|

log|E|) when Dijkstra’s algorithm is used. After

a link is placed, any intersections that it introduces have

to be accounted for. Thus, intersections are determined for

all edge pairs, which amounts to O(|E|

log|E|) time with a

sweepline algorithm. It is reasonable to assume that the al-

gorithm’s parameters are conﬁgured such that for every set

S ∈ S the ﬁnal G

(S) is planar and therefore O(|E|) links

are placed for S. The entire links placement phase therefore

takes O(|S|

|E|

log|E|) time. This also bounds the running

time of the entire approach, with the space allocation and

visualization phases being relatively cheap.

The bound (though not tight) indicates that the current ap-

proach cannot be applied to data sets of thousands of ele-

ments. However, most data sets commonly used in this vi-

sualization problem do not go beyond a hundred elements

and several sets, since larger sets cannot be comprehended

by an observer. Our implementation is able to generate Kelp

diagrams for such data sets within ﬁve minutes on a modern

desktop computer (Intel Core 2 Quad CPU at 2.4 GHz).

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

881

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

(a) (b) (c)

Figure 13: Restaurants of three categories in Seattle: (a) Bubble Sets approach, generated with a public software library [KC];

(b) LineSets approach, image taken from [ARRC11]; (c) Nested Kelp Diagram.

5. Results and Discussion

Fig. 12 shows Kelp Diagrams with various parameter conﬁg-

urations. The beneﬁt of visually linking elements that belong

to the same set follows from Fig. 12(a) and (b). Increasing

the element radius r

(see Fig. 12(b) and (d)) causes more

of the original map to be occluded, making it harder to ﬁnd

capital cities by their location, but sets are more easily dis-

tinguished. Especially when multiple links share a path, the

sets of the nested style are harder to distinguish, requiring

larger r

. The striped style suffers less from this. A com-

parison of Fig. 12(b) and (e) reveals the effect of increasing

the intersection penalty c

, where the algorithm deems it of

greater beneﬁt to route a long link around Helsinki instead

of introducing additional intersections. The effect of the link

addition threshold b

is shown in Fig. 12(f), where a low b

results in the construction of spanning trees. Kelp can be ap-

plied beyond cartography, as shown for a metabolic network

in Fig. 1. Here, nested element scaling s

has been given

a very low value to push the contours away from labels of

compounds.

The strengths and weaknesses of the nested and striped

styles can be seen in Fig. 12(b) and (c). When multiple sets

share the same link, e.g., Amsterdam-Berlin, the sets of the

nested style quickly become harder to distinguish, whereas

the striped style has no such problem provided that the link

is long enough to ﬁt ample stripes for each set. The striped

style also prevents false assumptions to be made about the

nested structure of the depicted sets. The nesting at Ljub-

ljana suggests that the blue set is a subset of the green set,

while they actually only overlap. However, the striped style

ignores some of the criteria listed in Section 3. The stripes

of a shared link or node break the continuation of the sets’

shapes, which explains why the shapes of the nested style

are more easily interpreted as a whole. Stripes also increase

the visual complexity of the diagram because there are more

and stronger bends (A1b). Therefore, the nested style is eas-

ier on the eyes and likely the more versatile style in many

situations. It also happens to be the closest visual match to

existing techniques.

Fig. 13 and 14 compare the nested style with the Bub-

ble Sets and LineSets techniques for the same datasets. We

can see that the Bubble Sets approach has many bends in its

contours. In some locations, shapes and contours have un-

desirable overlap. Compared to the Kelp Diagram, the Bub-

ble Sets depiction introduces more overlapping areas. This is

not surprising since the Bubble Sets algorithm does not try

to avoid intersections between links where Kelp does. In ad-

dition, Bubble Sets create shapes that use a lot of space and

color blending introduces new colors for overlapping shapes

that may be interpreted as additional, non-existing sets.

Bubble Sets and LineSets cannot visually connect ele-

ments beyond creating a spanning tree, whereas Kelp cre-

ates a graph to interconnect elements as much as desired

via parameter b

. Additional links, beyond a spanning tree,

may not always be desired. For example, Fig. 14(c) has been

generated with a relatively high b

. For this diagram, some

links of the brown set introduce some additional visual clut-

ter (A1a, A1b, and A3a) though improve interpretation of

spatial distributions (A2d and A3b). However, when b

is set

to a low value, surplus links will only be placed when con-

tinuation greatly improves. This is the case for the green set

of Fig. 13.

LineSets do not route shapes around elements that should

not be contained by them, which the other techniques do.

Hence, in Fig. 14(b) we see an element of the brown set at

the middle left that overlaps with a shape of the orange set,

while it is not a part of the orange set. Moreover, LineSets

connect elements of a set with a single line, creating longer

links, many bends, and intersections. For example, there is a

line with a strong bend at the bottom of Fig. 13(b) that could

have been avoided.

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

882

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

(a) (b) (c)

Figure 14: Disjoint sets of locations in Manhattan, with hotels (orange), subway stations (brown), and medical clinics (purple):

(a) Bubble Sets, image taken from [CPC09]; (b) LineSets, image courtesy B. Alper and N. Riche; (c) Nested Kelp Diagram.

Even though the element glyphs are very small in

Fig. 13(c), we can immediately see the presence and den-

sity of elements because the surrounding contours are (par-

tially) circular. For LineSets, the presence of an element can

be inferred from the presence of a bend, while for Bub-

ble Sets the large shapes make elements harder to spot (see

Fig. 13(a)). Kelp’s composition of basic visual units—circles

and constant-width connections—is an advantage over Bub-

ble Sets. Depictions are consistent, which means less visual

clutter and easier interpretation.

Kelp Diagrams have some drawbacks. They sometimes

have sharp bends where two links of the same set converge.

Also, contours can become complex for clusters of elements

that belong to the same set. LineSets suffer from this as well,

in contrast to Bubble Sets. In addition, when elements are

too close to each other, their Voronoi faces become domi-

nant in space allocation and cause strong corners to occur in

nearby contours. The schematic appearance of Kelp on top

of a predeﬁned visualization that is schematic as well can be

confusing, e.g., the set shapes may be interpreted as roads or

metro lines. The more organic appearance of LineSets and

Bubble Sets make them stand out from a map.

6. Conclusion

Kelp Diagrams are a new way to depict set relations over

already positioned elements. The algorithm balances visual

complexity, based on aesthetic criteria, with effective depic-

tion of the data. Comparison of resulting visualizations with

those generated by two state of the art approaches for the

same data shows that Kelp Diagrams have a consistent, easy

to interpret, appearance. In addition, the parameters of the

algorithm give it the ﬂexibility that is required for applica-

tion to different kinds of visualization.

Still, several improvements are possible. The routing al-

gorithm is too slow for interactive use. Exploiting locality

with spatial data structures, or replacing the tangent visibility

graph with a graph of smaller complexity, are possible op-

tions. Also the greedy placement of links can be improved to

reduce visual clutter. For nested Kelp, improved derivation

of link width and nesting order with respect to aesthetic cri-

teria requires further investigation. For example, links could

be given widths according to an additive scheme, where the

width of a link is increased when it has to convey multiple

sets. In addition, allocated element and link space could be

made dependent on the element density in an area.

Other extensions are possible, such as supporting ele-

ments with dimension (instead of points) and automated

derivation of parameter settings based on the data itself.

Use of the subset standard, as is the case for Kelp Dia-

grams, has been indicated to be effective via user studies

before [ARRC11]. Nonetheless, we plan to investigate the

effectiveness of the presented diagrams, and compare them

to existing techniques, with further user studies.

Current approaches share a great deal of similarities in an

attempt to solve the same problem. It would be interesting to

explore a more generic method, which is able to produce the

distinct visualizations of current approaches, but also able to

produce hybrid visualizations of greater quality.

Acknowledgements. We would like to thank Nathalie

Riche, Basak Alper, and their colleagues for providing us

with the data for Fig. 13 and 14. B. Speckmann and K. Din-

kla are supported by the Netherlands Organisation for Sci-

entiﬁc Research (NWO) under projects no. 639.022.707 and

no. 612.001.004, respectively.

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

883

K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams

References

[ARRC11] ALPER B., RICHE N., RAMOS G., CZER WINSKI M.:

Design study of LineSets, a novel set visualization technique.

IEEE Transactions on Visualization and Computer Graphics 17,

12 (2011), 2259–2267. doi:10.1109/TVCG.2011.186. 1,

3, 8, 9

[BCPS10a] B

RANDES U., CORNELSEN S., PAMPEL B., SAL-

LABERRY A.: Blocks of hypergraphs applied to hypergraphs

and outerplanarity. In 21st Int. Workshop on Combinatorial

Algorithms—IWOCA (2010), Iliopoulos C. S., Smyth W. F.,

(Eds.), vol. 6460 of Lecture Notes in Computer Science, Springer,

pp. 201–211. doi:10.1007/978-3-642-19222-7_21.

[BCPS10b] B

RANDES U., CORNELSEN S., PAMPEL B., SAL-

LABERRY A.: Path-based supports for hypergraphs. In

21st Int. Workshop on Combinatorial Algorithms—IWOCA

(2010), Iliopoulos C. S., Smyth W. F., (Eds.), vol. 6460

of Lecture Notes in Computer Science, Springer, pp. 20–33.

doi:10.1016/j.jda.2011.12.009. 2

[BE01] B

ERTAULT F., EADES P.: Drawing hypergraphs in the

subset standard. In 8th Int. Symp. on Graph Drawing (2001),

Marks J., (Ed.), vol. 1984 of Lecture Notes in Computer Science,

Springer, pp. 45–76. doi:10.1007/3-540-44541-2_15.

[BT06] B

YELAS H., TELEA A.: Visualization of areas of in-

terest in software architecture diagrams. In Proc. 2006 ACM

Symp. on Software Visualization (2006), Kraemer E., Bur-

nett M. M., Diehl S., (Eds.), SOFTVIS, ACM, pp. 105–114.

doi:10.1145/1148493.1148509. 2

[BT09] B

YELAS H., TELEA A.: Towards realism in draw-

ing areas of interest on architecture diagrams. Journal of

Visual Languages and Computing 20, 2 (2009), 110–128.

doi:10.1016/j.jvlc.2008.09.001. 2

[BvKM

∗

10] BUCHIN K., VAN KREVELD M. J., MEIJER

H., SPECKMANN B., VERBEEK K.: On planar supports

for hypergraphs. In 17th Int. Symp. on Graph Drawing

(2010), Eppstein D., Gansner E. R., (Eds.), vol. 5849 of

Lecture Notes in Computer Science, Springer, pp. 345–356.

doi:10.1007/978-3-642-11805-0_33. 2

[CC07] C

OLLINS C., CARPENDALE S.: VisLink: Revealing re-

lationships amongst visualizations. IEEE Transactions on Vi-

sualization and Computer Graphics 13, 6 (2007), 1192–1199.

doi:10.1109/TVCG.2007.70521. 2

[CPC09] C

OLLINS C., PENN G., CARPENDALE S.: Bub-

ble Sets: Revealing set relations with isocontours over ex-

isting visualizations. IEEE Transactions on Visualiza-

tion and Computer Graphics 15, 6 (2009), 1009–1016.

doi:10.1109/TVCG.2009.122. 1, 2, 3, 9

[dBCvKO08]

DE BERG M., CHEONG O., VAN KREVELD M.,

VERMARS M.: Computational Geometry: Algorithms and Ap-

plications. Springer, 2008. 6

[DBETT99] D

I BATTISTA G., EADES P., TAMASSIA R., TOLLIS

I. : Graph Drawing: Algorithms for the Visualization of Graphs.

Prentice-Hall, 1999. 2, 3

[DTH08] D

ENT B., TORGUSON J., HODLER T.: Cartography:

Thematic Map Design. McGraw-Hill, 2008. 6

[FMH08] F

REILER W., MATKOVIC K., HAUSER H.: Interac-

tive visual analysis of set-typed data. IEEE Transactions on Vi-

sualization and Computer Graphics 14, 6 (2008), 1340–1347.

doi:10.1109/TVCG.2008.144. 2

[FRM03] F

LOWER J., RODGERS P., MUTTON P.: Layout met-

rics for Euler diagrams. In Proc. Seventh Int. Conf. on Informa-

tion Visualization (2003), IEEE Computer Society, pp. 272–280.

doi:10.1109/IV.2003.1217990. 2, 4

[HRD10] H

ENRY RICHE N., DWYER T.: Untangling

Euler diagrams. IEEE Transactions on Visualiza-

tion and Computer Graphics 16, 6 (2010), 1090–1099.

doi:10.1109/TVCG.2010.210. 2

[JP87] J

OHNSON D. S., POLLAK H. O.: Hypergraph

planarity and the complexity of drawing Venn dia-

grams. Journal of Graph Theory 11, 3 (1987), 309–325.

doi:10.1002/jgt.3190110306. 2

[KC] K

RAUSE J., COLLINS C.: Bubble sets library. URL:

https://github.com/JosuaKrause/Bubble-Sets.

[KvKS09] K

AUFMANN M., VAN KREVELD M. J., SPECKMANN

B. : Subdivision drawings of hypergraphs. In 16th Int. Symp.

on Graph Drawing (2009), Tollis I. G., Patrignani M., (Eds.),

vol. 5417 of Lecture Notes in Computer Science, Springer,

pp. 396–407. doi:10.1007/978-3-642-00219-9_39.

[SA06] S

HNEIDERMAN B., ARIS A.: Network visualiza-

tion by semantic substrates. IEEE Transactions on Visu-

alization and Computer Graphics 12, 5 (2006), 733–740.

doi:10.1109/TVCG.2006.166. 2

[SAA09] S

IMONETTO P., AUBER D., ARCHAMBAULT

D.: Fully automatic visualisation of overlapping sets.

Computer Graphics Forum 28, 3 (2009), 967–974.

doi:10.1111/j.1467-8659.2009.01452.x. 2,

[SLK

∗

09] STREIT M., LEX A., KALKUSCH M., ZATLOUKAL

K., SCHMALSTIEG D.: Caleydo: connecting pathways and

gene expression. Bioinformatics 25, 20 (2009), 2760–2761.

doi:10.1093/bioinformatics/btp432. 2

[ST10] S

ANTAMARÍA R., THERÓN R.: Visualization of inter-

secting groups based on hypergraphs. IEICE Transactions on

Information and Systems 93-D, 7 (2010), 1957–1964. 2

[SWS

∗

11] STEINBERGER M., WALDNER M., STREIT M., LEX

A., SCHMALSTIEG D.: Context-preserving visual links. IEEE

Transactions on Visualization and Computer Graphics 17, 12

(2011), 2249–2258. doi:10.1109/TVCG.2011.183. 2, 3

[vDvKSW02]

VAN DIJK S., VAN KREVELD M. J., STRIJK

T., WOLFF A.: Towards an evaluation of quality for

names placement methods. International Journal of Ge-

ographical Information Science 16, 7 (2002), 641–661.

doi:10.1080/13658810210138742. 4

[Viv03] V

IVID SOLUTIONS: Java topology suite, 2003. 7

[WvdBH07] W

EIN R., VAN DEN BERG J. P., HALPERIN

D.: The visibility-Voronoi complex and its applica-

tions. Computational Geometry 36, 1 (2007), 66–87.

doi:10.1016/j.comgeo.2005.11.007. 5

 2012 The Author(s)

 2012 The Eurographics Association and Blackwell Publishing Ltd.

884

Bấm Tải xuống để xem toàn bộ.

Preview text:

DOI: 10.1111/j.1467-8659.2012.03080.x
Eurographics Conference on Visualization (EuroVis) 2012
Volume 31 (2012), Number 3
S. Bruckner, S. Miksch, and H. Pfister (Guest Editors)
Kelp Diagrams: Point Set Membership Visualization
Kasper Dinkla1, Marc J. van Kreveld2, Bettina Speckmann1, and Michel A. Westenberg1
1Eindhoven University of Technology, The Netherlands.
2Utrecht University, The Netherlands.
Figure 1: Kelp Diagrams applied to a metabolic network (left) and cities on a map (right). Abstract
We present Kelp Diagrams, a novel method to depict set relations over points, i.e., elements with predefined
positions. Our method creates schematic drawings and has been designed to take aesthetic quality, efficiency, and
effectiveness into account. This is achieved by a routing algorithm, which links elements that are part of the same
set by constructing minimum cost paths over a tangent visibility graph. There are two styles of Kelp Diagrams to
depict overlapping sets, a nested and a striped style, each with its own strengths and weaknesses. We compare Kelp
Diagrams with two existing methods and show that our approach provides a more consistent and clear depiction
of both element locations and their set relations.
Categories and Subject Descriptors (according to ACM CCS):
I.3.6 [Computer Graphics]: Picture/Image
Generation—Line and curve generation 1. Introduction
The two state of the art approaches, Bubble Sets [CPC09]
and LineSets [ARRC11], use colored shapes to visually con-
Visualization of one or multiple sets, possibly sharing sev-
nect elements that belong to the same set. Bubble Sets de-
eral elements, is a recurrent theme both inside and outside
rives an element density function for each set. Isolines are
the visualization community. Sometimes, depiction of the
extracted from each function to form shapes around the el-
sets is the main concern, excluding any other information
ements. These shapes are then connected further with links,
of contained elements besides some means of identification,
routed along the elements. LineSets draws a thick colored
i.e., a label. Here, simple visualizations suffice for a small
curve through the elements of each set, making sure that the
number of sets, such as a table with a mapping of elements
traversed path over the elements is relatively short. Both ap-
to rows and sets to columns, or a more space-efficient Eu-
proaches generate visualizations that often appear complex
ler diagram. Not every situation warrants such an approach.
and sometimes even convey invalid set memberships. In con-
Sometimes, other data aspects (partially) dictate the posi-
trast our method strictly controls where shapes of sets are
tions of the elements’ visual representations. For example,
placed. It consists of three phases (see Fig. 2): the alloca-
geographic places are often best positioned at their real-
tion of space around each element such that there is enough
world coordinates because this is consistent with the existing
room to depict its containing sets; the allocation of space for
knowledge of the observer and improves visual orientation.
connecting shapes between demarcation zones with a rout-
Furthermore, it allows spatial patterns to emerge. c 2012 The Author(s) Computer Graphics Forum c
2012 The Eurographics Association and Blackwell Publish-
ing Ltd. Published by Blackwell Publishing, 9600 Garsington Road, Oxford OX4 2DQ,
UK and 350 Main Street, Malden, MA 02148, USA. 876
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams
is an issue [BT06, BT09, CPC09]. Another option is to con-
nect highlighted elements with (colored) links [SA06,CC07,
SLK∗09]. This is referred to as visual linking, which can take
on sophisticated forms [SWS∗11]. However, visual linking
focuses on relating elements, not the comparison of sets in
a spatial setting. For both approaches the way in which con-
tours or links are placed affects the depiction of the sets but
Figure 2: The three phases of generating Kelp Diagrams.
also of the predefined visualization.
ing algorithm; and the generation of actual visualizations, by 3. Problem Analysis
using the allocated space, in two distinct ways.
An input problem instance consists of three aspects: posi-
In summary, our contribution is:
tioned elements, the sets that contain them, and the prede-
fined visualization that embeds the elements. We want to
• Two styles of diagram that emphasize different aspects of
depict these sets in combination with the predefined visu-
set memberships and overlap for elements with a prede-
alization. To determine what makes a good set depiction we fined position;
enumerate tasks that an observer may wish to perform by in-
• a routing algorithm for linking elements in a set to support
terpreting the visualization and hence the data. These tasks
the generation of such diagrams, where aesthetic quality,
impose constraints that any visualization has to fulfill when-
efficiency, and effectiveness are taken into account.
ever possible, but also provide (conflicting) optimization cri-
teria to improve task performance of the observer. 2. Related Work
Conveying information about multiple sets is a longstand-
Supported tasks. The composition of multiple elements
and sets—in a spatial setting—brings forth many questions
ing problem and exists in various forms. When the loca-
of a comparative nature: To what extent do sets overlap or
tion of the elements are not specified, then the input is
differ? How close to each other are the elements of a set? Is
simply a set system. Each set can also by interpreted as
the containment of an element in a set correlated to its po-
a hyperedge of a hypergraph defined on the elements (the
sition? We have compiled a list of primitive tasks that have
vertices of the hypergraph). There are several papers from
to be supported by a visualization such that an observer is
the graph drawing community [JP87, DBETT99] that dis-
able to answer these questions, or which the observer can
cuss how to draw hypergraphs. Most recent efforts have
perform ad-hoc to gain insight about the data:
focused on so called planar supports for hypergraphs and
the associated subdivision drawings [KvKS09] (see also
T1a determine the position of an element
[BCPS10a, BCPS10b, BvKM∗10] for further theoretical re-
T1b find an element by position (relative to landmarks)
sults on specific types of planar supports). Unfortunately
T1c estimate the density of elements in an area
most hypergraphs do not have planar supports and hence
subdivision drawings are of very limited practical use.
T2a determine which sets contain a specific element
T2b find the elements that belong to a specific set
When only sets (or hyperedges) are of concern, the de-
T2c estimate the spatial distribution of a specific set
piction possibilities are numerous. Euler diagrams are well-
known and use contours to denote areas that represent
T3 compose a (mental) set from existing sets with opera-
sets, which is sometimes referred to as the subset stan-
tions union, intersect and complement, and apply T1 – T2
dard. Such diagrams can be generated automatically; by
The tasks can be composed to answer more complex ques-
abstracting from individual elements [FRM03], or by in-
tions. When dealing with cities on a map, with a set of large
cluding representations of the elements and related infor-
cities and a set of industrial cities, the following questions
mation [BE01, SAA09, HRD10]. Here, the positions of el-
may be asked: Which cities are large but not industrial?
ements either do not matter as no elements are displayed,
(T2b and T3); Are industrial cities clustered together? (T2c);
or are assigned to optimize the visualization of the sets. The
Is New York considered large and/or industrial, and which
number of elements and sets influences the effectiveness of
neighboring cities are similar? (T1a, T2a, T3, and T2b). As
a visualization. For many elements and groups, augment-
shown, answering common questions involves the combina-
ing a contour-based visualization is a possibility [ST10],
tion and execution of these primitive tasks. Thus, improving
or a highly-interactive analysis environment is a necessity
the efficiency at which tasks can be performed, improves the [FMH08].
ease at which complex questions can be answered.
When dealing with predefined positions of elements, dis-
playing the sets becomes more difficult. If contours are
Constraints. The visualization has to satisfy the following
used, like Euler diagrams, a fair division of display space
constraints to enable T1 and T2: c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd.
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams 877
C1 every element is clearly represented in the final visual-
ization at its predefined position (for T1)
C2 every element is clearly marked or contained by a rep-
resentation of every set that it is a part of (for T2) (a) (b) (c)
These constraints are satisfiable, provided that all ele-
Figure 3: The added benefit of linking: (a) Elements are as-
ments are visually distinct, i.e., all elements are positioned
sociated solely by the colored shapes that contain them; (b)
at a discernible distance from each other. We assume this to
Elements are associated both by color and a common shape;
be the case because any predefined visualization has to sup-
(c) Spatial patterns are emphasized.
port T1 to be of practical purpose and should therefore have
visually distinct elements in the first place.
has to scan all elements in a linear fashion to determine their
Aesthetic criteria. The aforementioned constraints guaran-
corresponding sets. Incorporating distinguishing features
tee that all tasks can be performed but do not provide di-
such as color for the shapes is an effective approach. How-
rection towards an efficient visualization that allows (com-
ever, creation of a visual continuation of shapes (A2d) causes
posite) tasks to be performed by an observer with little ef-
an even stronger grouping effect (see Fig. 3), on which exist-
fort. Generation of aesthetic shapes has been a subject of
ing approaches rely as well [CPC09, ARRC11, SWS∗11]. In
research before for Euler diagrams [SAA09], and the gener-
certain situations, like the one shown in Fig. 3(c), strong con-
ation of effective graph representations, by reduction of in-
tinuation also emphasizes spatial patterns of elements and
tersections for example, is a common theme in graph draw-
sets. This includes improving the detection of spatial clus-
ing [DBETT99]. This has inspired us to list important prop-
ters (T2c). As is the case for other criteria, continuation is
erties that make for good shapes that depict sets.
not a strict constraint, i.e., elements that belong to the same
Shapes should have low cognitive load:
set do not have to be connected.
Shapes should not distort element position and density: A1a small area
A1b few and shallow bends
A3a little obfuscation of the predefined visualization
A1c few outline intersections
A3b strong correspondence between the presence and size
Not only do aesthetic shapes appeal to the observer, they
of a set’s shape, and the presence and density of elements
in general are accompanied by a low cognitive load, i.e., it
that belong to this set, in an area
takes less effort for the observer to process shapes and thus
Not only do the shapes affect the way in which the prede-
derive the information they are meant to convey. For a set de-
fined visualization is perceived through partial occlusion or
piction, the faster the shapes that convey sets are interpreted,
obfuscation, they also affect the way in which the elements
the faster T2 can be performed.
and their locations are perceived (A3a and A3b). For exam-
A small area (A1a) implies less surface to inspect and pro-
ple, when the set containment of an element is depicted with
cess, thus less cognitive load. Few and shallow bends (A1b)
a large colored circle, this circle will form a stronger visual
imply smooth outlines that are easy to distinguish from their
cue to the presence of an element than the element’s own de-
surroundings (the predefined visualization). It also means
piction (see Fig. 4(a)). However, this can also affect the per-
that outlines are short and therefore easier to process. In-
ceived position of the element (see Fig. 4(b)) and the density
tersections of outlines (A1c) make them harder to tell apart
of elements in an area (see Fig. 4(c)).
and discern the area they contain and the elements therein.
Many of the stated criteria are in conflict with each other: Shapes should be effective:
A2a with A2c, A1a with A1b and A1c because of the routing
that is required, A1c with A2d, and A3b with all other crite- A2a large area
ria. Defining an optimal visualization therefore not only re-
A2b large distance between outlines
A2c little overlap of shapes that depict different sets
A2d strong continuation of shapes that depict the same set
Shapes of large surface area (A2a) attract attention, mak-
ing the presence of a set explicit (T2). If outlines of shapes
are close to each other they are harder to tell apart, often (a) (b) (c)
causing the overall shapes to be less pronounced. Likewise,
Figure 4: The distorting effect of shapes on element depic-
overlapping shapes (A2c) may obfuscate each other and the
tion: (a) An element contained in a shape attracts more at-
information they should convey. Both impact T2 negatively.
tention; (b) The expected position of the element lies at the
For T2b—finding the elements that belong to a specific
center of the circle, which conflicts with its actual position;
set—to be performed efficiently, the elements have to be as-
(c) Both sets of elements have the same number of elements,
sociated as a group by the observer. Otherwise, the observer
but the difference in size of the shapes suggests otherwise. c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd. 878
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams
quires quantifying the individual criteria, it also requires the
criteria to be prioritized and combined into an overall defini-
tion of an optimum. Such an approach has been used for the
creation of aesthetically pleasing Euler diagrams [FRM03]
and label placement on maps [vDvKSW02], where different
criteria are weighted and then used as a fitness function. It
underlines the varying expectations that different observers (a) (b) (c) may have of visualizations.
Figure 5: Allocation of space for three elements: (a) Over-
Moreover, when criteria like A1c, which require routing,
lapping circles, centered over elements; (b) The Voronoi
are included in an overall optimization scheme, the combina-
faces of the elements’ positions; (c) Intersection of circles
torial complexity of the problem greatly increases and forces and Voronoi faces.
the use of heuristic algorithms.
diagram of {p(e)|e ∈ E} and then, for every element e ∈ E, 4. Approach
intersect the allocated circle of e with the Voronoi face that
Our approach consists of three phases: allocation of element
contains p(e) and use it as the new allocated space A(e) (see
space, allocation of additional link space, and the generation
Fig. 5). Hence, no allocated space intersects and elements
of visualizations (see Fig. 2). The following pseudo-code get a fair share of space.
provides an overview of the approach, where Sections 4.1,
The resulting space partition is stored as an embedded
4.2, and 4.3 elaborate on lines 1–2, 3–10, and 11, respec- graph
tively. Here, the elements are denoted as G E, the predefined
A = (VA, EA), with vertices VE for the elements, and vertices element positions as V
p(e) for e ∈ E, and the collection of
I for the intersection points between Voronoi faces
and circles, including the points that are shared by more
sets as S, where for every S ∈ S we have S ⊆ E.
than two Voronoi faces and lie within an element circle. The
Algorithm Kelp(E, S)
edges between these intersection points are either straight 1.
Derive Voronoi diagram of {p(e)|e ∈ E}.
(part of a Voronoi face) or circular. 2.
For every e ∈ E derive A(e) as the intersection of its
The allocated space of each element is referred to as a fair
Voronoi face and a circle of radius re, centered at p(e).
share—not an equal share—because sometimes elements do 3.
Derive embedded graph GA from {A(e)|e ∈ E}.
not receive space that is equal in surface area to their neigh- 4.
Derive tangent graph GT from GA.
boring elements (Fig. 6(a)). This unequal allocation could in 5.
while best-to-place link l between p, q ∈ S | S ∈ S has
certain cases affect the depiction of element density (A3b)
benefit b(p,q) > bt
negatively. Applying a repulsive force between the elements’ 6.
do Add edges of l in GT to subgraph GL(S).
circles, and shifting their positions accordingly, would in 7.
Derive all-pair shortest paths of GL(S).
many situations result in an equal division (Fig. 6(b)), but is 8.
Update GT with intersections introduced by l.
not applicable to all situations (Fig. 6(c)). Moreover, manip- 9.
Derive all-pair shortest paths of GT for all R ∈ S.
ulating the position or shape of the circles will almost always 10.
Derive next best-to-place link from shortest paths
harm the ability of an observer to find elements by position
in GL(R) and GT for all R ∈ S.
(T1a) and to determine element density in an area (A3b), see
11. Derive visualization style from all GL(S) | S ∈ S.
also Fig. 4. The intersection of an element’s Voronoi face
and circle is simple and allows for visual reinforcement of
4.1. Element space (phase 1)
the element’s position, which is discussed in Section 5.
T2 requires that for each element it is clear to which sets
it belongs, hence each element should have ample (and at
least equally divided) surrounding space to display its sets. e e1 e2 e3 e
Taking the trade-off between A2a and A2c into account, we 2
want the observer to have a level of control on how much e1 e3
space is used for the set visualization. Given such fixed area
per element and considering A1a and A1b, the natural choice (a) (b) (c)
of area to allocate around an element is a circle of radius re.
Figure 6: Area division between elements: (a) Our method
It is not always possible to allocate a perfect circle around
allocates an unequal share of space; (b) Possible outcome
each element. Sometimes the distance between two elements
of a force-based relocation of the circles; (c) Instance where
is smaller than 2re, causing the circles to overlap, or smaller
allocation of equal space for element e is not possible with-
than re, causing circles to overlap each other’s elements. To
out a more complex shape.
resolve this space contention, we first calculate the Voronoi c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd.
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams 879
4.2. Link space (phase 2) l1 lr
A2d states that we should visually connect or link those ele- l1
ments that belong to the same set such that elements from the p q p q
same set are more easily associated with each other. How-
ever, any additional link will result in a more complex visu- l l a b
alization, negatively affecting most other criteria. Therefore, (a) (b)
we have to allocate link space for every S ∈ S, such that the
Figure 9: Benefit of placing a link between p
advantages of the links in the final visualization outweigh
, q ∈ S, S ∈ S is
dependent on already placed links: (a) The benefit of placing
their disadvantages, or cost, as much as possible.
la is low because already placed l1 has low cost; (b) The
We first consider the cost c(l) of placing a link l between
benefit of placing lb is high because already placed chain of
p, q ∈ S, where S ∈ S, in the scene. It can be modeled in
links l1, l2, ..., lr has high cost.
a simple way, where c(l) = cdd(l) + cαα(l) + cII(l). Here,
d(l) is the distance covered by l; α(l) is the aggregate angu-
lar change of l in radians; I(l) is the number of intersections
Now, any link l of set S ∈ S can be decomposed into a se-
between the contours of l and the contours of links already
quence of touching edges {e1,e2,...,er} ⊆ ET , and c(l) can
placed in the scene; and cd, c , and α
cI are weight parameters.
then be derived from GT by summing the costs of the indi-
Distance is penalized in accordance with A1a, aggregate an-
vidual edges. Based on these costs, it is possible to compute
gular change with A1b, and intersections with A1c.
a minimum cost path between p, q ∈ E in GT and thus get l.
No intersections should exist between l and any allocated
Using only cost as a criterion for placing links is not
space A(e) where e ∈ E\{p,q}, as l has no right to use space
enough when we want to connect elements beyond a span-
of A(e). Also, c(l) dictates that l runs directly adjacent to any
ning tree. When spanning trees have been established for all
A(e) that it passes, because tightening l, without altering its
sets, additional low-cost links are not always of great benefit
topology w.r.t. A(e), will always lower d(l) and α(l) without
to the visualization when they are placed. Consider the situ-
changing I(l) (see Fig. 7).
ations depicted in Fig. 9. For Fig. 9(a) the benefit of placing
link la is low because a low cost link l1 is already in place l
to provide ample visual linking between the elements. For l
Fig. 9(b) the benefit of placing link lb is high because the p q p q
links that already connect p and q sum to a high cost, which
means that in the current situation it is hard for the observer (a) (b)
to follow links from p to q, while placement of lb would
make this task considerably easier.
Figure 7: Two possible paths of a link l between p, q ∈ E (in
red): (a) l with unnecessarily high cost; (b) l with identical
Let GL(S), for every S ∈ S, be a subgraph of GT , which
topology but minimized cost.
contains only edges of links that have so far been added for S.
We define the benefit of placing a lowest cost link l between
As can be seen in Fig. 7, this tightness means that any de- , where
sirable link can be constructed from the edges in
p, q ∈ S, as b(p, q) = d(p,q)
d(p, q) is the minimum G c(l) A when
certain (tangent) edges are added to it, similar to robot mo-
path distance between p and q in GL(S). Hence, the benefit
tion planning with tangent visibility graphs [WvdBH07]. So
is higher when the cost of placing l is lower or the smallest we extend
distance via already placed links is higher. When G p and q
A to form GT = (VT , ET ), which includes (tan- gent) edges between are not connected in
A(p) and A(q), p and A(q), A(p) and
GL(S), then d(p, q) = ∞. In case the
benefit of links has to be compared for disconnected vertices
q, and p and q. In addition, for every v ∈ VI, we have edges to every of
p ∈ E and tangent edges to A(p) (see Fig. 8). This
GL(S), we compare only by link cost.
also adds vertices at the tangent points, which split up the
Given these definitions, the algorithm places links in a original circular edges.
greedy manner: Determine S ∈ S and p,q ∈ S with highest
b(p, q). If b(p, q) > bt , where bt is a benefit threshold pa-
rameter, add link l to GL(S) and repeat the algorithm. When
b(p, q) ≤ b v
t , the algorithm terminates. p q p q
In our approach so far, routed links have zero width. How-
ever, routing links of parameterized radius rl (width 2rl) is (a) (b) (c)
achieved by increasing element space allocation from re to
Figure 8: Examples of the additional edges in E
re + rl (see Fig. 10). T (in red):
(a) Tangent edges between A(p) and A(q), p, q ∈ E; (b)
When a link is placed, intersection information is updated
Edges from p to q and A(q); (c) Edges from v ∈ VI.
for every edge e ∈ ET , such that we know the number of c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd. 880
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams
set membership depictions are bound by the space that was c1
allocated for the element. In addition, depictions are scaled
to nest. Links and element circles thus do not become visu- c2
ally dominant when many sets share an element or path. (a) (b) (c)
Links of a set are scaled to nest with links of other sets
that partially share a path in GL. The order in which sets are
Figure 10: Link radius: (a) Increase of element space allo-
nested is based on the size of sets, i.e., smaller sets nest in
cation by rl; (b) Link with allocated space of radius rl and its
larger sets, which is consistent throughout the diagram. The
contours c1 and c2; (c) Two links routed beside each other.
scaling of both element circles and links, by their level of
nesting, depends on parameters se and sl, respectively, such
that the share of space allocated to every set can be adjusted
intersections of e’s contours (dilation of e by rl) with con-
to compensate for the effects of perceptual scaling [DTH08].
tours of already placed links, i.e., dilation of el by rl for all The shapes e
N(S), for S ∈ S, are drawn on top of each other,
l of GL(S), S ∈ S. Tracking intersections between just the
edges is not sound as it is possible for edges to be placed
ordered by |S|, where N(S) is filled with a color and given next to each other within 2
a gray outline to enhance contrast between shapes of differ-
rl. Links routed over such edges
would therefore intersect without receiving the right inter-
ent sets. In addition, the shapes are dilated and eroded to
section penalty. To accommodate routing multiple links ad-
smoothen the corners of the allocated element space and the
jacent to each other and around the same element space (see
transition between links and elements. A strong erosion re- Fig. 10(c)), we also extend
sults in a clear separation between shapes that are not nested
GT to include A(e), and corre-
sponding (tangent) edges, dilated with steps of 2
(A2b), as seen at the top of Fig. 11. rl.
Overlapping contours are not counted as intersections.
Striped style. Striped Kelp uses alternating stripes for ar-
This allows two links l1 and l2 to share part of the same
eas that contain elements of multiple sets (see bottom of
path with few intersections I(l1) and I(l2). Low-cost sharing
Fig. 11). The allocated space A(e) of e ∈ E is partitioned
of paths is exactly what we want to properly convey overlap-
into radial slices, where every set that contains e gets the
ping sets, as explained in Section 5.
same number of slices in an alternating pattern. Edges of GL
that belong to a set have link radius rl. When an edge belongs
to multiple sets (links partially share a path), it is partitioned
4.3. Visual styles (phase 3)
into stripes of fixed length where the sets give an alternating
The space allocated for elements and their sets’ visual links pattern.
can be used in various ways. We devised two very different
Stripes are drawn as consistently as possible. If two links
diagram styles: nested and striped Kelp.
Nested style. Nested Kelp surrounds elements of every set
with a colored shape (see Fig. 11 (top)), where the shapes
of every set are stacked on top of each other. It relies on the
ability of the observer to mentally distinguish and complete
shapes that are partially overlapped by other shapes. The al-
located space for elements and links provides a framework
to define the diagram such that every shape is sufficiently
visible, in correspondence with constraint C2.
We construct the set of shapes N(S) for every S ∈ S as
follows: For every element e ∈ E, assume S1,S2,...,Sr con-
tain e, ordered such that |Si| ≤ |Sj| where i < j. For every Si,
scale the allocated space A(e) around its centroid by a factor
( i )se , where s r
e is a parameter, and merge it into N(Si).
For every edge e ∈ EL, let Q be the set of edges reachable
from e in the union of all GL(S),S ∈ S, without passing be-
yond an element node. Assume that S1,S2,...,Sr contain an
edge in Q, ordered such that |Si| ≤ |Sj| where i < j. Then,
for every Si, dilate e by rl( i )sl , where s r l is a parameter, and
merge it into N(Si). Here dilate and erode are the equivalents
of Minkowski sum and Minkowski subtraction with a circle
Figure 11: Kelp Diagram of eleven elements and three sets.
of a specified radius [dBCvKO08]. Thus, for every element,
Top: Nested style. Bottom: Striped style. c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd.
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams 881 (a) (b) (c) (d) (e) (f)
Figure 12: Kelp applied to the capital cities of the European Union, where the eurozone is blue, the EU founding members
(European Coal and Steel Community) are pink, and members with good, average, and bad credit rating are green, orange, and
red respectively (Standard & Poor’s, October 2011). The diagrams have various configurations: (a) Nested style without links;
(b) Nested style with links; (c) Striped style; (d) Nested style with large element radius re and small link radius rl; (e) Nested
style with large intersection penalty cI; (f) Nested style with low link addition threshold bt .
share edges but eventually split up, the stripes will continue
a link is placed, any intersections that it introduces have
along the edges at the split. This is achieved by drawing
to be accounted for. Thus, intersections are determined for
the stripes via a depth-first search in the GL graphs. Cer-
all edge pairs, which amounts to O(|E|2 log|E|) time with a
tain cyclic configurations of links do not allow for consistent
sweepline algorithm. It is reasonable to assume that the al-
stripes, but these are rare enough in practice.
gorithm’s parameters are configured such that for every set
S ∈ S the final GS(S) is planar and therefore O(|E|) links are placed for
4.4. Implementation and performance
S. The entire links placement phase therefore
takes O(|S|2|E|3 log|E|) time. This also bounds the running
We implemented our approach in Java, using the Java Topol-
time of the entire approach, with the space allocation and
ogy Suite [Viv03] for geometric operations such as dilation
visualization phases being relatively cheap.
and erosion. One advantage of our purely geometric routing
and diagram definitions is that vector graphics can be gen-
erated and merged with the predefined (vector) visualization
to get images that can be rendered with arbitrary resolution.
The bound (though not tight) indicates that the current ap-
The running time of the algorithm is dominated by the it-
proach cannot be applied to data sets of thousands of ele-
erative placement of links. The tangent graph GT contains
ments. However, most data sets commonly used in this vi-
O(|E|2) edges and nodes. To place one link, shortest paths
sualization problem do not go beyond a hundred elements
are computed to all nodes in GT from every element and
and several sets, since larger sets cannot be comprehended
for every set. The same applies to all GS graphs. This com-
by an observer. Our implementation is able to generate Kelp
putation, including selection of the best link, amounts to
diagrams for such data sets within five minutes on a modern
O(|S||E|2 log |E|) when Dijkstra’s algorithm is used. After
desktop computer (Intel Core 2 Quad CPU at 2.4 GHz). c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd. 882
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams (a) (b) (c)
Figure 13: Restaurants of three categories in Seattle: (a) Bubble Sets approach, generated with a public software library [KC];
(b) LineSets approach, image taken from [ARRC11]; (c) Nested Kelp Diagram.
5. Results and Discussion
ier on the eyes and likely the more versatile style in many
Fig. 12 shows Kelp Diagrams with various parameter config-
situations. It also happens to be the closest visual match to
urations. The benefit of visually linking elements that belong existing techniques.
to the same set follows from Fig. 12(a) and (b). Increasing
Fig. 13 and 14 compare the nested style with the Bub-
the element radius re (see Fig. 12(b) and (d)) causes more
ble Sets and LineSets techniques for the same datasets. We
of the original map to be occluded, making it harder to find
can see that the Bubble Sets approach has many bends in its
capital cities by their location, but sets are more easily dis-
contours. In some locations, shapes and contours have un-
tinguished. Especially when multiple links share a path, the
desirable overlap. Compared to the Kelp Diagram, the Bub-
sets of the nested style are harder to distinguish, requiring
ble Sets depiction introduces more overlapping areas. This is
larger rl. The striped style suffers less from this. A com-
not surprising since the Bubble Sets algorithm does not try
parison of Fig. 12(b) and (e) reveals the effect of increasing
to avoid intersections between links where Kelp does. In ad-
the intersection penalty cI, where the algorithm deems it of
dition, Bubble Sets create shapes that use a lot of space and
greater benefit to route a long link around Helsinki instead
color blending introduces new colors for overlapping shapes
of introducing additional intersections. The effect of the link
that may be interpreted as additional, non-existing sets.
addition threshold bt is shown in Fig. 12(f), where a low bt
results in the construction of spanning trees. Kelp can be ap-
Bubble Sets and LineSets cannot visually connect ele-
plied beyond cartography, as shown for a metabolic network
ments beyond creating a spanning tree, whereas Kelp cre-
in Fig. 1. Here, nested element scaling s
ates a graph to interconnect elements as much as desired e has been given
a very low value to push the contours away from labels of
via parameter bt. Additional links, beyond a spanning tree, compounds.
may not always be desired. For example, Fig. 14(c) has been
generated with a relatively high b
The strengths and weaknesses of the nested and striped
t . For this diagram, some
links of the brown set introduce some additional visual clut-
styles can be seen in Fig. 12(b) and (c). When multiple sets
ter (A1a, A1b, and A3a) though improve interpretation of
share the same link, e.g., Amsterdam-Berlin, the sets of the
spatial distributions (A2d and A3b). However, when b
nested style quickly become harder to distinguish, whereas t is set
to a low value, surplus links will only be placed when con-
the striped style has no such problem provided that the link
tinuation greatly improves. This is the case for the green set
is long enough to fit ample stripes for each set. The striped of Fig. 13.
style also prevents false assumptions to be made about the
nested structure of the depicted sets. The nesting at Ljub-
LineSets do not route shapes around elements that should
ljana suggests that the blue set is a subset of the green set,
not be contained by them, which the other techniques do.
while they actually only overlap. However, the striped style
Hence, in Fig. 14(b) we see an element of the brown set at
ignores some of the criteria listed in Section 3. The stripes
the middle left that overlaps with a shape of the orange set,
of a shared link or node break the continuation of the sets’
while it is not a part of the orange set. Moreover, LineSets
shapes, which explains why the shapes of the nested style
connect elements of a set with a single line, creating longer
are more easily interpreted as a whole. Stripes also increase
links, many bends, and intersections. For example, there is a
the visual complexity of the diagram because there are more
line with a strong bend at the bottom of Fig. 13(b) that could
and stronger bends (A1b). Therefore, the nested style is eas- have been avoided. c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd.
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams 883 (a) (b) (c)
Figure 14: Disjoint sets of locations in Manhattan, with hotels (orange), subway stations (brown), and medical clinics (purple):
(a) Bubble Sets, image taken from [CPC09]; (b) LineSets, image courtesy B. Alper and N. Riche; (c) Nested Kelp Diagram.
Even though the element glyphs are very small in
Still, several improvements are possible. The routing al-
Fig. 13(c), we can immediately see the presence and den-
gorithm is too slow for interactive use. Exploiting locality
sity of elements because the surrounding contours are (par-
with spatial data structures, or replacing the tangent visibility
tially) circular. For LineSets, the presence of an element can
graph with a graph of smaller complexity, are possible op-
be inferred from the presence of a bend, while for Bub-
tions. Also the greedy placement of links can be improved to
ble Sets the large shapes make elements harder to spot (see
reduce visual clutter. For nested Kelp, improved derivation
Fig. 13(a)). Kelp’s composition of basic visual units—circles
of link width and nesting order with respect to aesthetic cri-
and constant-width connections—is an advantage over Bub-
teria requires further investigation. For example, links could
ble Sets. Depictions are consistent, which means less visual
be given widths according to an additive scheme, where the
clutter and easier interpretation.
width of a link is increased when it has to convey multiple
Kelp Diagrams have some drawbacks. They sometimes
sets. In addition, allocated element and link space could be
have sharp bends where two links of the same set converge.
made dependent on the element density in an area.
Also, contours can become complex for clusters of elements
Other extensions are possible, such as supporting ele-
that belong to the same set. LineSets suffer from this as well,
ments with dimension (instead of points) and automated
in contrast to Bubble Sets. In addition, when elements are
derivation of parameter settings based on the data itself.
too close to each other, their Voronoi faces become domi-
Use of the subset standard, as is the case for Kelp Dia-
nant in space allocation and cause strong corners to occur in
grams, has been indicated to be effective via user studies
nearby contours. The schematic appearance of Kelp on top
before [ARRC11]. Nonetheless, we plan to investigate the
of a predefined visualization that is schematic as well can be
effectiveness of the presented diagrams, and compare them
confusing, e.g., the set shapes may be interpreted as roads or
to existing techniques, with further user studies.
metro lines. The more organic appearance of LineSets and
Bubble Sets make them stand out from a map.
Current approaches share a great deal of similarities in an
attempt to solve the same problem. It would be interesting to
explore a more generic method, which is able to produce the 6. Conclusion
distinct visualizations of current approaches, but also able to
Kelp Diagrams are a new way to depict set relations over
produce hybrid visualizations of greater quality.
already positioned elements. The algorithm balances visual
complexity, based on aesthetic criteria, with effective depic-
Acknowledgements. We would like to thank Nathalie
tion of the data. Comparison of resulting visualizations with
Riche, Basak Alper, and their colleagues for providing us
those generated by two state of the art approaches for the
with the data for Fig. 13 and 14. B. Speckmann and K. Din-
same data shows that Kelp Diagrams have a consistent, easy
kla are supported by the Netherlands Organisation for Sci-
to interpret, appearance. In addition, the parameters of the
entific Research (NWO) under projects no. 639.022.707 and
algorithm give it the flexibility that is required for applica- no. 612.001.004, respectively.
tion to different kinds of visualization. c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd. 884
K. Dinkla & M. van Kreveld & B. Speckmann & M. Westenberg / Kelp Diagrams References
tion Visualization (2003), IEEE Computer Society, pp. 272–280.
doi:10.1109/IV.2003.1217990. 2, 4
[ARRC11] ALPER B., RICHE N., RAMOS G., CZERWINSKI M.:
Design study of LineSets, a novel set visualization technique.
[HRD10] HENRY RICHE N., DWYER T.: Untangling
IEEE Transactions on Visualization and Computer Graphics 17, Euler diagrams. IEEE Transactions on Visualiza-
12 (2011), 2259–2267. doi:10.1109/TVCG.2011.186. 1,
tion and Computer Graphics 16, 6 (2010), 1090–1099. 3, 8, 9 doi:10.1109/TVCG.2010.210. 2
[BCPS10a] BRANDES U., CORNELSEN S., PAMPEL B., SAL-
[JP87] JOHNSON D. S., POLLAK H. O.: Hypergraph
LABERRY A.: Blocks of hypergraphs applied to hypergraphs
planarity and the complexity of drawing Venn dia-
and outerplanarity. In 21st Int. Workshop on Combinatorial grams.
Journal of Graph Theory 11, 3 (1987), 309–325.
Algorithms—IWOCA (2010), Iliopoulos C. S., Smyth W. F., doi:10.1002/jgt.3190110306. 2
(Eds.), vol. 6460 of Lecture Notes in Computer Science, Springer,
[KC] KRAUSE J., COLLINS C.: Bubble sets library. URL:
pp. 201–211. doi:10.1007/978-3-642-19222-7_21.
https://github.com/JosuaKrause/Bubble-Sets. 2 8
[BCPS10b] BRANDES U., CORNELSEN S., PAMPEL B., SAL-
[KvKS09] KAUFMANN M., VAN KREVELD M. J., SPECKMANN LABERRY A.:
Path-based supports for hypergraphs. In
B.: Subdivision drawings of hypergraphs. In 16th Int. Symp.
21st Int. Workshop on Combinatorial Algorithms—IWOCA
on Graph Drawing (2009), Tollis I. G., Patrignani M., (Eds.),
(2010), Iliopoulos C. S., Smyth W. F., (Eds.), vol. 6460
vol. 5417 of Lecture Notes in Computer Science, Springer,
of Lecture Notes in Computer Science, Springer, pp. 20–33.
pp. 396–407. doi:10.1007/978-3-642-00219-9_39.
doi:10.1016/j.jda.2011.12.009. 2 2
[BE01] BERTAULT F., EADES P.: Drawing hypergraphs in the
[SA06] SHNEIDERMAN B., ARIS A.: Network visualiza-
subset standard. In 8th Int. Symp. on Graph Drawing (2001), tion by semantic substrates.
IEEE Transactions on Visu-
Marks J., (Ed.), vol. 1984 of Lecture Notes in Computer Science,
alization and Computer Graphics 12, 5 (2006), 733–740.
Springer, pp. 45–76. doi:10.1007/3-540-44541-2_15. doi:10.1109/TVCG.2006.166. 2 2 [SAA09] SIMONETTO P., AUBER D., ARCHAMBAULT
[BT06] BYELAS H., TELEA A.: Visualization of areas of in- D.:
Fully automatic visualisation of overlapping sets.
terest in software architecture diagrams. In Proc. 2006 ACM Computer Graphics Forum 28, 3 (2009), 967–974.
Symp. on Software Visualization (2006), Kraemer E., Bur-
doi:10.1111/j.1467-8659.2009.01452.x. 2,
nett M. M., Diehl S., (Eds.), SOFTVIS, ACM, pp. 105–114. 3 doi:10.1145/1148493.1148509. 2
[SLK∗09] STREIT M., LEX A., KALKUSCH M., ZATLOUKAL [BT09] BYELAS H., TELEA A.: Towards realism in draw- K., SCHMALSTIEG D.:
Caleydo: connecting pathways and
ing areas of interest on architecture diagrams. Journal of gene expression.
Bioinformatics 25, 20 (2009), 2760–2761.
Visual Languages and Computing 20, 2 (2009), 110–128.
doi:10.1093/bioinformatics/btp432. 2
doi:10.1016/j.jvlc.2008.09.001. 2
[ST10] SANTAMARÍA R., THERÓN R.: Visualization of inter-
[BvKM∗10] BUCHIN K., VAN KREVELD M. J., MEIJER
secting groups based on hypergraphs. IEICE Transactions on H., SPECKMANN B., VERBEEK K.: On planar supports
Information and Systems 93-D, 7 (2010), 1957–1964. 2 for hypergraphs.
In 17th Int. Symp. on Graph Drawing
(2010), Eppstein D., Gansner E. R., (Eds.), vol. 5849 of
[SWS∗11] STEINBERGER M., WALDNER M., STREIT M., LEX
Lecture Notes in Computer Science, Springer, pp. 345–356.
A., SCHMALSTIEG D.: Context-preserving visual links. IEEE
doi:10.1007/978-3-642-11805-0_33. 2
Transactions on Visualization and Computer Graphics 17, 12
(2011), 2249–2258. doi:10.1109/TVCG.2011.183. 2, 3
[CC07] COLLINS C., CARPENDALE S.: VisLink: Revealing re-
lationships amongst visualizations. IEEE Transactions on Vi-
[vDvKSW02] VAN DIJK S., VAN KREVELD M. J., STRIJK
sualization and Computer Graphics 13, 6 (2007), 1192–1199. T., WOLFF A.:
Towards an evaluation of quality for doi:10.1109/TVCG.2007.70521. 2 names placement methods.
International Journal of Ge-
ographical Information Science 16, 7 (2002), 641–661.
[CPC09] COLLINS C., PENN G., CARPENDALE S.: Bub-
doi:10.1080/13658810210138742. 4
ble Sets: Revealing set relations with isocontours over ex- isting visualizations.
IEEE Transactions on Visualiza-
[Viv03] VIVID SOLUTIONS: Java topology suite, 2003. 7
tion and Computer Graphics 15, 6 (2009), 1009–1016.
[WvdBH07] WEIN R., VAN DEN BERG J. P., HALPERIN
doi:10.1109/TVCG.2009.122. 1, 2, 3, 9 D.:
The visibility-Voronoi complex and its applica-
[dBCvKO08] DE BERG M., CHEONG O., VAN KREVELD M., tions.
Computational Geometry 36, 1 (2007), 66–87.
OVERMARS M.: Computational Geometry: Algorithms and Ap-
doi:10.1016/j.comgeo.2005.11.007. 5
plications. Springer, 2008. 6
[DBETT99] DI BATTISTA G., EADES P., TAMASSIA R., TOLLIS
I.: Graph Drawing: Algorithms for the Visualization of Graphs. Prentice-Hall, 1999. 2, 3
[DTH08] DENT B., TORGUSON J., HODLER T.: Cartography:
Thematic Map Design. McGraw-Hill, 2008. 6
[FMH08] FREILER W., MATKOVIC K., HAUSER H.: Interac-
tive visual analysis of set-typed data. IEEE Transactions on Vi-
sualization and Computer Graphics 14, 6 (2008), 1340–1347. doi:10.1109/TVCG.2008.144. 2
[FRM03] FLOWER J., RODGERS P., MUTTON P.: Layout met-
rics for Euler diagrams. In Proc. Seventh Int. Conf. on Informa- c 2012 The Author(s) c
2012 The Eurographics Association and Blackwell Publishing Ltd.

Kelp Diagrams| Giáo trình quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội

Tài liệu liên quan:

Perception in Visualization| Tài liệu tham khảo môn quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội

39 studies about human perception in 30 minutes| Tài liệu tham khảo môn quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội

My steps to learn about Apache NiFi| Tài liệu tham khảo môn quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội

Text Visualization Browser| Tài liệu tham khảo môn quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội

Data Warehouse and OLAP| Tài liệu tham khảo môn quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội