Please disable Adblockers and enable JavaScript for domain CEWebS.cs.univie.ac.at! We have NO ADS, but they may interfere with some of our course material.

Q6

Ad Chapter 7., figure 7.3.: This figure is an example where there is no order in the different manifestations of the key value 'species'. How do I make sure that it is clear that the order in my bar chart is not intended but just arbitrary? It could also be read as a range from the least important/expensive/cute/… to the most important/expensive/cute/… animal.

- bar chart is the best thing you can do in this case. Just don't make a line chart out of it.

I cannot distinguish many differences betweeen adjacency matrix view and cluster heatmap, besides that heatmaps contain more complex datasets. Can heatmap also be used for networks?

- They are similar, yet different concepts in its use. An adjacency matrix is specific for networks, heatmaps are dense, typically with all cells filled.

I find that dotplots and scatterplots are very similiar and that the dotplots should be avoided unless there is a trendline between data points, like in the example presented by the author in Figure 7.18.Is there a reason why to use dotplot instead of bar chart?

- Very good question, that I don't have an immediate answer and I'll think about it.

In Chapter 8,the author describes that when we have categorical data we use categorical colormaps and that is better not to use hue, saturation, luminance of a color, but to use completely different color. But, in the case we have many categories, isn't it better to use one color in order to avoid too much variegated graphics, that make unclear, too detailed visualizations?

- You can never encode infinitely many items. In the end, you'll have to tradeoff what to encode.

Figure 7.5: I don't get the description text, does it match the picture?

- correct -- a+b is mixed.

"This heatmap uses a diverging red-green colormap, as is common in the genomics domain. (In this domain there is a strong conventionfor the meaning of red and green that arose from raw images created by the optical microarray sensors that record uorescence at speci c wavelengths." - is it realy so important that a red-gree betmap used is. and "How did this strong convention for red and green in genomics develop? Why is it used and would be a greyscale a better alternative for colorblind people?"

- How did it develop -- Tamara states this 'raw images created by the optical microarray sensors that record uorescence at speci c wavelengths'. What alternatives should be used? Anything but red-green! :) Maybe red-blue, or green-blue, or … monochrome too, depending on the specifics of its use.

Is there some kinf of isolines for 3D fealds?

- They are called iso-surfaces, see later in lecture.

"Wird in der Praxis wirklich bei den Grafiken darauf geachtet, dass diese auch lesbar/interpretierbar für Farbenblinde sind?" and "Wäre es nicht sinnvoll für Farbenblinde User zusätzlich eine Art Farbblinden-Variante von Visualisierung zu erstellen, sodass User manuell zwischen Normaler und Farbblinden Variante navigieren können? Gibt es bereits solche Gedanken?" and "Da bei der Cluster Heatmap erwähnt wird, dass sie nicht für Farbenblinde geeignet ist, stellt sich mir die Frage, inwieweit in der Visualisierung überhaupt auf Farbenblinde geachtet wird / geachtet werden kann?"

- colorbrewer allows you to pick colormaps appropriate for colour-blind folk. Und ja -- man solute schon darauf achten und es its sehr bekannt rot-gruen zu vermeiden!

Does the optimation of Diagramms for people whith blindness not affekt the cognition of people whithout color blindness?

- No, Why?

Gibt es vielleicht konkrete Beispiele/Grafiken zu einer "cluster heatmap"? Durch den Text habe ich nicht ganz verstanden wie diese Art der Darstellung funktioniert.

- Viele, aber dafür müsste man sich mit konkreten Anwendungen beschäftigen. Z.B. http://chase.cs.univie.ac.at/publications

Im Kapitel 8 wird unter anderem die Farbsättigung besprochen. Macht es hier wirklich einen Unterschied ob zum Beispiel die Grundfarben heller oder dünkler gemacht werden damit die Grafik besser aussieht (natürlich Geschmackssache)? Wenn man dann darüber spricht kann man genauso zu einem hellerem Rot einfach "Rot" sagen.

- Ja, die erkennbaren Farbstufen nehmen ab/zu. Siehe auch die Farbpaletten im Munsell color space.

http://en.wikipedia.org/wiki/Munsell_color_system

Ich verstehe leider nicht, was mit Glyphmaps kodiert bzw. dargestellt werden soll. Könnte die Grafik 151 im Kurs bitte näher erläuter werden?

- Im Kurs ist vielleicht nicht geplant, aber die grundlegende Frage ist -- ist die x-Achse horizontal oder radial? Daraus ergeben sich unterschiedliche Patterns, welche ikonifizert erkannt werden.

"Im Buch fehlen leider noch einige Beispiele, könnten diese bitte im Kurs erläutert werden? (Anspielung vor allem auf die Seiten 144+145, und hier besonders auf Choropleth Maps, Streamlines und Superquadric Glyphs)." and "Was genau kann man sich unter einem „glyph object“ vorstellen"

- ja, aber mehr gegen Ostern. (details jetzt auch im letzten update des Buches)

Was versucht Figur 8.1 zu beschreiben? Weder der entsprechende Abschnitt noch die Erläuterung der Figur helfen mir zu verstehen, was hier eigentlich dargestellt wird.

- Was ist denn unklar? Hier wird ein Zusammenhang zwischen drei Konzepten hergestellt -- categorical, sequentiell (ordered!) und divergent (zwei Seiten) datenTypen. Wie können diese kombiniert werden in eine 2D colormap?!

Warum sollte bei Figure 7.3 die alphabetische Anordnung der Namen als abgeleitetes Attribut gesehen werden?

- Weil explizit keine Ordnung der Daten gegeben ist.

What is an ID Transfer Function?

- It's really a colormap that translates a value into a 'colour'.

Why if two neighboring axes have highpositive correlation, the line segments are mostly parallel?

-y=x is a highly positive relation!

On the theoretical side, why the safest strategy is to avoid using only the hue channel to encode information?

- Under what circumstances? This is not true in general.

Wie effizient ist eine Kodierung, die mithilfe eines Radial multipodes plot kodiert wird? Wenn ich nicht irre, muss die Breite der Felder quasi willkürlich angepasst werden, um Winkel auf diese Art und Weise korrekt zu kodieren. Diese Verbreiterung oder Verschmälerung führt zwangläufig zu einer Veränderung des Flächeninhalts, obwohl dieser gar keiner Kodierung unterliegt, sondern lediglich die Höhe des Balkens einen Aussagewert hat. Der veränderte Flächeninhalt führt jedoch zu einer veränderten, evtl. falchen gewichteten Interpretation dieses Feldes. Ein Radial star plot, sofern ich es richtig verstehe, kodiert exakt dieselben Inhalte wie der Radial multipodes plot, ohne jedoch Spielraum für diese Misinterpretation zu lassen.

- Leider verstehe ich das Bedenken nicht ganz, die Idee des multipodes Plot ist die Fläche linear von der Länge abhängig zu machen und nicht quadratisch (wie das der Fall des von bisherigen radial plots war).

Wozu dient der Baum bei der Erläuterung der GrouseFlocks (Seite 170)? Der Baum (anders als die GrouseFlocks selbst) stellt den Graph der linken Seite nicht korrekt dar, da gewisse Informationen verloren gehen. Wenn der Baum lediglich zum besseren Verständnis der Grouseflocks dienen soll, dann sollte dies besser hervorgehoben / erläutert werden.

- Die Baeume in diesem Fall sind verschiedene Ansichten (durch Interaktion) des Graphen. Diese können durch Manipulationen / Interaktionen während einer iterativen Analyse entstanden sein.

5. Anmerkung: Auch wenn wir dieses Problem bereits kurz besprochen haben, so möchte ich dennoch noch einmal kurz anmerken, dass die Aussage "Trees have the node density of 1" (S. 171) aus einer streng mathematischen Sicht eigentlich falsch ist. Auch wenn dieses Buch sich vornehmlich nicht an Mathematiker richtet, so würde ich dennoch mit solchen Aussagen aufpassen, da ich ohne die Möglichkeit der kurzen Besprechung im Klassenraum (die nicht jeder Leser haben wird) schlichtweg als logischen Fehler verstanden hätte, was in weiterer Folge der Integrität der Autorin schadet.

- Tamara hat ganz bewusst alle mathematischen Formeln aus dem Buch verbannt um damit den Leserkreis zu vergrossern. Finde ich auch nicht so toll, ist aber Ihre Entscheidung. Damit werden manche feinen Unterschiede nicht klar beschreibbar (imho). Aber Ihr könnt dies ja selber mit Ihr im Mai diskutieren.

What ist the problem of channel inseparability?

- Hmm -- warum ist dies nicht klar vom Text/Bildern?

The book states that parallel coordinates only show trends among neighboring axes. Colors like in the first example make it possilbe to see correlation over more than one axis. with a small number of attributes.

- Not clear how. If you are lucky, in some specific scenarios you might see correlations across multiple axis, but typically not.

When looking for correlation would i always go with an scatterplot or is this an option?

- it's the simplest (most effective) way. So - why not?

Radial multipodes plot or any radial layouts other than a piechart seem very hard to read. Even the cyclic patterns are very easy to spot on an rectilinear layout. Can you tell me some examples where this would be a good layout?

- cyclic patterns are much harder -- see Glyphmaps: it's really hard to see whether there is a gap between beginning and end or not!

"On page 153 it is sayes that because length judgement is better than angle judgements stacked bars would be easier to read than a pie chart. Does this not ignore the fact that a pie chart encodes very well with volume. I know that stacked bars also show volume but pie charts have a different relation from angle to volume that makes it seem easier to read for me. There must be a reason why pie charts are so widely used." and "Radial charts, even though as mentionend in the book are less effective than bar charts, were very common in the past and often prefered in the past. How did this develop?"

- Study over study shows -- pie charts are no good. Still people love them. Just like biologists love red-green and engineers love rainbows. Sigh. If you'd like to device a study to test your hypothesis -- go ahead! :)

Warum verwendet man pale low-saturation statt highly saturated colors, wenn man large areas kodieren will ?

- I'll talk about it in the lecture.

Was wird beim "volume rendering" eigentlich gemacht?

- Es wird z.B. ein 3D-Datensatz (ein "Volume") visualisiert. Genaue Methoden siehe z.B. hier http://en.wikipedia.org/wiki/Volume_rendering

Page 137 / bar charts - wouldn't it be possible to put a third attribute into a bar chart by the colour of the bar (brightness or trough patterns,…)?

- yes, possible, common, but makes it harder to read.

page 163 / line charts - why is it allowed to make connections between two marks in a line chart? we do not know if the value is consistently changed between the two marks. what if the value after one mark was very low until the start of the second mark when he was high again. Isn't that a wrong interpolation which is made? And isn't it a manipulation for the viewer? Because the diagram can't really tell what happened between the two marks, but it still made a interpolation between the two, regardless of the value of the data which represents the gap between the two marks.

- I think we talked about it in length today.

"fig 7.4 (cluster heatmap) The trees on the sides is said to express the reordering done. Doesn't it actually show only how the data was clustered? I don't see a way to infer the original ordering of the data, given only the plot and the trees." and "Wie genau sind bei den cluster heatmaps die Bäume, die jeweils die Ableitung der Daten zeigen sollen, zu interpretieren?"

- yeah, it's a hierarchical clustering, implying a reordering, but no exact original order can be inferred.

Bei welchen konkreten Beispielen ist es sinnvoll Visualisierung in Form von “parallel coordinates” zu verwenden im Gegensatz zur Darstellung mit Hilfe von Scatterplots?

- Hmmm - persönlich finde ich PC nicht sehr gut und würde fast immer SPLOMs (oder alternativen) bevorzugen ...

In “8.6.1 Size Channel” wird die Wichtigkeit von untersch. Größe in verschiedenen Dimensionen besprochen. Wie ist hierbei der Satz “A larger-dimensional size coding clearly subsumes a smaller dimensional one: length and area cannot be simultaneously used to encode different dimensions.” zu verstehen? Würde zusätzlich zu einem Bereich gleichzeitig eine Länge zur Enkodierung verwendet werden, wäre die Darstellung doch wieder als drei-dimensional zu werten.

- so ginge das zum einen (ist aber auch sehr schlecht), aber wenn wir in 2D bleiben können sagt es nur aus dass das eine das andere beeinflusst. Soll heissen dass diese Channel nicht gut trennbar sind!

Can we talk more about force-directed placement algorithms in the lecture? Which books about this topic can be recommended?

- Graph drawing books. Tamara gives some refs. And now, unfortunately there is not too much time to go into great details there. Its a huge topic.

It is clear from experience that motion strongly draws attention and tried to look it up fast but couldnt find a satisfiable answer. What is the reason that objects in motion draw more attention than static objects for example?

- Detecting movement was/is important to survive, hence our visual system is really 'attentive' to movement.

Nochmal eine Frage zu Tabellen und Links. In Figure 2.4 (s. 26) steht, dass Tabellen nur Attribute und Items enthalten können. Links gibt es nur in trees und networks. Figure 7.5 (s. 141) und Figure 7.23 (s. 173) zeigen doch das Gegenteil. Ein Link ist einfach eine Verbindung zwischen zwei Items. Wenn es eine Verbindung zwischen A und B gibt, füllt man das Kästchen bei Zeile A/Spalte B und bei Zeile B/Spalte A aus. Man muss nicht zwangsläufig ein Attribut reinschreiben.

- Fig 7.5 ist eine gute visuelle encoding aber nicht unbedingt eine effiziente Datenstruktur! :)

1000x1000 is an odd standard display size. (s. 139) Most devices/screens are not square?

- agreed. not happy about this formulation myself.

"There are five major design choices for how to arrange data using spatial channels: to express values; to separate, order, and align regions; and to use spatial position in accordance with given spatial attributes." (s.132) Does the last part mean stuff like maps?

- It refers to spatial data -- scalar, vector, tensor physcial data from our physical environment. (including maps, medical data, fluid flow, etc)

Are there any generally accepted guidelines for the maximum number of marks so as to maintain the legibility of a parallel coordinate plot? How can this number be increased?

- KISS (Keep-It-Simple-Stupid) is a guideline. So, use the minimum amount of ink!

Does the accuracy of the perception of length, area or volume depend on the orientation of the element in question? Especially, when comparing two such elements, should they be directed in the same way? (For instance, should two squares have parallel edges to have better accuracy?)

- Not 100% sure, but would expect a 'yes'.

Does the use of superimposed layers preclude that of color or brightness?

- unclear -- these are two different concepts.

Bezogen zu Figure 8.4: Warum entwickelt man überhaupt neue Colormaps, wenn es bereits welche gibt die sehr gut funktionieren bzw. sich bewährt haben , in dem Fall die Colormap die es in jedem Atlas zu finden gibt?

- Was ist wenn man keinen Atlas baut sondern ein medizinisches Diagnose System?

Was ist eine "smoking gun study"?

- Eine Studie die unwiderruflich eine Hypothese bestätigt oder widerlegt.

Critique / Additional Notes

Den Abschnitt "Cluster Heatmap" fand ich etwas verwirrend und die Zeichnung hat mir auch nicht besonders geholfen. Es wird zwar beschrieben wofür es benötigt wird und wie man es lesen sollte aber ich habe es nicht verstanden. Parallel Layouts wurden auch beschrieben aber ich finde diese Methode der Darstellung nicht so gut. Es gibt zwar eine Zeichnung dazu und es wird auch beschrieben aber ich konnte die Werte nicht richtig nachvollziehen. Den Abschnitt "Dense" habe ich leider gar nicht verstanden. Ich habe es zwar gründlich gelesen aber es wäre gut wenn man diesen Punkt in der VO nochmals besprechen könnte. Was mich aber sehr gestört hat war das während Kapitel 7 sehr häufig die Grafik 7.21 angesprochen wurde das Bild jedoch wurde sehr spät gezeigt wodurch mir nicht immer alles gleich klar war.

fig 7.14 (tree layouts of large trees) In the paragraph surounding the graphic, 7.14a&b are commented on and explained a little, 7.14c (treemap) is just mentioned in the figure-description, but not really commented on, and treemaps arent mentioned for the next 9 pages. Maybe a few extra lines in the paragraph would be nice, or a reference to the next subsection. Possible Errors: 7.1. The spatial channels can <MISSING>be<\MISSING> used to arrange data 7.2.3.2 (Example Isolines) changing the chosen level on the fly, for example with a slider that allows the user to quickly <MISSING>select<\MISSING> the contour value from the minimum to the maximum

After seeing Figure 7.2 in Chapter 7, I thought that it is maybe better to use a different scatterplot as an example. One that is more simple, this example is missing the line that shows how the values are correlating and the labels at the axis are too small and therefore can't be read, at least without some effort.

[…] The following chapters contain mostly definitions. Most of them are clear, but the Cluster Heatmap was a little bit hard to understand the first time.

Letzte Änderung: 20.03.2014, 18:04 | 2827 Worte

Visualisierung (VU)

Q6

Critique / Additional Notes