Questions tagged [visualization]

Constructing meaningful and useful graphical representations of data. (If your question is only about how to get particular software to produce a specific effect, then it is likely not on topic here.)

Overview

Data visualization refers to techniques for presenting results in graphical form, such as histograms, scatterplots, or boxplots. Data visualization is a special challenge for data with high dimensionality.

If your question is only about how to get particular software to produce a specific effect, then it is likely not on topic here. Programming questions (for example, in Python, or in R with ggplot, etc.) for which you can supply a reproducible example are usually welcomed on StackOverflow.

References

The following question contains references to data visualization resources:

698 questions
185
votes
21 answers

How do you visualize neural network architectures?

When writing a paper / making a presentation about a topic which is about neural networks, one usually visualizes the networks architecture. What are good / simple ways to visualize common architectures automatically?
Martin Thoma
  • 19,540
  • 36
  • 98
  • 170
38
votes
3 answers

Calculation and Visualization of Correlation Matrix with Pandas

I have a pandas data frame with several entries, and I want to calculate the correlation between the income of some type of stores. There are a number of stores with income data, classification of area of activity (theater, cloth stores, food ...)…
gdlm
  • 535
  • 1
  • 6
  • 9
30
votes
8 answers

Purpose of visualizing high dimensional data?

There are many techniques for visualizing high dimension datasets, such as T-SNE, isomap, PCA, supervised PCA, etc. And we go through the motions of projecting the data down to a 2D or 3D space, so we have a "pretty pictures". Some of these…
hlin117
  • 685
  • 1
  • 8
  • 11
28
votes
8 answers

Visualizing a graph with a million vertices

What is the best tool to use to visualize (draw the vertices and edges) a graph with 1000000 vertices? There are about 50000 edges in the graph. And I can compute the location of individual vertices and edges. I am thinking about writing a program…
Cici
  • 453
  • 1
  • 4
  • 10
28
votes
6 answers

make seaborn heatmap bigger

I create a corr() df out of an original df. The corr() df came out 70 X 70 and it is impossible to visualize the heatmap... sns.heatmap(df). If I try to display the corr = df.corr(), the table doesn't fit the screen and I can see all the…
redeemefy
  • 661
  • 1
  • 6
  • 9
26
votes
10 answers

GraphViz not working when imported inside PydotPlus (`GraphViz's executables not found`)

I've been trying to make these packages work for quite some time now but with no success. Basically the error is: GraphViz's Executables not found EDIT: I had not posted a terminal log with the error originally. I'm using Ubuntu now so I won't be…
psygo
  • 535
  • 1
  • 6
  • 15
22
votes
3 answers

How do I create a complex Radar Chart?

So, I want to create a Player Profile Radar Chart something like this: Not only the scale of each variable different, but also I want a reversed scale for some statistics like the 'dispossessed' stat, where less actually means good. One solution…
Kunal Dharamsi
  • 323
  • 1
  • 2
  • 4
22
votes
2 answers

How to include labels in sns heatmap

I got this matrix 120 100 80 40 20 10 5 0 120 64.21 58.20 51.20 56.37 47.00 45.61 46.86 2.16 100 62.84 57.80 50.60 51.32 39.43 39.30 42.80 0.89 80 62.62 56.20 51.20 51.61 …
Srihari
  • 797
  • 4
  • 12
  • 27
21
votes
6 answers

What do you use to generate a dashboard in R?

I need to generate periodic (daily, monthly) web analytics dashboard reports. They will be static and don't require interaction, so imagine a PDF file as the target output. The reports will mix tables and charts (mainly sparkline and bullet graphs…
aiolias
18
votes
2 answers

How to plot two columns of single DataFrame on Y axis

I have two data frames (Action, Comedy). Action contains two columns (year, rating) ratings columns contains average rating with respect to year. The Comedy data frame contains the same two columns with different mean values. I merged both data…
Bilal Butt
  • 291
  • 1
  • 2
  • 4
17
votes
1 answer

Visualization of multiple Markov models

I am working on a project where we compare over 10 different Markov models, each representing a different treatment plan. Most often single models are visualized with a decision tree or transition state diagram. However, with multiple different…
Andrew Brown
  • 279
  • 1
  • 3
16
votes
2 answers

Visualizing deep neural network training

I'm trying to find an equivalent of Hinton Diagrams for multilayer networks to plot the weights during training. The trained network is somewhat similar to a Deep SRN, i.e. it has a high number of multiple weight matrices which would make the…
15
votes
3 answers

How can I make big confusion matrices easier to read?

I have recently published a dataset (link) with 369 classes. I ran a couple of experiments on them to get a feeling for how difficult the classification task is. Usually, I like it if there are confusion matrices to see the type of error being made.…
Martin Thoma
  • 19,540
  • 36
  • 98
  • 170
15
votes
1 answer

Heatmap on a map in Python

Mode Analytics has a nice heatmap feature, but it is not conducive to comparing maps (only one per report). What they do allow is data to be pulled easily into a wrapped python notebook. And then any image in python can easily be added to a…
ScottieB
  • 333
  • 1
  • 2
  • 8
15
votes
1 answer

Can closer points be considered more similar in T-SNE visualization?

I understand from Hinton's paper that T-SNE does a good job in keeping local similarities and a decent job in preserving global structure (clusterization). However I'm not clear if points appearing closer in a 2D t-sne visualization can be assumed…
Javierfdr
  • 1,500
  • 13
  • 14
1
2 3
46 47