Data Visualization Software
Class Outline and Teaching Materials


U. of Edinburgh and Greg's supplementary notes

Theoretical Backgrounds

  1. Overview: Hans Rosling' talk: long/ short , d3.js , tools
  2. examples of misleading data viz, MacWorld's iPhone Pie Chart, Cumulative iPhone sales chart
  3. colors: HSL and HSV, I Want Hue, Viz Palette for Data Visualization Color, sequential, divergent, and qualitative color schemes,
  4. color blindness => redudant encoding: Accessible Color Experiences, redundant encoding
  5. geospatial dataviz: kdnuggets, safegraph

Text Processing Tools

  1. a quick review of regular expressions
  2. block cut-paste in nano, cut, paste, uniq, ...
  3. html2csv
  4. csvkit and textql
  5. jq / fx / zq
  6. gpx/kml/geojson conversion

Types of Plots and Diagrams

  1. pie chart vs stacked bars vs side-by-side bars
  2. The issue with Pie chart
  3. line chart / stacked line chart? / gnuplot's filledcurves
  4. histogram, histogram, [ application in photography] violin chart
  5. Treemap: when, examples, treemap tool
  6. More charts for tree data: icicle, sunburst, circlepack
  7. Marimekko chart for a restaurant + food-cost.csv / Marimekko chart vs Treemap / Simpson’s Paradox: UCB admission + batting + treatment (causality) / juxtaposed "binary" Marimekko charts
  8. scatter plot, bubble chart
  9. treevis.net, one zoom tree of life, dogs breeds, simplified, phylogram_d3,
  10. Chernoff face
  11. How to Choose the Most Appropriate Chart?
  12. Visual Cheatsheet

Plotting Tools

  1. rawgraphs, Greg's notes
  2. metabase getting started, Greg's notes
  3. Python Crash Course:
    1. slice notation: array, string,
    2. Shallow Copy vs Deep Copy (zh_TW)
    3. list comprehension: simple examples, fast paced,
  4. matplotlib gallery, Read a CSV file with Numpy, visual intro to numpy, numpy illustrated, genfromtxt(), scat.py
  5. gnuplot: HMC, Manning, bubble chart (zh_TW)
  6. scatplot: satellites and android app sizes
  7. metabase; grafana
  8. pipdeptree (zh_TW) / why is pandas so slow? (zh_TW) / ER Diagrams from *.sql files ==> graphviz
  9. GeoDa
  10. QGIS
  11. visualization of high dimensional data: UMAP

Geographic Data and Plotting

  1. Taiwan's public transportation info (zh_TW)
  2. OpenStreetMap: History,
  3. osmand: intro (zh_TW), gpx <==> geojson conversion (zh_TW), importing gpx into osmand (zh_TW),
  4. overpass turbo and examples, full map feature listing, more examples (zh_TW),
  5. osm pbf (zh_TW)
  6. marker cluster map, potluckmap

Conclusions

  1. Greg's Drawing Toolbox (zh_TW)

Sample data sets

  1. Our World in Data
  2. Education Quality from "Our World in Data"
  3. US census datasets: poverty by county,
  4. CIA factbook
  5. covid19 from EU

Other links

  1. Best Data Visualization Projects of 2021
  2. Information is beautiful

(back to course homepage)