Clustering data at multiple scales using trees.
Examples of high-dimensional data.
Linear dimensionality reduction using PCA.
Visualizing and interpreting PCA.
An overview of the UMAP algorithm.
More examples of dimensionality reduction using PCA and UMAP.
Typical tasks and example network datasets.
Navigating across related time series.
An introduction to click events in Shiny
An introduction to brush events in Shiny.
More examples defining brush queries using Shiny and `ggplot2`.
Linking in web-based visualizations.
Vocabulary used by the R Shiny Library, and a few example apps.
Viewing shiny code execution as a graph.
Using Shiny to explore a movies dataset
An overview of common formats, with illustrative examples.
A discussion of ggplot2 terminology, and an example of iteratively refining a simple scatterplot.
Examples of encodings and sequential refinement of a plot.
The definition of tidy data, and why it's often helpful for visualization._
Tools for reshaping data into tidy format.
Using `separate`, `mutate`, and `summarise` to derive new variables for downstream visualization.
An extended example of tidying a real-world dataset.
An extended example of faceting with data summaries.
Showing different variables across subpanels.
Implementing compound figures in R
A data structure for managing time series data.
Vocabulary for describing visual structure in time series.
Approaches for visualizing seasonality.
Summaries of relationships between and within time series.
Manipulating and visualizing spatial vector data.
Storing spatially gridded information in rasters.
The projection problem, and how to check your CRS.
Idioms for interacting with geographic data.
The most common network visualization strategy.
A scalable network visualization strategy.
Visualization of hierarchical structure using containment.
An introduction to clustering and how to manage its output.
Visualizing table values, ordered by clustering results.
Diagnostics for the quality of a clustering.
How reliable are the results of a clustering?
An overview of dimensionality reduction via topics.
Data preparation and model fitting code for topics.
Once we've fit a topic model, how should we inspect it?
An application to a gene expression dataset.
An introduction to partial dependence profiles.
Discovering richer structure in partial dependence profiles.
The relationship between exploratory analysis and model development.
Simulating data to evaluate model quality.
Evaluating the fit at particular observations in Bayesian models.
An introduction to compositional feature learning.
A first look at activations in a deep learning model.
Analyzing feature activations across datasets
Interpreting neurons by finding optimal inputs
Some major themes from STAT 436, in a nutshell.
Tracing the refinement of questions and design.
What is the purpose of data analysis?
A look at the origins of the field.
Using small multiples to create information dense plots.