July 14: Introduction to Data Mining | Rebuilding the Portfolio: DH for Art Historians

Readings

Judith H. Dobrzynski, “Modernizing Art History,” The Wall Street Journal.
http://online.wsj.com/news/articles/SB10001424052702304518704579519632304010744
Ted Underwood, “Where to Start with Text Mining,” The Stone and the Shell. http://tedunderwood.com/2012/08/14/where-to-start-with-text-mining/
Dan Cohen, “Searching for the Victorians,” Dan Cohen’s Digital Humanities Blog (October 4, 2010). http://www.dancohen.org/2010/10/04/searching-for-the-victorians/
Hadley Wickham, “Tidy Data,” Journal of Statistical Software, Submitted. http://vita.had.co.nz/papers/tidy-data.pdf. (Read through the end of section 3 on page 13.)

Introduction to data and digital methods for measuring, describing and analyzing text and numeric datasets with Lisa Rhody.
- Slides from session
Hands-on Session 1: Google N-Grams Viewer and Bookworm
Demo Session 1: Identifying and preparing specific datasets: Looking together at example datasets, we will identify three types of data that we want to work with and identify important attributes of a usable dataset for description, measurement, and analysis.
- Anatomy of tabular data
- Anatomy of textual data
Break for lunch

Demo Session 2: Finding the right tool for your questions: A quick overview followed by hands-on activities
Hands-on Session 2: Using Voyant, participants will perform word frequency, corpus grid, corpus summary, and keyword in context analysis.
Demo Session 3: Examining large corpora of texts to detect trends and patterns.
Close 4pm. Bus: 4:15pm

Think about the kinds of data you have, or might want to work with, and what steps are needed to get it ready for the appropriate method of analysis you learned about today.