How can we approach qualitative data with structured and/or quantitative methods? This lesson walks through some of the ways anthropologists and other social scientists can add new layers of analysis to their data interpretation with the use of data science tools.
Even when working with qualitative data, there are underlying ways to extract structured variables. This can be as simple as looking through your interview notes and noting how many people and which groups of people mentioned a certain theme or idea. This is slightly different than the text analysis methods described above as in this case, you are cherry picking through the whole dataset in search of one specific variable. These types of descriptions can help contextualize your results for readers. Other methods are described below.
In some cases, your data may have built in quantitative or structured summary variables that aren’t initially apparent. For example, if your dataset contains newspapers from several cities in the US over a defined time period, you could imagine describing your dataset in several ways.
“This research uses newspaper articles from across major US cities published during the 20th century”
“This research uses 60 articles published in four major newspapers (LA Times [n=30], NY Times [n=10], Washington Post [n=15], Chicago Tribune [n=5]) between 1925 and 2000, with an average publication date of 1970.”
Each of these descriptions may be adequate for describing your dataset, but think about the different levels of information conveyed in each. With large datasets where individual data sources have multiple attributes (e.g. tweets, museum object tags, environmental policies) you can also try running cluster analyses on these sources to group them based on similar attributes. This can help you better understand the structure of your data.