Research design with computing: Something old, something new

by Matti Nelimarkka, PhD from the University of Helsinki, Finland and Aalto University, Finland. Dr. Nelimarkka is the author of the new book, Computational Thinking and Social Science: Combining Programming, Methodologies and Fundamental Concepts. He will share his expertise in a Methodspace webinar about computational social science in March, 2023. Find information and registration here.


Use the code MSPACEQ123 for a 20% discount, valid until March 31, 2023.

In 2009, Savage and Burrows foresaw a crisis of empirical sociology: social data made it possible for companies and non-social scientists to study society. Instead of a traditional survey-based network collection, companies could tap into telephone logs to see who talks with whom; in a few years, this would be replaced by services such as Facebook, Whatsapp and Telegram. For Grimmer (2015), easier access to social data meant that almost anyone could become some kind of social scientist. It was clear that the world was changing, and scholars must change with it, or risk becoming obsolete.

Therefore, social scientists began to move towards a new kind of social science, where computers have a more pivotal role in data collection and data analysis. Computational social sciences help scholars to renew their research in several directions.

For example, with the help of computing, scholars can work with larger data sets; computers allow us to work with large datasets which are inhumane in scale, no amount of research assistants would help us to work with these. We can for example study 6% of all published books to understand how social class is expressed (Kozlowski et al; 2019), this is such an enormous data set to work on and allows detailed analysis on societal change. Similar to scale, computers can also be much faster. For example, Bosch et al. (2019) got analysis done in five minutes with a computer while it took a research assistant about 35 hours. In academic settings, we might not prioritise speed as much, but there are events where it would be nice to get the work done in minutes, not in days; like remaining files before the actual analysis work can commence.

Moving away from scale and speed, computing also helps to reflect established practices. Some fields of social sciences, such as philosophy, have worked extensively with thought experiments. For such research, computing provides an interesting benefit: computers are dumb: they do exactly as commanded. Therefore, computers force us to articulate our assumptions in a formal language and see what comes from our thought experiments with it. (Ylikoski, 2014). Similarly, certain research traditions, like grounded theory, try to limit the impact of preconceptions and let the data speak. (It is still a question of debate if and how this actually takes place within the research process; can researcher ever be a tabula rasa.) The same requirement for formal statement of assumptions might allow us to identify any assumptions which go into analysis work and identify and justify them. Therefore, the computer approaches the data as a tabula rasa, only conducting the analysis as we instruct it to.

The final reason to consider moving into computational methods is the changing landscape of human behaviour: increasingly our life is mediated and supported by digital services. Therefore, accessing such data requires scholars to “speak Computer”, communicate with the digital services in their mother tongue. In the new kind of social science, we may use datasets from Facebook, TikTok and Telegram to see what people speak. We can use data from Google Maps or similar services to see where they are and where they move. We can even understand the social organisation of Wikipedia and conflict management by observing a huge amount of conflicts taking place on it (Yasseri et al, 2012).

As the five cases above demonstrate, there are many compelling reasons to take the leap and start one's journey into computational social science. There are some things new and some things old. I do not believe we can replace benefits from social science concepts, theories and operationalisation or the skills to identify interesting and relevant questions. No amount of data can make a dull question interesting. Classics like Mills’s Sociological Imagination or Becker’s Tricks of the Trade are not obsolete. Instead, scholars must learn to use these with computing and its methodological approaches.

However, these classics do not get us alone into computational social science. Scholars need to also learn something new. Most importantly, we need to understand what computers can do and how those capabilities can be tamed for social scientists. We need some level of technological imagination to understand what is possible and how different methods may aid with our research questions.

Learning computational social science builds around three core perspectives

  • programming and the technical know-how to make computers do things

  • understanding what kind of questions computing can answer and connecting them to social science ideas and theories

  • exploring fundamental research skills, such as ethics, validity and reliability in the new context of computational social science


More Methodspace posts about Big Data and Computational Social Science

Previous
Previous

Transcending an essentialized qualitative/quantitative divide when making methodological choices

Next
Next

Author Interview with Dr. Callie Rennison