Cecilia Aragon

Autumn 2018

Cultural Differences in Data Privacy Perspectives on Social Media

Note: Enrollment in this research group is at capacity for Autumn 2018

The Cambridge Analytica scandal has triggered a discussion about data privacy in social media. As news of the scandal has spread, a worldwide public discussion about data privacy has emerged. Motivated by this context, we aim to answer the following research question: Does the public online debate reveal different perspectives on data privacy across countries/cultures? To do so, we have collected Twitter activity associated with data privacy and the Cambridge Analytica scandal in both English and Spanish. Our work will result in insights about the different aspects of data privacy that are emphasized by people in different countries; a characterization of how geography, time, and bots influence the worldwide online conversation on data privacy; and lessons learned about how best to apply human-centered data science techniques to support cross-cultural comparisons of social media data.

We have collected a large-scale Twitter dataset around this issue and are in the process of analyzing the data through both qualitative coding and automated analysis. The research group takes a mixed-methods approach to understanding the data; we are currently focused on the qualitative coding.

The group is open to both graduate and undergraduate students. Qualitative research experience in grounded theory and qualitative coding is desirable but not required. Bilingualism, particularly in Spanish, is a plus. We strongly encourage interested undergraduates to apply, even if they have little or no experience with this type of research. This is an excellent opportunity to be introduced to the methods of human-centered data science, as well as a chance to gain valuable insight into the way that research is carried out.


Autumn 2018

Distributed Mentoring and Fanfiction Data Analytics

Note: Enrollment in this research group is at capacity for Autumn 2018

Are you interested in applying human-centered data science to study how people learn from online fandom?

This ongoing research project studies informal learning in online fanfiction communities. We are looking for students with experience in either (a) programming and analysis of large text datasets or (b) qualitative research in online fandoms, to join an existing research group. We have published multiple papers on our research and are in the process of submitting others.

We have found quantitative and qualitative evidence that distributed mentoring plays a positive role in fanfiction authors’ development as writers. This quarter’s project continues that work with a specific focus on visual analytics of a large dataset. We’ve collected a vast, rich text dataset of over 61.5 billion words (the largest fiction dataset outside of the Google Books corpus) comprising stories, reviews, and associated metadata from fanfiction sites. We have applied both qualitative techniques (ethnography) and quantitative techniques (machine learning, statistical analysis, data visualization) to investigate the relationship between distributed mentoring and writing quality (e.g., grammar, reading level).


Dr. Aragon's Research Group archive