Topics and institutions in the reproduction of intersectional inequalities in scienceKozlowski, Diego ![]() Doctoral thesis (2023) Detailed reference viewed: 104 (3 UL) Intersectional Inequalities in ScienceKozlowski, Diego ![]() Presentation (2022, November 16) Detailed reference viewed: 67 (2 UL) Race and gender homophily in collaborations and citationsKozlowski, Diego ; ; et alScientific Conference (2022, October 09) Detailed reference viewed: 297 (5 UL) Institutional determinants of intersectional inequalities in scienceKozlowski, Diego ; ; et alin BRIDGES BETWEEN DISCIPLINES: GENDER IN STEM AND SOCIAL SCIENCES (2022, September 12) Detailed reference viewed: 179 (6 UL)![]() Automatic Classification of Peer Review RecommendationKozlowski, Diego ; ; et alPoster (2022, September 08) Detailed reference viewed: 83 (2 UL) Applying an Intersectional Lens to Author Composition at Women’s Colleges, Historically Black Colleges and Universities, and Hispanic Serving Institutions in the United StatesKozlowski, Diego ; ; et alScientific Conference (2022, September 07) Detailed reference viewed: 83 (3 UL) Desigualdades interseccionales en la cienciaKozlowski, Diego ![]() Speeches/Talks (2022) Detailed reference viewed: 75 (1 UL) Race And Gender Inequalities In Citations And Research Topics In USKozlowski, Diego ; Article for general public (2022) Detailed reference viewed: 90 (3 UL) Intersectional inequalities in scienceKozlowski, Diego ![]() Scientific Conference (2022, April 28) Detailed reference viewed: 88 (2 UL) Large-scale computational content analysis on magazines targeting men and women: the case of Argentina 2008-2018Kozlowski, Diego ; ; et alin Feminist Media Studies (2022) Differences in magazines content aimed specifically at women or men are a means to create and reproduce gender stereotypes. Novel computational tools allow to study differences in magazines content taking ... [more ▼] Differences in magazines content aimed specifically at women or men are a means to create and reproduce gender stereotypes. Novel computational tools allow to study differences in magazines content taking into account all available articles. In this study, we analyse the case of two Argentinian magazines published by the same publishing house over a decade (2008–2018), advertised by the publishing house as targeting women and men respectively. Using computational tools, we are able to analyse more than 24,000 articles, which would have been an impossible task using manual content analysis methodologies. With Topic Modelling techniques we identify the main themes discussed in the magazines and quantify their different frequency between magazines over time. Then, we performed a word-frequency analysis to validate this methodology and extend the analysis to other subjects. Our results show that topics such as Family, Business and Women as sex objects present an initial bias that tends to disappear over time. Conversely, in Fashion and Science topics, the initial differences are maintained. Also, we identify a considerable increase in the use of words associated with feminism since 2015 and specifically the word abortion in 2018. Furthermore, we develop a website where everyone can perform additional analysis. [less ▲] Detailed reference viewed: 111 (5 UL) Avoiding bias when inferring race using name-based approachesKozlowski, Diego ; ; et alin PLoS ONE (2022), 3(17), 0264270 Racial disparity in academia is a widely acknowledged problem. The quantitative understanding of racial-based systemic inequalities is an important step towards a more equitable research system. However ... [more ▼] Racial disparity in academia is a widely acknowledged problem. The quantitative understanding of racial-based systemic inequalities is an important step towards a more equitable research system. However, because of the lack of robust information on authors’ race, few large-scale analyses have been performed on this topic. Algorithmic approaches offer one solution, using known information about authors, such as their names, to infer their perceived race. As with any other algorithm, the process of racial inference can generate biases if it is not carefully considered. The goal of this article is to assess the extent to which algorithmic bias is introduced using different approaches for name-based racial inference. We use information from the U.S. Census and mortgage applications to infer the race of U.S. affiliated authors in the Web of Science. We estimate the effects of using given and family names, thresholds or continuous distributions, and imputation. Our results demonstrate that the validity of name-based inference varies by race/ethnicity and that threshold approaches underestimate Black authors and overestimate White authors. We conclude with recommendations to avoid potential biases. This article lays the foundation for more systematic and less-biased investigations into racial disparities in science. [less ▲] Detailed reference viewed: 146 (1 UL) Intersectional Inequalities in ScienceKozlowski, Diego ; ; et alin Proceedings of the National Academy of Sciences of the United States of America (2022), 119(2), 2113067119 The US scientific workforce is primarily composed of White men. Studies have demonstrated the systemic barriers preventing women and other minoritized populations from gaining entry to science; few ... [more ▼] The US scientific workforce is primarily composed of White men. Studies have demonstrated the systemic barriers preventing women and other minoritized populations from gaining entry to science; few, however, have taken an intersectional perspective and examined the consequences of these inequalities on scientific knowledge. We provide a large-scale bibliometric analysis of the relationship between intersectional identities, topics, and scientific impact. We find homophily between identities and topic, suggesting a relationship between diversity in the scientific workforce and expansion of the knowledge base. However, topic selection comes at a cost to minoritized individuals for whom we observe both between- and within-topic citation disadvantages. To enhance the robustness of science, research organizations should provide adequate resources to historically underfunded research areas while simultaneously providing access for minoritized individuals into high-prestige networks and topics. [less ▲] Detailed reference viewed: 134 (5 UL) Intersectional Inequalities in ScienceKozlowski, Diego ![]() Presentation (2021, December 06) Detailed reference viewed: 96 (3 UL) Metascience: Disrupting the status quo or perpetuating inequitiesKozlowski, Diego ![]() Scientific Conference (2021, September 23) Detailed reference viewed: 86 (1 UL) Avoiding bias when inferring race using name-based approachesKozlowski, Diego ; ; et alin 18th INTERNATIONAL CONFERENCE ON SCIENTOMETRICS & INFORMETRICS, 12–15 July 2021KU Leuven, Belgium (2021, July) Racial disparity in academia is a widely acknowledged problem. The quantitative understanding of racial-based systemic inequalities is an important step towards a more equitable research system. However ... [more ▼] Racial disparity in academia is a widely acknowledged problem. The quantitative understanding of racial-based systemic inequalities is an important step towards a more equitable research system. However, few large-scale analyses have been performed on this topic, mostly because of the lack of robust race-disambiguation algorithms. Identifying author information does not generally include the author’s race. Therefore, an algorithm needs to be employed, using known information about authors, i.e., their names, to infer their perceived race. Nevertheless, as any other algorithm, the process of racial inference can generate biases if it is not carefully considered. When the research is focused on the understanding of racial-based inequalities, such biases undermine the objectives of the investigation and may perpetuate inequities. The goal of this article is to assess the biases introduced by the different approaches used name-based racial inference. We use information from US census and mortgage applications to infer the race of US author names in the Web of Science. We estimate the effects of using given and family names, thresholds or continuous distributions, and imputation. Our results demonstrate that the validity of name-based inference varies by race and ethnicity and that threshold approaches underestimate Black authors and overestimate White authors. We conclude with recommendations to avoid potential biases. This article fills an important research gap that will allow more systematic and unbiased studies on racial disparity in science. [less ▲] Detailed reference viewed: 231 (7 UL) Semantic and Relational Spaces in Science of Science: Deep Learning Models for Article VectorisationKozlowski, Diego ; Dusdal, Jennifer ; Pang, Jun et alin Scientometrics (2021) Over the last century, we observe a steady and exponentially growth of scientific publications globally. The overwhelming amount of available literature makes a holistic analysis of the research within a ... [more ▼] Over the last century, we observe a steady and exponentially growth of scientific publications globally. The overwhelming amount of available literature makes a holistic analysis of the research within a field and between fields based on manual inspection impossible. Automatic techniques to support the process of literature review are required to find the epistemic and social patterns that are embedded in scientific publications. In computer sciences, new tools have been developed to deal with large volumes of data. In particular, deep learning techniques open the possibility of automated end-to-end models to project observations to a new, low-dimensional space where the most relevant information of each observation is highlighted. Using deep learning to build new representations of scientific publications is a growing but still emerging field of research. The aim of this paper is to discuss the potential and limits of deep learning for gathering insights about scientific research articles. We focus on document-level embeddings based on the semantic and relational aspects of articles, using Natural Language Processing (NLP) and Graph Neural Networks (GNNs). We explore the different outcomes generated by those techniques. Our results show that using NLP we can encode a semantic space of articles, while with GNN we are able to build a relational space where the social practices of a research community are also encoded. [less ▲] Detailed reference viewed: 178 (23 UL) Latent Dirichlet Allocation Models for World Trade AnalysisKozlowski, Diego ; ; in PLoS ONE (2021), 16(2), 0245393 The international trade is one of the classic areas of study in economics. Nowadays, given the availability of data, the tools used for the analysis can be complemented and enriched with new methodologies ... [more ▼] The international trade is one of the classic areas of study in economics. Nowadays, given the availability of data, the tools used for the analysis can be complemented and enriched with new methodologies and techniques that go beyond the traditional approach. The present paper shows the application of the Latent Dirichlet Allocation Models, a well known technique from the area of Natural Language Processing, to search for latent dimensions in the product space of international trade, and their distribution across countries over time. We apply this technique to a dataset of countries' exports of goods from 1962 to 2016. The findings show the possibility to generate higher level classifications of goods based on the empirical evidence, and also allow to study the distribution of those classifications within countries. The latter show interesting insights about countries' trade specialisation. [less ▲] Detailed reference viewed: 128 (7 UL) Machine Learning on GraphsKozlowski, Diego ![]() Presentation (2020, November 18) Graphs are a ubiquitous data structure that can be exploited in many different problems. In tasks where observations are not independently drawn from the data generating process, but their codependencies ... [more ▼] Graphs are a ubiquitous data structure that can be exploited in many different problems. In tasks where observations are not independently drawn from the data generating process, but their codependencies add valuable information, a network analysis might be useful for modelling those relations. In this seminar we will discuss about Graph Neural Networks, the deep learning approach for dealing with networks. [less ▲] Detailed reference viewed: 137 (6 UL) Package development in RKozlowski, Diego ![]() Presentation (2020, October 12) Detailed reference viewed: 98 (1 UL) |
||