Perceptual approaches to the study of German
Purschke, Christoph
in Bousquette, Joshua; Pickl, Simon (Eds.) The Oxford Handbook of the German Language (in press)

Comparing Pre-Training Schemes for Luxembourgish BERT Models
Lothritz, Cedric; Ezzini, Saad; Purschke, Christoph et al.
in Proceedings of the 19th Conference on Natural Language Processing (KONVENS 2023) (2023, September)
Despite the widespread use of pre-trained models in NLP, well-performing pre-trained models for low-resource languages are scarce. To address this issue, we propose two novel BERT models for the Luxembourgish language that improve on the state of the art. We also present an empirical study on both the performance and robustness of the investigated BERT models. We compare the models on a set of downstream NLP tasks and evaluate their robustness against different types of data perturbations. Additionally, we provide novel datasets to evaluate the performance of Luxembourgish language models. Our findings reveal that pre-training a pre-loaded model has a positive effect on both the performance and robustness of fine-tuned models and that using the German GottBERT model yields a higher performance while the multilingual mBERT results in a more robust model. This study provides valuable insights for researchers and practitioners working with low-resource languages and highlights the importance of considering pre-training strategies when building language models.

Diskurs-Figuren. Wie Politik und Öffentlichkeit in Luxemburg über Sprache sprechen
Purschke, Christoph
in Hemecht: Zeitschrift für Luxemburger Geschichte (2023), (2023/3), 313-330

46. Sociolinguistics in Luxembourg
Purschke, Christoph; Gilles, Peter
in Ball, Martin; Mesthrie, Rajend; Meluzzi, Chiara (Eds.) The Routledge Handbook of Sociolinguistics around the World (2023)

Mapping knowledge and perceptions of language: societal multilingualism and its socio-pragmatic grounding
Purschke, Christoph;
in Jucker, Andreas; Hausendorf, Heiko (Eds.) Pragmatics of Space (2022)

Mit Lingscape auf Pad in der Stadt. Ein Schulprojekt zu gesellschaftlicher Mehrsprachigkeit in Windhoek
Purschke, Christoph;
in Marten, Heiko; Ziegler, Evelyn (Eds.) Linguistic Landscapes im deutschsprachigen Kontext: Forschungsperspektiven, Methoden und Anwendungsmöglichkeiten im Unterricht und Sprachmarketing (2021)

Street name changes as language and identity inscription in the cityscape
; Purschke, Christoph
in Linguistics Vanguard (2021), 7(s5), 1-13

Cultural representation in Luxembourgish street naming practices
Purschke, Christoph
in Linguistics Vanguard (2021), 7(s5), 111

Kennen, Können, Wissen. Zur Konstruktion von Expertise
; Purschke, Christoph
in Hoffmeister, Toke; Hundt, Markus; Naths, Saskia (Eds.) LaienWissenSprache. Konzepte, Anwendungsfelder und Perspektiven der Folk Linguistics im deutschsprachigen Raum (2021)

Crowdscapes. Participatory research and the collaborative (re)construction of linguistic landscapes with Lingscape
Purschke, Christoph
in Linguistics Vanguard (2021), 7(s1)

Schnëssen. Surveying language dynamics in Luxembourgish with a mobile research app
Entringer, Nathalie; Gilles, Peter; Martin, Sara et al.
in Linguistics Vanguard (2021), 7(s1)
The mobile app Schnëssen is intended to establish a state-of-the-art digital platform to collect data on the present-day language situation of Luxembourgish by means of crowd-sourcing and to document and present results to a broader public. Users can participate in a large set of audio recordings tasks and in sociolinguistic surveys. By presenting all audio recordings via an interactive map, participants can explore the language variation of their language. In the first year of data collection, around 210.000 recordings could be collected for numerous variation phenomena from all linguistic levels and over 2800 sociolinguistic questionnaires have been filled out. The app allowed us to compile thus the largest systematic spoken language corpus of Luxembourgish.

Findings of the VarDial Evaluation Campaign 2021
; ; et al.
in Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects (2021)

spellux – Automatic text normalization for Luxembourgish
Purschke, Christoph
Software (2020)

Exploring the Linguistic Landscape of Cities through Crowdsourced Data
Purschke, Christoph
in Brunn, Stanley; Kehrein, Roland (Eds.) Handbook of the Changing World Language Map (2020)

A Report on the VarDial Evaluation Campaign 2020
; ; et al.
in Zampieri, Marcos; Preslav, Yakov; Ljubešić, Nikola (Eds.) et al. Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects (2020)

Attitudes towards multilingualism in Luxembourg. A comparative analysis of online news comments and crowdsourced questionnaire data
Purschke, Christoph
in Frontiers in Artificial Intelligence (2020), (3), 536086

Fescher als dein Schatten. Zur Präsenz des Deutschen in Österreich in der Alltagspraxis
Purschke, Christoph
in Hundt, Markus; Kleene, Andrea; Plewnia, Albrecht (Eds.) et al. Regiolekte. Objektive Sprachdaten und subjektive Wahrnehmung (2020)

Luxemburgisch zwischen Variation und Normierung. Ein Einwurf
Purschke, Christoph
Article for general public (2019)

A Temporal Warehouse for Modern Luxembourgish Text Collections
Gierschek, Daniela; Gilles, Peter; Purschke, Christoph et al.
Presentation (2019)

Lörres, Möppes, and the Swiss. (Re)Discovering Regional Patterns in Anonymous Social Media Data
Purschke, Christoph;
in Journal of Linguistic Geography (2019), 7(3), 113-134