
Testing the Effectiveness of the Diagnostic Probing Paradigm on Italian Treebanks

The outstanding performance recently achieved by neural language models (NLMs) across many natural language processing (NLP) tasks has steered the debate towards understanding whether NLMs implicitly learn linguistic competence. Probes, i.e., …
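As a rough illustration of the diagnostic probing paradigm, the sketch below trains a linear probe on frozen sentence embeddings to predict a simple linguistic property; the Italian checkpoint (`dbmdz/bert-base-italian-cased`) and the use of sentence length as the target are assumptions made for the example, not the paper's actual treebank-derived features.

```python
# Diagnostic probe sketch: a linear model trained on frozen sentence
# embeddings to predict a linguistic property (toy target: length).
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.linear_model import LinearRegression

model_name = "dbmdz/bert-base-italian-cased"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name).eval()

sentences = ["Il gatto dorme.", "Il gatto che ho visto ieri dorme sul divano."]
targets = [len(s.split()) for s in sentences]  # stand-in linguistic feature

with torch.no_grad():
    enc = tokenizer(sentences, padding=True, return_tensors="pt")
    hidden = model(**enc).last_hidden_state            # (batch, seq, dim)
    mask = enc["attention_mask"].unsqueeze(-1)
    sent_emb = (hidden * mask).sum(1) / mask.sum(1)    # mean pooling

# The probe itself: with a real treebank one would use thousands of
# sentences and report held-out scores.
probe = LinearRegression().fit(sent_emb.numpy(), targets)
print(probe.score(sent_emb.numpy(), targets))
```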

On Robustness and Sensitivity of a Neural Language Model: A Case Study on Italian L1 Learner Errors

In this paper, we propose a comprehensive linguistic study aimed at assessing the implicit behaviour of one of the most prominent Neural Language Models (NLMs) based on the Transformer architecture, BERT (Devlin et al., 2019), when dealing with a …
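One simple way to surface a masked NLM's sensitivity to learner errors is to compare pseudo-log-likelihood scores of a well-formed sentence and a minimally edited erroneous variant. This is a hedged illustration of that general idea, not the paper's evaluation protocol; the checkpoint and the sentence pair are assumptions.

```python
# Pseudo-log-likelihood with a masked LM: mask each token in turn and
# sum the log-probability assigned to the original token.
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

model_name = "dbmdz/bert-base-italian-cased"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name).eval()

def pseudo_log_likelihood(sentence: str) -> float:
    ids = tokenizer(sentence, return_tensors="pt")["input_ids"][0]
    total = 0.0
    for i in range(1, len(ids) - 1):                 # skip [CLS] and [SEP]
        masked = ids.clone()
        masked[i] = tokenizer.mask_token_id
        with torch.no_grad():
            logits = model(masked.unsqueeze(0)).logits[0, i]
        total += torch.log_softmax(logits, dim=-1)[ids[i]].item()
    return total

print(pseudo_log_likelihood("I bambini giocano nel parco."))  # well-formed
print(pseudo_log_likelihood("I bambini gioca nel parco."))    # agreement error
```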

Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts

In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5, in a classification scenario. In particular, we test the performance of IT5 on several tasks involving both the classification of …
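In the text-to-text framing, classification is cast as conditional generation: the model receives a (possibly prefixed) input string and is expected to emit the label as text. A minimal inference sketch after fine-tuning, assuming the `gsarti/it5-base` checkpoint and a hypothetical topic-classification prefix:

```python
# Text-to-text classification: the label is generated as a string
# rather than predicted by a dedicated classification head.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_name = "gsarti/it5-base"  # assumed IT5 checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name).eval()

text = "classifica argomento: La squadra ha vinto il campionato dopo una lunga rincorsa."
inputs = tokenizer(text, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=5)
# Meaningful labels (e.g. a topic name) only appear after fine-tuning
# on (text, label-string) pairs.
print(tokenizer.decode(out[0], skip_special_tokens=True))
```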

Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties

In this paper, we present an in-depth investigation of the linguistic knowledge encoded by the transformer models currently available for the Italian language. In particular, we investigate how the complexity of two different architectures of probing …
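Probe complexity is usually varied by swapping the probing classifier while keeping the frozen representations fixed, e.g., a linear model versus a small MLP. A hedged sketch of that contrast, with random arrays standing in for the embeddings and the linguistic annotation:

```python
# Linear vs. non-linear probe on the same frozen representations; the
# score gap is often read as a hint about how directly the property is
# encoded in the embeddings.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 768))    # stand-in for frozen NLM embeddings
y = rng.integers(0, 2, size=200)   # stand-in for a linguistic label

linear_probe = LogisticRegression(max_iter=1000)
mlp_probe = MLPClassifier(hidden_layer_sizes=(128,), max_iter=500)

print("linear probe:", cross_val_score(linear_probe, X, y, cv=5).mean())
print("MLP probe:   ", cross_val_score(mlp_probe, X, y, cv=5).mean())
```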

Punctuation Restoration in Spoken Italian Transcripts with Transformers

In this paper, we propose an evaluation of a Transformer-based punctuation restoration model for the Italian language. Experimenting with a BERT-base model, we perform several fine-tuning runs with different training data and sizes and test them in an …
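Punctuation restoration is commonly framed as token classification over the unpunctuated transcript: each word receives a label naming the punctuation mark, if any, that should follow it. A minimal sketch of one fine-tuning step under that framing; the checkpoint and the four-way label set are assumptions, not the paper's exact configuration.

```python
# Punctuation restoration as token classification: one label per word
# indicating which punctuation mark (if any) follows it.
import torch
from transformers import AutoModelForTokenClassification, AutoTokenizer

labels = ["O", "COMMA", "PERIOD", "QUESTION"]        # assumed label set
model_name = "dbmdz/bert-base-italian-cased"         # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(
    model_name, num_labels=len(labels)
)

words = ["ciao", "come", "stai", "bene", "grazie"]
gold = ["COMMA", "O", "QUESTION", "O", "PERIOD"]     # toy annotation

enc = tokenizer(words, is_split_into_words=True, return_tensors="pt")
# Align word-level labels to subword tokens; -100 is ignored by the loss.
label_ids = [
    -100 if w is None else labels.index(gold[w]) for w in enc.word_ids(0)
]
out = model(**enc, labels=torch.tensor([label_ids]))
out.loss.backward()   # backward pass of one fine-tuning step (optimiser omitted)
```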

Tracking Linguistic Abilities in Neural Language Models

In the last few years, the analysis of the inner workings of state-of-the-art Neural Language Models (NLMs) has become one of the most addressed lines of research in Natural Language Processing (NLP). Several techniques have been devised to obtain …

On the role of Textual Connectives in Sentence Comprehension: a new Dataset for Italian

In this paper, we present a new evaluation resource for Italian aimed at assessing the role of textual connectives in the comprehension of sentence meaning. The resource is arranged in two sections (acceptability assessment and cloze test), …
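The cloze section of such a resource can be queried directly with a masked NLM: the connective is masked and the model's top-ranked fillers for the gap are inspected. A hedged sketch of that query; the checkpoint and the example sentence are assumptions, not items from the dataset.

```python
# Cloze-style query: mask a textual connective and inspect the
# model's top-ranked candidates for the gap.
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

model_name = "dbmdz/bert-base-italian-cased"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name).eval()

sentence = f"Era stanco, {tokenizer.mask_token} ha continuato a lavorare."
inputs = tokenizer(sentence, return_tensors="pt")
mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero().item()

with torch.no_grad():
    logits = model(**inputs).logits[0, mask_pos]
top = torch.topk(logits, 5).indices.tolist()
print(tokenizer.convert_ids_to_tokens(top))   # candidate fillers for the slot
```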

Probing Tasks Under Pressure

Probing tasks are frequently used to evaluate whether the representations of Neural Language Models (NLMs) encode linguistic information. However, it is still debated whether probing classification tasks really enable such an investigation or whether they simply …
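One widely used way of putting a probe under pressure (a generic control, hedged; not necessarily this paper's own design) is to train the identical classifier on degenerate inputs, such as random embeddings or shuffled labels, so that classifier capacity and memorisation can be separated from information genuinely encoded in the representations:

```python
# Control comparison for a probing task: identical probe, real
# embeddings vs. random embeddings vs. shuffled labels. A small gap
# suggests the probe, not the representation, is doing the work.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X_real = rng.normal(size=(300, 768))       # stand-in for NLM embeddings
y = rng.integers(0, 2, size=300)           # stand-in linguistic label
X_random = rng.normal(size=X_real.shape)   # control: random vectors
y_shuffled = rng.permutation(y)            # control: shuffled labels

probe = LogisticRegression(max_iter=1000)
for name, X_, y_ in [("real", X_real, y),
                     ("random embeddings", X_random, y),
                     ("shuffled labels", X_real, y_shuffled)]:
    print(name, cross_val_score(probe, X_, y_, cv=5).mean())
```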

Evaluating Transformer Models for Punctuation Restoration in Italian

In this paper, we propose an evaluation of a Transformer-based punctuation restoration model for the Italian language. Experimenting with a BERT-base model, we perform several fine-tuning runs with different training data and sizes and test them in an …

How Do BERT Embeddings Organize Linguistic Knowledge?

Several studies have investigated the linguistic information implicitly encoded in Neural Language Models. Most of these works have focused on quantifying the amount and type of information available within their internal representations and across their …
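Layer-wise analyses of this kind typically re-run the same probe on the hidden states of every Transformer layer, which can be exposed with `output_hidden_states=True`. A hedged sketch of the extraction step (checkpoint assumed):

```python
# Extract one mean-pooled sentence embedding per layer, so the same
# probe can then be trained and scored layer by layer.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "dbmdz/bert-base-italian-cased"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name, output_hidden_states=True).eval()

enc = tokenizer("Il gatto dorme sul divano.", return_tensors="pt")
with torch.no_grad():
    hidden_states = model(**enc).hidden_states   # embeddings + one tensor per layer

per_layer = [h.mean(dim=1).squeeze(0) for h in hidden_states]  # mean pooling
for i, emb in enumerate(per_layer):
    print(f"layer {i}: {tuple(emb.shape)}")      # 768-dim vector per layer
```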