Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts

Michele Papucci, Chiara De Nigris, Alessio Miaschi, Felice Dell'Orletta

November 2022

PDF

Abstract

In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5, on a classification scenario. In particular, we test the performance of IT5 on several tasks involving both the classification of the topic and the style of a set of Italian posts. We assess the model in two different configurations, single- and multi-task classification, and we compare it with a more traditional NLM based on the Transformer architecture (i.e. BERT). Moreover, we test its performance in a few-shot learning scenario. We also perform a qualitative investigation on the impact of label representations in modeling the classification of the IT5 model. Results show that IT5 could achieve good results, although generally lower than the BERT model. Nevertheless, we observe a significant performance improvement of the Text-to-text model in a multi-task classification scenario. Finally, we found that altering the representation of the labels mainly impacts the classification of the topic

Type

Conference paper

Publication

In Proceedings of Sixth Workshop on Natural Language for Artificial Intelligence (NL4AI @ AIxIA 2022)

Source Themes

Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts

Abstract

Alessio Miaschi

Full-time researcher (RTD) in Natural Language Processing