Alessio Miaschi

Full-time researcher (RTD) in Natural Language Processing

ItaliaNLP Lab (CNR-ILC)

Biography

I am a full-time researcher (RTD) at the ItaliaNLP Lab, Institute for Computational Linguistics “A. Zampolli” (CNR-ILC, Pisa). In 2022, I received my PhD in Computer Science at the University of Pisa.

My research interests lie primarily in the context of Natural Language Processing (NLP) and in the study of Language Models (LM). I am particularly interested in the interpretability of large-scale LMs and in the evaluation of their internal representations, with a specific emphasis on understanding their inner linguistic abilities. Furthermore, I work in the development of NLP tools tailored for building educational applications.

In my free time, I enjoy going climbing, watching movies, listening to music, and reading books. Additionally, I have a passion for good beers and capturing moments with my analog camera.

Interests

Natural Language Processing
Language Models
Representation Learning
Interpretability for Deep Learning

Education

PhD in Computer Science, 2022

University of Pisa
MSc in Digital Humanities, 2017

University of Pisa
BSc in Digital Humanities, 2015

University of Pisa

Recent Publications

Cristiano Ciaccio, Alessio Miaschi, Felice Dell'Orletta (2025). Evaluating Lexical Proficiency in Neural Language Models. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025, Vienna, Austria).

PDF

Andrea Pedrotti, Michele Papucci, Cristiano Ciaccio, Alessio Miaschi, Giovanni Puccetti, Felice Dell'Orletta, Andrea Esuli (2025). Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors. Proceedings of the Findings of the 2025 Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2025, Vienna, Austria).

Preprint PDF

Cristiano Ciaccio, Marta Sartor, Alessio Miaschi, Felice Dell'Orletta (2025). Beyond the Spelling Miracle: Investigating Substring Awareness in Character-Blind Language Models. Proceedings of the Findings of the 2025 Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2025, Vienna, Austria).

PDF

Chiara Alzetta, Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi, Simonetta Montemagni (2025). Parallel Trees: a novel resource with aligned dependency and constituency syntactic representations. Language Resources and Evaluation.

PDF DOI

Luca Moroni, Giovanni Puccetti, Pere-Lluís Huguet Cabot, Andrei Stefan Bejgu, Alessio Miaschi, Edoardo Barba, Felice Dell'Orletta, Andrea Esuli, Roberto Navigli (2025). Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation. Proceedings of the Findings of the 2025 Annual Conference of the Nations of the Americas Chapter of the ACL (Findings of NAACL 2025, Albuquerque, Nex Mexico).

PDF

See all publications

Teaching & Talks

Teaching

Teaching:

Linguistica Computazionale (Computational Linguistics), MSc Linguistics, University of Padova, a.y. 2022/23 - ongoing
Linguistica Computazionale II (Computational Linguistics II), MSc Digital Humanities, University of Pisa, a.y. 2022/23 - ongoing

Teaching assistant:

Linguistica Computazionale (Computational Linguistics), BSc Digital Humanities, University of Pisa, a.y. 2020/21, 2021/22
Progettazione e programmazione web (Web Design and Programming), BSc Digital Humanities, University of Pisa, a.y. 2018/19, 2019/20

Other courses:

Python per umanisti principianti (Python for beginners), corsi della didattica speciale "Hands on: Strumenti digitali per le DH", May - June, 2021 (Materials).

Thesis supervised/co-supervised:

Silvio Calderaro, OuLiBench: un nuovo framework per testare i Large Language Model attraverso sfide linguistiche sull'italiano, 10/04/2025 (MSc Digital Humanities, University of Pisa).
Elena Scaglione, The explanation game: il ruolo delle spiegazioni nel task di Natural Language Inference, 11/07/2024 (MSc Digital Humanities, University of Pisa).
Cristiano Ciaccio, Lilium lunaris e quadrofono. Large Language Models e Neologia computazionale di neoformazioni lessicali italiane, 15/02/2024 (MSc Digital Humanities, University of Pisa) [Emanuele Pianta Award for the Best Master Thesis @ CLiC-it 2024]

Talks

07/04/2025: Invited talk at the NLP4RE Workshop (REFSQ 2025 @ Barcelona, Spain) - Evaluating Linguistic Abilities of Neural Language Models (Slides)
20/02/2025: Talk at the FAIR Spoke Workshop 2025 (Sapienza Università di Roma) - Controllable Text Generation for Evaluating LLMs' Linguistic Competence (Slides)
09/12/2024: Invited Talk at the AI Seminar 2024/25 (PhD in Digital Humanities, University of Genova) - Evaluating Linguistic Abilities of Neural Language Models (Slides)
09/06/2023: Invited Talk at the DCP23 Workshop - Opening Large Language Models (Slides)
31/05/2023: Lab at the Lectures on Computational Linguistics 2023 - Explaining Neural Language Models from Internal Representations to Model Predictions (Github Repo)
08/03/2023: Talk at Seminario di Cultura Digitale (Informatica Umanistica, Università di Pisa) - Le risorse linguistiche al tempo delle reti neurali (Slides and Talk)
13/12/2022: Invited Talk at "Ab urbe condata" seminar series organized by KDD Lab - Profiling Neural Language Models
10/11/2022: Talk at ILC Seminars - Profiling a Neural Language Model (Slides)
27/10/2022: Tech Talk at the School of AI (Pi School) 2022 - Interpreting Neural Language Models (Slides and Code)
21/09/2022: Invited Talk at the International Summer School on "Advances in AI" 2022 - Profiling Neural Language Models (Slides)

Contact

Via G. Moruzzi 1, Pisa, PI
Institute for Computational Linguistics “A. Zampolli” (CNR)
DM Me
Skype Me

Alessio Miaschi

Full-time researcher (RTD) in Natural Language Processing

ItaliaNLP Lab (CNR-ILC)

Biography

Interests

Education

Recent Publications

News

Co-organizing EVALITA 2026

ACL 2025 Papers

Invited Talk at NLP4RE @ REFSQ 2025 (Barcelona, Spain)

Talk at FAIR Spoke Workshop 2025

NAACL 2025 Findings Paper

Teaching & Talks

Teaching

Talks

Contact