Alessio Miaschi

Alessio Miaschi

Full-time researcher (RTD) in Natural Language Processing

ItaliaNLP Lab (CNR-ILC)

Biography

I am a full-time researcher (RTD) at the ItaliaNLP Lab, Institute for Computational Linguistics “A. Zampolli” (CNR-ILC, Pisa). In 2022, I received my PhD in Computer Science at the University of Pisa.

My research interests lie primarily in the context of Natural Language Processing (NLP) and in the study of Language Models (LM). I am particularly interested in the interpretability of large-scale LMs and in the evaluation of their internal representations, with a specific emphasis on understanding their inner linguistic abilities. Furthermore, I work in the development of NLP tools tailored for building educational applications.

In my free time, I enjoy going climbing, watching movies, listening to music, and reading books. Additionally, I have a passion for good beers and capturing moments with my analog camera.

Interests

  • Natural Language Processing
  • Language Models
  • Representation Learning
  • Interpretability for Deep Learning

Education

  • PhD in Computer Science, 2022

    University of Pisa

  • MSc in Digital Humanities, 2017

    University of Pisa

  • BSc in Digital Humanities, 2015

    University of Pisa

Recent Publications

(2025). All-in-one: Understanding and Generation in Multimodal Reasoning with the MAIA Benchmark. Proceedings of the Findings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025, Suzhou, China) (upcoming).

Preprint

(2025). Crossword Space: Latent Manifold Learning for Italian Crosswords and Beyond. In Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025, Cagliari) (upcoming).

(2025). MAIA: a Benchmark for Multimodal AI Assessment. In Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025, Cagliari) (upcoming).

(2025). The OuLiBench Benchmark: Formal Constraints as a Lens into LLM Linguistic Competence . In Proceedings of the Eleventh Italian Conference on Computational Linguistics (CLiC-it 2025, Cagliari) (upcoming).

(2025). Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors. Proceedings of the Findings of the 2025 Annual Meeting of the Association for Computational Linguistics (Findings of ACL 2025, Vienna, Austria).

Preprint PDF

News

Cruciverb-IT (Shared task at EVALITA 2026)

CLiC-it 2025 Papers

EMNLP 2025 Findings Paper

Co-organizing EVALITA 2026

ACL 2025 Papers

Teaching & Talks

Teaching

Teaching: Teaching assistant: Other courses: Thesis supervised/co-supervised:

Talks

  • 10/06/2025: Invited talk at the CNR-IVI Workshop on AI Technologies (CNR-ISTI, Pisa) - Linguistic Profiling of Large Language Models (Slides)
  • 07/04/2025: Invited talk at the NLP4RE Workshop (REFSQ 2025 @ Barcelona, Spain) - Evaluating Linguistic Abilities of Neural Language Models (Slides)
  • 20/02/2025: Talk at the FAIR Spoke Workshop 2025 (Sapienza Università di Roma) - Controllable Text Generation for Evaluating LLMs' Linguistic Competence (Slides)
  • 09/12/2024: Invited Talk at the AI Seminar 2024/25 (PhD in Digital Humanities, University of Genova) - Evaluating Linguistic Abilities of Neural Language Models (Slides)
  • 09/06/2023: Invited Talk at the DCP23 Workshop - Opening Large Language Models (Slides)
  • 31/05/2023: Lab at the Lectures on Computational Linguistics 2023 - Explaining Neural Language Models from Internal Representations to Model Predictions (Github Repo)
  • 08/03/2023: Talk at Seminario di Cultura Digitale (Informatica Umanistica, Università di Pisa) - Le risorse linguistiche al tempo delle reti neurali (Slides and Talk)
  • 13/12/2022: Invited Talk at "Ab urbe condata" seminar series organized by KDD Lab - Profiling Neural Language Models
  • 10/11/2022: Talk at ILC Seminars - Profiling a Neural Language Model (Slides)
  • 27/10/2022: Tech Talk at the School of AI (Pi School) 2022 - Interpreting Neural Language Models (Slides and Code)
  • 21/09/2022: Invited Talk at the International Summer School on "Advances in AI" 2022 - Profiling Neural Language Models (Slides)

Projects & Resources

Cruciverb-IT @ EVALITA 2026

Webpage of the Cruciverb-IT Shared Task at EVALITA 2026

LLMs Anatomy Course

Materials for the ‘LLMs Anatomy Course’ course

LLM Profiling Data

Data associated with the paper ‘Evaluating Large Language Models via Linguistic Profiling’

Linguistically Informed T5

Suite of linguistically-informed T5 models

LANGLEARN @ EVALITA 2023

Webpage of the LANGLEARN Shared Task at EVALITA 2023

XNLM Lab

Materials for the XNLM Lab organized at the Lectures on Computational Linguistics 2023

GPT-Dante

GPT-Dante web interface

Python for Beginners

Materials for the ‘Python for Beginners’ course

PRELEARN @ EVALITA 2020

Webpage of the PRELEARN Shared Task at EVALITA 2020

ITA-PREREQ

ITA-PREREQ dataset.

Contact

  • Via G. Moruzzi 1, Pisa, PI
  • Institute for Computational Linguistics “A. Zampolli” (CNR)
  • DM Me
  • Skype Me