Alessio Miaschi
News
Papers
Courses & Theses
Talks
Resources & Projects
2025
October
Charting a Decade of Computational Linguistics in Italy: The CLiC-it Corpus
Lesson at the Autumn School in AI, PhD in Digital Humanities (Università di Genova)
All-in-one: Understanding and Generation in Multimodal Reasoning with the MAIA Benchmark
September
Best Student Paper Award @ CLiC-it 2025
Crossword Space: Latent Manifold Learning for Italian Crosswords and Beyond [🏆 Best Student Paper Award 🏆]
MAIA: a Benchmark for Multimodal AI Assessment
The OuLiBench Benchmark: Formal Constraints as a Lens into LLM Linguistic Competence
LM4DH @ RANLP 2025 Invited Talk
Cruciverb-IT (Shared task at EVALITA 2026)
CLiC-it 2025 Papers
August
Cruciverb-IT @ EVALITA 2026
EMNLP 2025 Findings Paper
July
Beyond the Spelling Miracle: Investigating Substring Awareness in Character-Blind Language Models
Evaluating Lexical Proficiency in Neural Language Models
Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors
Co-organizing EVALITA 2026
June
Parallel Trees: a novel resource with aligned dependency and constituency syntactic representations
May
LLMs Anatomy Course
ACL 2025 Papers
April
Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation
Leveraging encoder-only large language models for mobile app review feature extraction
Invited Talk at NLP4RE @ REFSQ 2025 (Barcelona, Spain)
Contextualized Counterspeech: Strategies for Adaptation, Personalization, and Evaluation
February
Talk at FAIR Spoke Workshop 2025
January
NAACL 2025 Findings Paper
WWW 2025 Paper
2024
December
Controllable Text Generation To Evaluate Linguistic Abilities of Italian LLMs
Talk at AI Seminars 2024/25
November
Fantastic Labels and Where to Find Them: Attention-Based Label Selection for Text-to-Text Classification
Evaluating Large Language Models via Linguistic Profiling
LLM Profiling Data
October
CLiC-it 2024 Paper
NL4AI 2024 Paper
September
EMNLP 2024 Paper
June
New Position
May
Linguistic Knowledge Can Enhance Encoder-Decoder Models (If You Let It)
Linguistically Informed T5
March
T-FREX: A Transformer-based Feature Extraction Method for Mobile App Reviews
February
LREC-COLING 2024 Paper
January
Premio di ricerca “Dino Buzzetti” 2023
2023
December
SANER 2024 Paper
November
Lost in Labels: An Ongoing Quest to Optimize Text-to-Text Label Selection for Classification
Unmasking the Wordsmith: Revealing Author Identity through Reader Reviews
October
Lab @ Bright Night 2023
September
LANGLEARN @ EVALITA 2023
LangLearn at EVALITA 2023: Overview of the Language Learning Development Task
June
Journal of Documentation 2023
Tell me how you write and I’ll tell you what you read: a study on the writing style of book reviews
Talk at DCP23
May
XNLM Lab
Lab @ Lectures on Computational Linguistics 2023
February
Information 2023, Volume 14, Number 3
Testing the Effectiveness of the Diagnostic Probing Paradigm on Italian Treebanks
2022
October
CLiC-it 2023 Papers
December
LangLearn (Shared task at EVALITA 2023)
IEEE/ACM Transactions on Audio, Speech and Language Processing
On Robustness and Sensitivity of a Neural Language Model: A Case Study on Italian L1 Learner Errors
November
NL4AI 2022 (AIxIA) Paper
Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts
October
Tech Talk
September
Summer School - Advances in AI
July
Probing Linguistic Knowledge in Italian Neural Language Models across Language Varieties
Punctuation Restoration in Spoken Italian Transcripts with Transformers
May
PhD Thesis Defense
Tracking Linguistic Abilities in Neural Language Models
2021
December
On the role of Textual Connectives in Sentence Comprehension: a new Dataset for Italian
Probing Tasks Under Pressure
November
NL4AI 2021 (AIxIA) Paper
Evaluating Transformer Models for Punctuation Restoration in Italian
October
GPT-Dante
CLiC-it 2021 Papers
May
Python for Beginners
A dissemination workshop for introducing young Italian students to NLP
How Do BERT Embeddings Organize Linguistic Knowledge?
Teaching NLP with Bracelets and Restaurant Menus: An Interactive Workshop for Italian Students
What Makes My Model Perplexed? A Linguistic Investigation on Neural Language Models Perplexity
April
Science Web Festival Workshop
NAACL 2021 Workshop Papers
Journal of Writing Research (JoWR) Paper
A NLP-based stylometric approach for tracking the evolution of L1 written language compentece
2020
December
ATE_ABSITA @ EVALITA2020: Overview of the Aspect Term Extraction and Aspect-based Sentiment Analysis Task
Is Neural Language Model Perplexity Related to Readability?
Italian Transformers Under the Linguistic Lens
PRELEARN @ EVALITA 2020: Overview of the Prerequisite Relation Learning Task for Italian
COLING 2020 Outstanding Paper Award
October
CLiC-it 2020 Papers
September
COLING 2020 Paper
June
Linguistic Profiling of a Neural Language Model [🏆 Outstanding Paper Award 🏆]
Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation
Tracking the Evolution of Written Language Competence in L2 Spanish Learners
May
BEA-2020 Paper
RepL4NLP-2020 Paper
March
Shared tasks at EVALITA 2020
2019
November
Prerequisite or Not Prerequisite? That’s the problem! An NLP-based Approach for Concept Prerequisite Learning
PhD Giveback Event
August
Linguistically-Driven Strategy for Concept Prerequisites Learning on Italian
March
Trattamento Automatico della Lingua per la creazione di percorsi didattici personalizzati
2017
December
Deep learning for social sensing from tweets
October
Il Codice Pelavicino tra edizione digitale e Public History
1970
January
PRELEARN @ EVALITA 2020
ITA-PREREQ