Fantastic Labels and Where to Find Them: Attention-Based Label Selection for Text-to-Text Classification
Proceedings
Michele Papucci, Alessio Miaschi, Felice Dell'Orletta
Generative language models, particularly those adopting text-to-text frameworks, have shown significant success in NLP tasks. While much research has focused on input representations via prompting techniques, less attention has been given to optimizing output representations. Previous studies found inconsistent effects of label representations on model performance in classification tasks with these models. In this work, we introduce a novel method for selecting well-performing label representations by leveraging the attention mechanisms of Transformer models. We used an Italian T5 model, fine-tuned on a topic classification task over posts extracted from online forums and categorized into 11 classes, to evaluate different label representation selection strategies. We employed a context-mixing score called Value Zeroing to assess each token's impact and select candidate representations from the training set. Our results include a detailed qualitative analysis identifying which label choices most significantly affect classification outcomes, and suggest that using our approach to select label representations can enhance performance.
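The selection step can be illustrated with a minimal sketch (not the authors' exact implementation): assuming per-token importance scores have already been computed with a context-mixing method such as Value Zeroing, each class's candidate label representations are the tokens that accumulate the highest scores over its training examples. All function and variable names below are illustrative.

```python
from collections import defaultdict

def select_label_representations(examples, top_k=1):
    """For each class, return the training-set tokens with the highest
    aggregated importance scores as candidate label representations.

    `examples` is a list of (tokens, scores, gold_class) triples, where
    `scores` are per-token importance values assumed to come from a
    context-mixing method such as Value Zeroing.
    """
    aggregate = defaultdict(lambda: defaultdict(float))
    for tokens, scores, gold_class in examples:
        for token, score in zip(tokens, scores):
            aggregate[gold_class][token] += score
    # Rank tokens within each class and keep the top-k as candidates.
    return {
        cls: sorted(token_scores, key=token_scores.get, reverse=True)[:top_k]
        for cls, token_scores in aggregate.items()
    }

# Toy usage with made-up scores for two Italian forum posts:
examples = [
    (["vorrei", "comprare", "una", "moto"], [0.1, 0.4, 0.05, 0.9], "MOTORI"),
    (["ricetta", "per", "la", "pasta"], [0.8, 0.02, 0.03, 0.7], "CUCINA"),
]
print(select_label_representations(examples, top_k=2))
```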
Published in Proceedings of the 8th Workshop on Natural Language for Artificial Intelligence (NL4AI 2024), co-located with the 23rd International Conference of the Italian Association for Artificial Intelligence (AI*IA 2024), Bolzano. Year: 2024

Label Selection in Text-to-Text Neural Language Models for Classification
Thesis
Michele Papucci
This work presents a set of preliminary experiments aimed at exploring and optimizing the use of reasonably small text-to-text Transformers for classification tasks. The broader objective is to determine whether text-to-text Language Models that are not as costly to train and deploy as the currently popular ones (e.g., ChatGPT or LLaMA) can serve as a unifying framework for solving any Natural Language Processing task. Unlike with larger models, with reasonably sized Transformers we need to find an optimal way of casting a task into text-to-text form, i.e., providing a textual input and expecting a textual output from the model. This thesis focuses on classification tasks, and in particular on the problem of representing class names as the strings that maximize the model's performance. First, we evaluated whether these smaller models can obtain reasonable performance on classification tasks. Then, we tested the importance of label representation in this setting, finding that it is indeed important for maximizing model performance. Finally, we presented and evaluated a novel technique, based on attention-attribution explainability methods, for extracting label representations from the training set of a classification task.
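To make the text-to-text casting concrete, here is a minimal fine-tuning step with the Hugging Face transformers library; the checkpoint name, prompt wording, and label string are assumptions for illustration, not the thesis setup.

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Assumed Italian T5 checkpoint, for illustration only.
checkpoint = "gsarti/it5-base"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

post = "Qualcuno sa consigliarmi una buona moto per iniziare?"
label = "motori"  # one possible string representation of the class

# The task is cast as text-to-text: textual input in, textual label out.
inputs = tokenizer("classifica: " + post, return_tensors="pt")
targets = tokenizer(label, return_tensors="pt").input_ids

# Standard seq2seq training step: the label string is the decoder target.
loss = model(**inputs, labels=targets).loss
loss.backward()
```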
Published in University of Pisa, Master's Thesis in Data Science and Business Informatics. Year: 2024

Lost in Labels: An Ongoing Quest to Optimize Text-to-Text Label Selection for Classification
Proceedings
Michele Papucci, Alessio Miaschi, Felice Dell'Orletta
In this paper, we present an evaluation of the influence of label selection on the performance of a Sequence-to-Sequence Transformer model in a classification task. Our study investigates whether the choice of words used to represent the classification categories affects the model's performance, and whether there is a relationship between the model's performance and the selected words. To this end, we fine-tuned an Italian T5 model on topic classification using various labels. Our results indicate that different label choices can significantly impact the model's performance. That said, we did not find a clear pattern in how these choices affect performance, highlighting the need for further research on optimizing label selection.
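One simple way to quantify the effect of a label choice is to fine-tune with a given label set and measure exact-match accuracy over the generated strings. The sketch below assumes a fine-tuned model and tokenizer like the ones above; this metric is a simplification for illustration, not necessarily the paper's exact evaluation.

```python
def accuracy_for_label_set(model, tokenizer, dataset, label_map):
    """Exact-match accuracy of generated label strings.

    `dataset` is a list of (text, gold_class) pairs; `label_map` maps
    each class to the string chosen to represent it during training.
    """
    correct = 0
    for text, gold_class in dataset:
        inputs = tokenizer("classifica: " + text, return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=8)
        predicted = tokenizer.decode(output_ids[0], skip_special_tokens=True)
        correct += int(predicted.strip() == label_map[gold_class])
    return correct / len(dataset)
```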
Published in Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023), Venezia. Year: 2023

Evaluating Text-To-Text Framework for Topic and Style Classification of Italian texts
Proceedings
Michele Papucci, Chiara De Nigris, Alessio Miaschi, Felice Dell'Orletta
In this paper, we propose an extensive evaluation of the first text-to-text Italian Neural Language Model (NLM), IT5, in a classification scenario. In particular, we test the performance of IT5 on several tasks involving the classification of both the topic and the style of a set of Italian posts. We assess the model in two configurations, single- and multi-task classification, and compare it with a more traditional Transformer-based NLM (i.e., BERT). Moreover, we test its performance in a few-shot learning scenario. We also perform a qualitative investigation of the impact of label representations on the IT5 model's classification behavior. Results show that IT5 achieves good results, although generally lower than the BERT model. Nevertheless, we observe a significant performance improvement of the text-to-text model in the multi-task classification scenario. Finally, we found that altering the representation of the labels mainly impacts topic classification.
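The multi-task configuration can be sketched by prefixing every input with its task name, so a single text-to-text model learns both topic and style classification from one mixed dataset; the prefixes and examples below are illustrative assumptions, not the paper's exact data.

```python
# Illustrative examples; the actual tasks use Italian forum posts.
topic_examples = [("Qualcuno sa consigliarmi una buona moto?", "motori")]
style_examples = [("Qualcuno sa consigliarmi una buona moto?", "informale")]

# A task prefix tells the single model which classification to perform.
mixed_dataset = (
    [("argomento: " + text, label) for text, label in topic_examples]
    + [("stile: " + text, label) for text, label in style_examples]
)
```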
Published in Proceedings of the 6th Workshop on Natural Language for Artificial Intelligence (NL4AI 2022), co-located with the 21st International Conference of the Italian Association for Artificial Intelligence (AI*IA 2022), Udine. Year: 2022