Paloma Piot
IRLab, UDC
A Coruña, Spain
I am a PhD researcher in Natural Language Processing at the IRLab, University of A Coruña, Spain, where I explore how large language models (LLMs) perceive, assess, and sometimes reproduce hate speech on social media. I am intrigued by the ways AI can both reflect and amplify societal biases, and by how we can make models more responsible, fair, and understandable.
My research interests revolve around hate speech and abusive language detection, with a focus on making NLP systems responsible, fair, and explainable. I am particularly interested in how large language models behave across languages and communities, and in understanding the social and sociolinguistic dimensions of online discourse. I also explore low-resource and multilingual NLP, data-centric approaches, and the design of tools and datasets that help evaluate and improve AI systems in detecting and mitigating online hate.
I maintain open-source implementations of my work and related experiments on GitHub. You can also follow my academic and professional updates on LinkedIn and X.
Feel free to reach out via email at <paloma.piot[at]udc.es>.
Selected Publications
- MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection. In Proceedings of the International AAAI Conference on Web and Social Media, 2024
- Decoding hate: Exploring language models’ reactions to hate speech. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2025
- Towards Efficient and Explainable Hate Speech Detection via Model Distillation. In European Conference on Information Retrieval, 2025
- Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models. arXiv preprint arXiv:2505.02252, 2025
- WATCHED: A Web AI Agent Tool for Combating Hate speech by Expanding Data. SoftwareX, 2025