Link to the dataset: https://hf.co/datasets/LabHC/histoires_morales
Link for the paper: https://arxiv.org/abs/2501.17117
Code: https://github.com/upunaprosk/histoires-morales
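A minimal sketch of loading the dataset from the Hugging Face Hub with the datasets library; the dataset ID is taken from the URL above, while split and column names are assumptions, so check the dataset card for the actual schema.

```python
# Minimal sketch: load HistoiresMorales from the Hugging Face Hub.
# Split and column names are assumptions; see the dataset card for the real schema.
from datasets import load_dataset

ds = load_dataset("LabHC/histoires_morales")
print(ds)                          # available splits and columns
first_split = list(ds.keys())[0]
print(ds[first_split][0])          # inspect one example
```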
TL;DR We create and release FrenchToxicityPrompts, a dataset of French prompts for evaluating toxicity in language models.
Link to the dataset: https://download.europe.naverlabs.com/FrenchToxicityPrompts/
Link for the paper: https://aclanthology.org/2024.trac-1.12/
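An illustrative sketch of how a toxicity-prompts dataset is typically used: feed prompts to a language model and score the continuations with a toxicity classifier. The model names below are placeholders, not the models used in the paper, and the toy prompt is not taken from the dataset.

```python
# Illustrative use of toxicity prompts: generate continuations and score them.
# Model names are placeholders / assumptions, not the paper's setup.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # placeholder LM
toxicity = pipeline("text-classification",
                    model="unitary/multilingual-toxic-xlm-roberta")  # assumed multilingual toxicity classifier

prompt = "Franchement, les gens de ce quartier sont"   # toy prompt, not from the dataset
continuation = generator(prompt, max_new_tokens=20)[0]["generated_text"]
print(continuation)
print(toxicity(continuation))
```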
TL;DR We investigate the impact of LLM compression on three aspects within QA tasks: (i) model confidences, (ii) calibration error, and (iii) predictive entropy.
Link for the paper: https://arxiv.org/abs/2405.00632
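For reference, the two uncertainty-related quantities named above can be computed directly from a model's output probabilities; here is a minimal NumPy sketch using a standard 10-bin formulation of ECE, which is not necessarily the exact setup of the paper.

```python
# Minimal sketch of predictive entropy and expected calibration error (ECE).
# The 10-bin ECE below is a standard formulation, not necessarily the paper's setup.
import numpy as np

def predictive_entropy(probs):
    """Entropy of the predictive distribution, per example."""
    probs = np.clip(probs, 1e-12, 1.0)
    return -np.sum(probs * np.log(probs), axis=-1)

def expected_calibration_error(probs, labels, n_bins=10):
    """Bin predictions by confidence and average |accuracy - confidence|."""
    confidences = probs.max(axis=-1)
    accuracies = (probs.argmax(axis=-1) == labels).astype(float)
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            ece += mask.mean() * abs(accuracies[mask].mean() - confidences[mask].mean())
    return ece

probs = np.array([[0.7, 0.2, 0.1], [0.4, 0.5, 0.1]])
labels = np.array([0, 2])
print(predictive_entropy(probs), expected_calibration_error(probs, labels))
```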
TL;DR We propose a novel approach to mitigating bias in text encoders that tackles bias directly in the latent space onto which documents are projected, making our approach applicable to any text encoder or decoder.
Link for the paper: https://aclanthology.org/2023.emnlp-main.978/
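To illustrate what latent-space debiasing can look like in general (this is a common baseline, not the method proposed in the paper), one can estimate a bias direction in the embedding space and project it out of every document representation.

```python
# Generic latent-space debiasing baseline: project a bias direction out of
# document embeddings. This is NOT the method proposed in the paper.
import numpy as np

def debias(embeddings, bias_direction):
    """Remove the component of each embedding along a bias direction."""
    d = bias_direction / np.linalg.norm(bias_direction)
    return embeddings - np.outer(embeddings @ d, d)

rng = np.random.default_rng(0)
docs = rng.normal(size=(4, 8))   # toy document embeddings
bias = rng.normal(size=8)        # e.g. difference between group-attribute centroids
residual = debias(docs, bias) @ (bias / np.linalg.norm(bias))
print(np.abs(residual).max())    # ~0: no component left along the bias direction
```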
TL;DR We offer empirical insights into bias within the inner layers and attention heads of BERT and compare these findings with results obtained for DistilBERT.
Link for the paper: https://link.springer.com/chapter/10.1007/978-3-031-30047-9_20
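A minimal sketch of how layer- and head-level quantities can be extracted with Hugging Face transformers for this kind of analysis; the bias metrics computed on top of them are specific to the paper and not reproduced here.

```python
# Extract per-layer hidden states and per-head attention maps for BERT and
# DistilBERT; the bias analysis built on top of these is specific to the paper.
import torch
from transformers import AutoModel, AutoTokenizer

def layer_and_head_outputs(model_name, sentence):
    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name,
                                      output_hidden_states=True,
                                      output_attentions=True)
    with torch.no_grad():
        out = model(**tok(sentence, return_tensors="pt"))
    # hidden_states: (num_layers + 1) tensors of shape [batch, seq_len, hidden]
    # attentions:    num_layers tensors of shape [batch, num_heads, seq_len, seq_len]
    return out.hidden_states, out.attentions

for name in ("bert-base-uncased", "distilbert-base-uncased"):
    hs, att = layer_and_head_outputs(name, "The doctor said she would arrive soon.")
    print(name, len(hs) - 1, "layers,", att[0].shape[1], "heads per layer")
```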
TL;DR We analyse the layers’ contribution to rational model decision-making in terms of performance and fairness.
Link for the paper: https://hal.science/hal-04104840/document
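As a toy reminder of the kind of per-layer quantities such an analysis tracks, the sketch below computes accuracy together with a simple demographic-parity gap over a protected attribute; the exact fairness criteria and probing setup used in the paper may differ.

```python
# Toy performance + fairness measurements; the paper's exact criteria may differ.
import numpy as np

def accuracy(preds, labels):
    return (preds == labels).mean()

def demographic_parity_gap(preds, groups):
    """Absolute difference in positive prediction rate between two groups."""
    rates = [preds[groups == g].mean() for g in np.unique(groups)]
    return abs(rates[0] - rates[1])

preds  = np.array([1, 0, 1, 1, 0, 1])   # predictions from one layer's probe
labels = np.array([1, 0, 0, 1, 0, 1])
groups = np.array([0, 0, 0, 1, 1, 1])   # protected attribute
print(accuracy(preds, labels), demographic_parity_gap(preds, groups))
```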
TL;DR We introduce SMaLL-100, a distilled version of M2M-100 (12B), a massively multilingual machine translation model covering 100 languages.
Link for the paper: https://arxiv.org/abs/2210.11621
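A minimal sketch of translation with the M2M-100 family through Hugging Face transformers, using the public facebook/m2m100_418M checkpoint purely for illustration; SMaLL-100 ships with its own checkpoint and tokenizer, so refer to its model card for the exact loading code.

```python
# Translation with the M2M-100 family via transformers. The 418M checkpoint is
# used for illustration only; SMaLL-100 has its own checkpoint and tokenizer.
from transformers import M2M100ForConditionalGeneration, M2M100Tokenizer

tokenizer = M2M100Tokenizer.from_pretrained("facebook/m2m100_418M")
model = M2M100ForConditionalGeneration.from_pretrained("facebook/m2m100_418M")

tokenizer.src_lang = "fr"
encoded = tokenizer("La vie est belle.", return_tensors="pt")
generated = model.generate(**encoded,
                           forced_bos_token_id=tokenizer.get_lang_id("en"))
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```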
TL;DR We show that compressing M2M-100 amplifies biases and hurts performance on under-represented languages.
Link for the paper: https://arxiv.org/abs/2205.10828
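To make "compression" concrete, the sketch below applies one generic technique, post-training dynamic quantization in PyTorch, to an M2M-100 checkpoint; the paper studies compression more broadly, so this only illustrates the kind of model being evaluated, not its exact setup.

```python
# One generic compression technique (dynamic int8 quantization in PyTorch),
# shown for illustration only; this is not the paper's exact compression setup.
import torch
from transformers import M2M100ForConditionalGeneration

model = M2M100ForConditionalGeneration.from_pretrained("facebook/m2m100_418M")
quantized = torch.quantization.quantize_dynamic(model, {torch.nn.Linear}, dtype=torch.qint8)
# `quantized` replaces every nn.Linear with an int8 dynamically quantized module.
# Re-running translation and per-language evaluation on such a model is how
# compression-induced degradation, and its uneven distribution, is measured.
```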