FairBotBench

No Thumbnail Available

Date

2025-07-03

Journal Title

Journal ISSN

Volume Title

Publisher

Hochschule für Technik und Wirtschaft Dresden

Abstract

This paper presents a concept for building a benchmark dataset to systematically evaluate chatbot responses in e-commerce with respect to ethical quality dimensions such as bias, toxicity and personalization. The core approach involves generating neutral base dialogues of varying lengths, which are then expanded into more problematic variants and enriched with linguistic diversity. The data is annotated by humans through a multi-stage process. The resulting dataset is intended to support the development and calibration of ethical AI components. To increase utility an english translation of the dataset is provided.

Description

Keywords

Citation

Collections

Attribution-NonCommercial-NoDerivatives 4.0 International