HAF Toxicity Dataset Evaluator

Select one of the HAF-paper toxicity datasets and a model, then run a HAF-style evaluation. Each row in the table shows the input text and the model's stance (toxic / non-toxic). Use the selector below to reveal the full theory-grounded explanation.

HAF Toxicity Dataset Evaluator

Settings

Results

Theory-grounded explanations