HAF Toxicity Dataset Evaluator

Select one of the HAF-paper toxicity datasets and a model, then run a HAF-style evaluation. Each row in the table shows the input text and the model's stance (toxic / non-toxic). Use the selector below to reveal the full theory-grounded explanation.

Settings

Dataset
Model to evaluate
1 10

Results

Ready.

Theory-grounded explanations

Run an evaluation to see explanations for each example.