Select one of the HAF-paper toxicity datasets and a model, then run a HAF-style evaluation. Each row in the table shows the input text and the model's stance (toxic / non-toxic). Use the selector below to reveal the full theory-grounded explanation.
Settings
Dataset
Model to evaluate
110
Results
Ready.
Theory-grounded explanations
Run an evaluation to see explanations for each example.