INDEX

Explanations

references to unlawful or harsh treatment of individuals

New Auto-Interp

Embeds

IFrame

Link

Not in Any Lists

No Comments

Negative Logits

 Corpus

-0.67

Perfect

-0.66

bern

-0.66

 fallacy

-0.65

agree

-0.64

behind

-0.64

 Leth

-0.62

pose

-0.62

 Fail

-0.61

 Lose

-0.61

POSITIVE LOGITS

 treated

1.07

 teased

1.03

 punished

0.98

 discriminated

0.94

 subjected

0.94

 bullied

0.94

 singled

0.94

 interrogated

0.91

 rewarded

0.90

 ridiculed

0.90

Activations Density 1.688%