INDEX
Explanations
expressions of blame and accountability regarding societal or systemic issues
Negative Logits
istrovství  -0.15
konkrét     -0.15
Wet         -0.15
열          -0.14
sö          -0.14
rek         -0.14
/screen     -0.14
hots        -0.14
PRESSION    -0.14
imp         -0.13
Positive Logits
ernal       0.17
rale        0.17
Bark        0.16
erner       0.14
dane        0.14
Kidd        0.14
.crm        0.14
ulis        0.14
ument       0.14
isses       0.14
Activations Density: 1.079%