Explanations
words indicating criticism and judgment
Negative Logits
Token    Logit
otal     -0.16
adier    -0.15
agal     -0.15
éĤ¦      -0.15
Catal    -0.14
olla     -0.14
cak      -0.14
irl      -0.13
oleon    -0.13
629      -0.13
Positive Logits
Token    Logit
mare     0.16
mort     0.15
ource    0.15
mortar   0.15
άνι      0.14
arter    0.14
GIS      0.14
_TC      0.14
$('[     0.14
utex     0.14
Activations Density: 0.001%