INDEX
Explanations
words and phrases related to pain and burden
Negative Logits
Token               Logit
Nose                -0.17
gratis              -0.16
longleftrightarrow  -0.15
nose                -0.15
935                 -0.15
sut                 -0.15
ãĥĵãĥ¼              -0.14
afia                -0.14
contr               -0.14
phia                -0.14
Positive Logits
Token               Logit
»¿                  0.17
za                  0.17
isize               0.17
ynet                0.17
Kro                 0.17
ZA                  0.16
Pik                 0.16
omen                0.15
SE                  0.15
nowhere             0.15
Activations Density: 0.033%