INDEX
Explanations
instances of emotions and sentiments expressed by individuals
Negative Logits
ONO           -0.17
wor           -0.15
mall          -0.15
_capability   -0.15
wart          -0.15
encv          -0.15
acha          -0.15
cuckold       -0.14
âu            -0.14
ENO           -0.14
Positive Logits
beb           0.15
alker         0.15
reesome       0.14
essel         0.14
compression   0.14
igos          0.14
itness        0.13
termin        0.13
orage         0.13
erson         0.13
Activations Density 0.292%