INDEX
Explanations
words related to emotional or psychological states
Negative Logits
manship    -0.61
minecraft  -0.55
choice     -0.54
offense    -0.54
Admir      -0.54
Cruiser    -0.53
ocular     -0.53
addons     -0.53
Boxing     -0.53
gain       -0.53
Positive Logits
cially   0.86
agog     0.79
llah     0.77
76561    0.72
ciating  0.71
oult     0.68
cial     0.68
cific    0.67
Ö¼       0.67
ruct     0.65
Activations Density 0.019%
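
As a rough illustration of what a density figure like this means, the sketch below computes the fraction of token positions on which a feature is active. The function name `activation_density`, the zero threshold, and the sample data are assumptions made for illustration; the dashboard's actual computation is not given here.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Both the name and the default threshold are hypothetical choices.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 19 of 100,000 token positions.
sample = [1.0] * 19 + [0.0] * 99_981
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.019%
```

Under this reading, a density of 0.019% corresponds to roughly 19 active positions per 100,000 tokens.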