INDEX
Explanations
phrases expressing disbelief or strong negative emotions
expressions of disbelief or incredulity
New Auto-Interp
Negative Logits
aukee
-0.82
exting
-0.76
iHUD
-0.74
incial
-0.72
orage
-0.70
abase
-0.67
tails
-0.66
kamp
-0.66
ãĥīãĥ©
-0.66
´
-0.65
POSITIVE LOGITS
someone
1.17
somebody
1.06
anyone
0.96
nobody
0.90
someone
0.85
anybody
0.83
they
0.81
we
0.79
people
0.77
somehow
0.76
Activations Density 0.115%