INDEX
Explanations
strongly worded expressions or emphasis, such as 'damn' or 'darn'
expressions of strong frustration or emphasis
New Auto-Interp
Negative Logits
NetMessage
-1.10
KY
-0.76
ramid
-0.75
Interstitial
-0.75
KER
-0.72
CRE
-0.71
idon
-0.70
membr
-0.68
cn
-0.68
chn
-0.68
POSITIVE LOGITS
darn
0.86
damn
0.83
selves
0.83
ibly
0.79
damned
0.76
holes
0.73
ation
0.72
kidding
0.71
wit
0.71
nuts
0.70
Activations Density 0.018%