Explanations
expressions of desire or intention
Negative Logits
issance        -0.87
ework          -0.69
icol           -0.69
unny           -0.68
Kings          -0.67
ilian          -0.66
Enhancement    -0.66
bard           -0.66
utical         -0.64
âĨij           -0.63
Positive Logits
anybody        0.96
anymore        0.95
anything       0.95
anyone         0.93
any            0.87
nor            0.86
ANY            0.79
ANY            0.76
spoilers       0.75
wasting        0.74
Activations Density: 0.028%