INDEX
Explanations
words related to desire or willingness
New Auto-Interp
Negative Logits
issance
-0.87
ework
-0.69
bard
-0.69
Enhancement
-0.69
unny
-0.67
icol
-0.66
igmatic
-0.65
ilian
-0.65
Kings
-0.64
utical
-0.63
POSITIVE LOGITS
anymore
0.94
anybody
0.94
anything
0.93
anyone
0.91
nor
0.83
any
0.83
ANY
0.78
ANY
0.77
them
0.74
undue
0.72
Activations Density 0.039%