INDEX
Explanations
phrases related to alternatives or choices
New Auto-Interp
Negative Logits
hips
-0.93
haw
-0.80
older
-0.77
encers
-0.73
Import
-0.73
ahon
-0.72
eding
-0.72
ching
-0.72
bane
-0.70
ffee
-0.68
POSITIVE LOGITS
alternative
1.19
alternatives
1.14
Altern
1.06
altern
0.94
solutions
0.93
Alternative
0.88
atives
0.84
options
0.84
explanations
0.80
alternate
0.77
Activations Density 0.015%