INDEX
Explanations
commands or prompts related to selection
Negative Logits
aps         -0.16
Alive       -0.15
onga        -0.15
nues        -0.15
iliary      -0.14
idir        -0.14
succeeded   -0.14
-selection  -0.14
rescia      -0.13
-defense    -0.13
Positive Logits
ivity       0.27
ively       0.23
ive         0.21
IVE         0.20
ives        0.18
IVITY       0.18
ieve        0.18
iveness     0.17
'gc         0.16
iven        0.16
Activations Density 0.010%