INDEX
Explanations
expressions related to making choices or decisions
New Auto-Interp
Negative Logits
houſe
-0.74
Monfieur
-0.72
poffible
-0.71
ſche
-0.69
Conſ
-0.66
paff
-0.64
ſeveral
-0.63
")");
-0.63
AndEndTag
-0.62
/*
-0.60
POSITIVE LOGITS
decided
0.85
instead
0.82
rather
0.77
opted
0.76
décidé
0.76
решила
0.76
opting
0.75
решили
0.71
chose
0.71
choose
0.70
Activations Density 0.269%