INDEX
Explanations
verbs and phrases expressing needs, abilities, or actions
French/Russian words indicating requests or possibilities
can and must
New Auto-Interp
Negative Logits
Jefus
-0.73
Eſ
-0.70
Monfieur
-0.68
增加了
-0.68
bbene
-0.67
myſelf
-0.66
Efq
-0.66
pleaſure
-0.66
Majefty
-0.66
带来了
-0.66
POSITIVE LOGITS
being
1.07
Being
0.85
Being
0.84
having
0.82
being
0.79
taking
0.73
getting
0.73
BEING
0.72
take
0.67
make
0.66
Activations Density 0.060%