INDEX
Explanations
words with 's' endings, possessive pronouns, and words like 'such', 'both', and 'other'
alternatives and choices
New Auto-Interp
Negative Logits
ſeveral
-0.94
diſt
-0.90
ſtand
-0.87
pleaſure
-0.86
ſta
-0.85
myſelf
-0.84
Majefty
-0.84
ſmall
-0.82
Reſ
-0.82
ſte
-0.82
POSITIVE LOGITS
,
0.68
it
0.53
-
0.52
'
0.52
choisissez
0.49
’
0.49
cima
0.47
оригіналу
0.47
pourrais
0.45
ize
0.44
Activations Density 1.376%