Explanations
key phrases or elements indicating change or transition
Negative Logits
Token       Logit
pre         -0.43
ici         -0.42
che         -0.41
com         -0.41
к           -0.40
Com         -0.39
de          -0.39
k           -0.37
r           -0.37
za          -0.37
Positive Logits
Token       Logit
itſelf      1.36
indeed      1.33
finalement  1.24
inderdaad   1.23
Anſ         1.18
myſelf      1.17
Majefty     1.16
Monfieur    1.15
ſche        1.11
faſt        1.11
Activations Density: 0.267%