INDEX
Explanations
repeated phrases or connectors, particularly variations of "in."
New Auto-Interp
Negative Logits
―――――
-1.50
་་
-1.48
Anſ
-1.46
――――――――
-1.28
iſt
-1.24
itſelf
-1.24
Monfieur
-1.23
ſelf
-1.21
Theſe
-1.13
myſelf
-1.12
POSITIVE LOGITS
en
2.46
EN
1.19
em
1.08
in
1.04
En
0.97
en
0.96
σε
0.92
on
0.91
в
0.88
at
0.85
Activations Density 0.024%