INDEX
Explanations
occurrences of the language code "en," indicating English language content
New Auto-Interp
Negative Logits
―――――
-1.11
་་
-1.09
――――――――
-1.03
Anſ
-0.99
Theſe
-0.97
iſt
-0.97
Monfieur
-0.94
―――
-0.93
^(@)
-0.92
ſind
-0.90
POSITIVE LOGITS
en
2.06
EN
1.90
EN
1.86
En
1.72
en
1.70
En
1.68
Coen
0.90
enn
0.85
enin
0.82
Eno
0.82
Activations Density 0.037%