INDEX
Explanations
dates written in different languages
occurrences of the character sequence "ör"
New Auto-Interp
Negative Logits
Bengal
-0.71
Crusader
-0.68
Virgin
-0.67
Brave
-0.65
birth
-0.65
corn
-0.65
Cavaliers
-0.64
Blacks
-0.63
Mattis
-0.63
Crus
-0.62
POSITIVE LOGITS
ör
1.22
ö
1.21
andom
1.13
ð
1.05
ü
1.03
ön
0.99
ä
0.98
sson
0.96
án
0.90
én
0.88
Activations Density 0.005%