INDEX
Explanations
words containing special characters like "ı" and "ÅŁ"
repeated characters or letters in a specific context
New Auto-Interp
Negative Logits
Appalach
-0.84
arsity
-0.72
Sussex
-0.70
guiActiveUnfocused
-0.69
Spartan
-0.67
Buyable
-0.66
Willow
-0.66
HMS
-0.66
maxwell
-0.65
Indigo
-0.65
POSITIVE LOGITS
ı
1.06
¶
1.01
Ì
1.01
oÄŁ
0.99
·
0.97
ÅŁ
0.95
¾
0.94
Ķ
0.88
ĥ
0.87
Ĩ
0.86
Activations Density 0.010%