INDEX
Explanations
non-standard characters and symbols at the end of words
negative contractions
New Auto-Interp
Negative Logits
RAD
-0.76
çīĪ
-0.68
guiActiveUnfocused
-0.66
Samar
-0.62
scapego
-0.61
mosaic
-0.60
camer
-0.60
Shib
-0.60
guiActiveUn
-0.60
jug
-0.59
POSITIVE LOGITS
£
1.15
¹
0.99
º
0.95
¿
0.95
¼
0.93
Ń
0.91
İ
0.91
¬
0.91
Ķ
0.90
ĵ
0.90
Activations Density 0.148%