INDEX
Explanations
words written using characters from different languages or possibly corrupted text
symbols or characters related to formatting or encoding issues
New Auto-Interp
Negative Logits
Mandela
-0.64
ifying
-0.64
cloaked
-0.63
icity
-0.62
LIFE
-0.62
Perkins
-0.62
Dragonbound
-0.61
ification
-0.61
"]=>
-0.60
confounding
-0.59
POSITIVE LOGITS
ĥ
2.17
Ĺ
1.97
ħ
1.88
Ľ
1.81
ī
1.81
Ļ
1.77
ı
1.76
Ŀ
1.75
ij
1.75
Į
1.72
Activations Density 0.024%