INDEX
Explanations
sequences of characters in a foreign language, possibly indicating text encoding or corruption
symbols or special characters used in text
Negative Logits
ierrez      -0.85
ministic    -0.72
aminer      -0.71
ethic       -0.71
achus       -0.69
ebus        -0.69
ilater      -0.67
lopp        -0.66
ettings     -0.65
ription     -0.63
Positive Logits
¾           1.16
ħ           1.09
Ĩ           0.98
¼           0.96
Û           0.96
µ           0.94
°           0.94
¹           0.93
Ĥª          0.92
Į           0.90
Activations Density: 0.003%