INDEX
Explanations
A specific character or symbol (ł)
special characters and symbols in the text
New Auto-Interp
Negative Logits
auga
-0.85
merce
-0.77
reproduction
-0.72
literacy
-0.69
ufact
-0.69
raints
-0.66
anooga
-0.66
adolesc
-0.65
adults
-0.65
readiness
-0.64
POSITIVE LOGITS
ł
1.29
×ķ
1.09
ा
1.02
ķ
0.93
ãĥ¼ãĥ³
0.92
×Ļ×
0.90
Ö¼
0.87
Ñĥ
0.85
¼
0.83
Ĭ
0.83
Activations Density 0.003%