INDEX
Explanations
unique characters or symbols in the text
New Auto-Interp
Negative Logits
ocu
-0.17
//č↵
-0.16
enna
-0.15
ASTER
-0.14
"title
-0.14
rado
-0.14
à¹ĭ
-0.14
xmm
-0.13
à¥ĩब
-0.13
icks
-0.13
POSITIVE LOGITS
Fr
0.26
-fr
0.25
fr
0.23
/fr
0.23
_fr
0.23
franchise
0.22
kap
0.22
fr
0.22
Fr
0.22
ÙģØ±Ø§ÙĨ
0.22
Activations Density 0.005%