INDEX
Explanations
special characters or symbols in the text
New Auto-Interp
Negative Logits
»-
-0.74
—
-0.71
««
-0.69
—,
-0.69
Porto
-0.69
싶
-0.65
\_
-0.64
hend
-0.63
т
-0.61
$(
-0.61
POSITIVE LOGITS
�
2.20
�
2.20
��
1.63
��
1.60
���
1.35
¿½
1.22
����
0.85
```
0.83
gewel
0.82
تضيفلها
0.81
Activations Density 0.146%