INDEX
Explanations
phrases with the special character "âĢ" followed by a number
special characters or symbols
New Auto-Interp
Negative Logits
pei
-0.73
obscurity
-0.72
descending
-0.69
wagen
-0.67
charm
-0.66
semblance
-0.64
playbook
-0.62
bung
-0.62
division
-0.61
board
-0.60
POSITIVE LOGITS
ł
1.15
¹
1.06
ONSORED
1.05
ª
0.92
£
0.91
ij
0.91
IJ
0.87
Ī
0.85
ĺ
0.85
ı
0.84
Activations Density 0.153%