INDEX
Explanations
words with special characters, potentially indicating names or titles
occurrences of special characters or symbols
New Auto-Interp
Negative Logits
scatter
-0.77
scattering
-0.74
Roc
-0.73
FISA
-0.70
Saga
-0.70
DRAG
-0.67
Dirt
-0.67
Farming
-0.65
osate
-0.65
whirlwind
-0.62
POSITIVE LOGITS
º
1.12
¹
1.06
į
1.01
»
1.00
£
0.98
½
0.93
¼
0.88
¡
0.87
ake
0.87
²
0.86
Activations Density 0.471%