Explanations
This neuron activates on numeric tokens representing decimal or floating-point values.
Negative Logits
подаль        -0.07
initiatives   -0.06
�             -0.06
設            -0.06
perk          -0.06
놓            -0.06
intro         -0.06
์ร            -0.06
dẫn           -0.06
_days         -0.06
Positive Logits
罪            0.07
ины           0.06
Tipo          0.06
lim           0.06
Brasil        0.06
scient        0.06
gift          0.06
Law           0.06
sebagai       0.06
.':           0.06
Activations Density 0.153%