INDEX
Explanations
This neuron activates selectively on numeric tokens and numerical measurements in the text (e.g. decimal values, percentages, units).
Negative Logits
одерж      -0.07
brains     -0.06
YES        -0.06
morally    -0.06
roy        -0.06
ationally  -0.06
_block     -0.06
xpath      -0.06
acity      -0.06
onClick    -0.06
Positive Logits
Ciudad     0.07
هناك       0.06
uses       0.06
Morm       0.06
forme      0.06
buena      0.06
mekan      0.06
енных      0.06
_tF        0.06
�          0.06
Activations Density 0.054%
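
The density figure above can be read as the share of sampled token positions on which the neuron fires. The sketch below shows one way such a number could be computed. It assumes that "firing" means an activation strictly greater than zero, which this page does not state. The function name `activation_density`, the `threshold` parameter, and the sample counts are all hypothetical, chosen only so the arithmetic reproduces a value of the same size.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a position counts as active when its activation is strictly
    greater than `threshold`; the dashboard's own definition is not given.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 100,000 token positions, 54 of which are active.
sample = [0.0] * 99_946 + [1.0] * 54
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.054%
```

A density this low means the neuron is silent on nearly all tokens, which fits a feature that responds only to a narrow class of inputs.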