INDEX
Explanations
biological/scientific research
This neuron activates on numerical values (digits, decimal numbers, and measurements) in the text.
Negative Logits
Token      Logit
-health    -0.07
health     -0.07
-Control   -0.07
Prod       -0.06
fooled     -0.06
yd         -0.06
введ       -0.06
COLOR      -0.06
budete     -0.06
मत         -0.06
Positive Logits
Token      Logit
/autoload  0.06
heaps      0.06
||||       0.06
القرآن     0.06
@register  0.06
ging       0.06
.lon       0.06
+]         0.06
WELL       0.06
şik        0.06
Activations Density 0.104%