INDEX
Explanations
percentages and quantity
The neuron fires on numerical tokens—digits, percentages, or numbers in the text.
New Auto-Interp
Negative Logits
.Private
-0.07
lỗ
-0.06
lecturer
-0.06
.addTab
-0.06
STRICT
-0.06
Monkey
-0.06
Patient
-0.06
uma
-0.06
Brake
-0.06
-/↵
-0.06
POSITIVE LOGITS
dazz
0.07
.clock
0.06
мінім
0.06
لع
0.06
NaN
0.06
벽
0.06
cpp
0.06
�
0.06
점
0.06
ฤ
0.06
Activations Density 0.158%