Explanations
The neuron activates on floating-point number tokens (i.e., decimal numeric literals).
Negative Logits
Token         Logit
061           -0.08
더            -0.07
surf          -0.07
andal         -0.06
surprises     -0.06
列            -0.06
otechnology   -0.06
충            -0.06
trợ           -0.06
anal          -0.06
Positive Logits
Token         Logit
Peygamber     0.07
ísk           0.07
newNode       0.06
Xiao          0.06
víde          0.06
ná            0.06
performan     0.06
#ae           0.06
.yang         0.06
backed        0.06
Activations Density: 0.058%
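
To give a sense of scale for the density figure, here is a minimal sketch that assumes activation density means the fraction of sampled tokens on which the neuron's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions for illustration; the dashboard does not state how the figure is computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of tokens on which the neuron fires.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 100,000 tokens, with the neuron firing on 58 of them.
acts = [0.0] * 99_942 + [1.5] * 58
density = activation_density(acts)
print(f"{density:.3%}")  # 0.058%
```

Under this reading, a density of 0.058% corresponds to the neuron firing on roughly 6 tokens in every 10,000.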