INDEX
Explanations
The neuron activates on floating-point numeric literals (decimal numbers) in the code.
New Auto-Interp
Negative Logits
Thursday
-0.07
kraje
-0.07
enames
-0.06
_elem
-0.06
الأمريكي
-0.06
düzen
-0.06
فت
-0.06
.Guna
-0.06
eceği
-0.06
まま
-0.06
POSITIVE LOGITS
education
0.07
Bab
0.06
plung
0.06
월까지
0.06
redesigned
0.06
mousedown
0.06
ventilation
0.06
translating
0.06
rex
0.06
ce
0.06
Activations Density 0.002%