INDEX
Explanations
The neuron fires on floating-point numeric literals (numbers with a decimal point) embedded in the text.
New Auto-Interp
Negative Logits
hateful
-0.07
;:;:;:;:
-0.07
टन
-0.06
íř
-0.06
ApplicationUser
-0.06
.bpm
-0.06
الیا
-0.06
idae
-0.06
-associated
-0.06
За
-0.06
POSITIVE LOGITS
Loss
0.07
Bahrain
0.07
investment
0.06
impr
0.06
nilai
0.06
]];↵
0.06
"])↵↵
0.06
"path
0.06
MASK
0.06
لیس
0.06
Activations Density 0.010%