INDEX
Explanations
problems or difficulties
This neuron activates on numeric tokens—especially decimal numbers or measurements—in technical/instructional text.
New Auto-Interp
Negative Logits
getContentPane
-0.06
Tiles
-0.06
==========
-0.06
synthesized
-0.06
discrimination
-0.06
approach
-0.06
resignation
-0.06
Join
-0.06
.Click
-0.06
.Do
-0.06
POSITIVE LOGITS
urrences
0.07
рі
0.06
ува
0.06
มห
0.06
дром
0.06
бар
0.06
alara
0.06
всегда
0.06
_nth
0.06
일
0.06
Activations Density 0.026%