INDEX
Explanations
time units
This neuron activates on numeric tokens, particularly decimal numbers.
New Auto-Interp
Negative Logits
/>,↵
-0.07
nces
-0.06
_fun
-0.06
QPointF
-0.06
incel
-0.06
าของ
-0.06
plat
-0.06
konkrét
-0.06
Ans
-0.06
fflush
-0.06
POSITIVE LOGITS
din
0.08
파일
0.07
toxicity
0.07
whisper
0.07
–and
0.07
assist
0.07
Contributions
0.06
Packaging
0.06
.Part
0.06
####
0.06
Activations Density 0.032%