Explanations
SVG code
The neuron never activates on any token; it effectively detects nothing.
Negative Logits
//---------------------------------------------------------------------------↵    -0.07
ỏ    -0.07
arp    -0.06
机场    -0.06
zero    -0.06
maker    -0.06
provoz    -0.06
िकत    -0.06
willingness    -0.06
color    -0.06
Positive Logits
_regions    0.07
_PATCH    0.07
Writers    0.07
.lon    0.06
Claud    0.06
bytearray    0.06
qty    0.06
Truy    0.06
sdk    0.06
appell    0.06
Activations Density 0.001%