Explanations
This neuron strongly activates on occurrences of the token “to.”
Negative Logits
aerobic      -0.06
leans        -0.06
德           -0.06
иск          -0.06
queen        -0.06
<"           -0.06
UIKit        -0.06
.ci          -0.06
/'           -0.06
.unpack      -0.06
Positive Logits
お           0.07
target       0.07
IntArray     0.07
zipfile      0.07
This         0.07
_supported   0.07
flo          0.06
Sicher       0.06
gü           0.06
Algeria      0.06
Activations Density 0.033%
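
The density figure above is most naturally read as the share of token positions on which the neuron fires. The page does not say how it is computed, so the following is only a minimal sketch under that assumption: the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical, not the dashboard's actual method.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (active positions) / (all positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 3 active positions out of 10,000 tokens.
acts = [0.0] * 10_000
acts[10] = 1.2
acts[500] = 0.7
acts[9000] = 2.4

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.030%
```

Under this reading, a density of 0.033% would mean the neuron is active on roughly one token in every three thousand, which fits a neuron tied to a single token.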