INDEX
Explanations
technical content
This neuron does not activate on any token and thus does not detect any particular pattern.
New Auto-Interp
Negative Logits
ISTS
-0.07
ART
-0.06
ート
-0.06
GLE
-0.06
Fire
-0.06
115
-0.06
that
-0.06
achievements
-0.06
Dropbox
-0.06
ioneer
-0.06
POSITIVE LOGITS
___
0.07
uyệt
0.07
.geo
0.06
posterior
0.06
buat
0.06
θεση
0.06
j
0.06
proportion
0.06
lor
0.06
equivalence
0.06
Activations Density 0.031%