INDEX
Explanations
patterns/trends
This neuron never fires (all activations are zero), indicating it does not respond to any particular token or pattern.
New Auto-Interp
Negative Logits
Cost
-0.07
rotates
-0.07
وید
-0.06
Crypt
-0.06
Sources
-0.06
cumshot
-0.06
iable
-0.06
mát
-0.06
Kills
-0.06
Gratis
-0.06
POSITIVE LOGITS
<&
0.07
アイ
0.06
.single
0.06
↵ ↵
0.06
imb
0.06
846
0.06
patterns
0.06
+"\
0.06
للإ
0.06
'*
0.06
Activations Density 0.025%