INDEX
Explanations
This neuron activates on academic citation markers (the bracketed reference tokens like “[@…]”).
New Auto-Interp
Negative Logits
vyrob
-0.07
questi
-0.07
میان
-0.06
unnel
-0.06
trưởng
-0.06
.Cmd
-0.06
งม
-0.06
спроб
-0.06
suspected
-0.06
premi
-0.06
POSITIVE LOGITS
(column
0.07
overflowing
0.06
Happy
0.06
(matrix
0.06
(query
0.06
Quaternion
0.06
Fauc
0.06
الف
0.06
[],↵
0.06
query
0.06
Activations Density 0.009%