INDEX
Explanations
conversation snippets
situations involving relationship dynamics and emotional connections.
This neuron detects speaker-identifying tokens or turn labels (e.g., “Z:”, “O:”, names) marking who’s speaking.
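As a rough illustration of the kind of token the last explanation refers to, here is a minimal sketch that finds turn labels in a transcript. It assumes a turn label is a short run of letters at the start of a line followed by a colon. The pattern `TURN_LABEL` and the helper `find_turn_labels` are hypothetical names introduced only for this example, and the heuristic is not how the neuron itself computes its activation.

```python
import re

# Hypothetical heuristic (an assumption, not the neuron's mechanism):
# a turn label is 1-20 letters at the start of a line, followed by a colon.
TURN_LABEL = re.compile(r"^\s*([^\W\d_]{1,20}):", re.MULTILINE)

def find_turn_labels(text):
    """Return the speaker labels found at the start of each line, in order."""
    return [match.group(1) for match in TURN_LABEL.finditer(text)]

if __name__ == "__main__":
    sample = "Z: Are you coming tonight?\nO: I think so.\nAlice: Great."
    print(find_turn_labels(sample))
```

A line-anchored pattern is used so that colons inside ordinary sentences are not mistaken for speaker labels.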
Negative Logits
CEPTION     -0.07
ise         -0.06
Px          -0.06
strap       -0.06
Ax          -0.06
okul        -0.06
Ax          -0.06
System      -0.06
decoder     -0.06
footwear    -0.06
Positive Logits
іка         0.07
自分の      0.07
ấn          0.06
isOpen      0.06
віт         0.06
_CMD        0.06
سال         0.06
建议        0.06
/Create     0.06
bruk        0.06
Activations Density: 0.064%