Explanations
dynamics
This neuron activates on mentions of a system’s dynamics or time evolution in a scientific/technical discussion.
Negative Logits
kone          -0.06
ケ            -0.06
methods       -0.06
孔            -0.06
석            -0.06
شرکت          -0.06
eos           -0.06
Kir           -0.06
perceptions   -0.06
Kul           -0.06
Positive Logits
_jump         0.07
lassen        0.07
ScreenState   0.06
ops           0.06
_Record       0.06
tiến          0.06
відповід      0.06
опер          0.06
ереж          0.06
_detected     0.06
Activations Density: 0.012%