INDEX
Explanations
This neuron responds to the initial question words in a user’s query—e.g. the “How can I” at the start of each question.
New Auto-Interp
Negative Logits
Overrides
-0.06
ves
-0.06
inserting
-0.06
descriptor
-0.06
_exception
-0.06
834
-0.06
ैं।
-0.06
hes
-0.05
valid
-0.05
gett
-0.05
POSITIVE LOGITS
저
0.06
้ว
0.06
breed
0.06
располож
0.06
__":↵
0.06
MOD
0.06
یدن
0.06
مي
0.06
یک
0.06
impedance
0.06
Activations Density 0.043%