INDEX
Explanations
This neuron fires on occurrences of the word “answer” (e.g. “Answer,” “answering,” “Questions”)—i.e. it detects Q&A or answer segments.
New Auto-Interp
Negative Logits
strr
-0.07
duk
-0.07
ICC
-0.06
mmc
-0.06
Gap
-0.06
envision
-0.06
adoop
-0.06
.setScene
-0.06
ساخته
-0.06
arı
-0.06
POSITIVE LOGITS
answering
0.08
answered
0.08
reassure
0.07
responding
0.06
,but
0.06
ितन
0.06
наг
0.06
,),
0.06
Meaning
0.06
sem
0.06
Activations Density 0.043%